Can AI Crack The Biology Code?

⁠¶ Intro / Opening

00:00

You're listening to Shortwave.

00:26

From NPR. Hey, hey, short wavers. Emily Kwong here with producer Burley McCoy. What's up, Burley? Hey, Emily. Hello. What do you have for us today?

⁠¶ The Challenge of Protein Folding

00:35

Okay, so Emily, today I want to dig into how AI has shaken up the field of protein science, as in the fundamental building blocks of life, proteins. I've heard of them. Yeah. I mean... This is like what you studied back in your scientist days. Yes, yes. I love proteins. Oh.

00:53

We love that you love them. How has AI moved the needle in this field, though? Well, scientists have used it to dig into a problem that protein scientists have struggled with for more than 60 years. And that is, what do these building blocks, of which there are millions, look like? Like their shape? Like their shape, yeah, exactly. And why is that so important? Well, the ability of a protein to do its specific job, so like carry oxygen through your body or turn light into sugar,

01:20

that relies wholly on its unique, complicated shape. So to understand how it works, you need to know its shape. But why can't scientists just run an experiment to determine the shape?

01:30

They can for some proteins, but those experiments can take years and years. And Emily, that's because a scientist essentially needs to take the equivalent of a molecular photo of the protein to map its complicated shape. But getting the protein to cooperate to get that photo, so like to hold still, for example, without falling apart, that can be super tricky. And it could take a grad student's entire PhD program to figure out a single protein. And other proteins were just

01:59

abandoned because they would never cooperate. Proteins sound difficult, honestly. So the challenge is, how do you figure out a protein's shape without running these super tedious experiments? Is this where AI comes in? Yeah, and to give you a sense of kind of how AI has changed the protein game, there's this protein competition that scientists run every other year. Get out. A protein competition? Okay. Yeah. And they've run it for the past 30 years, where groups will basically compete on who

02:32

can accurately guess the most protein shapes. It's like nerd central for sure. We love. And for most of that 30-year history, participants have really only made incremental progress. But in 2020, Google DeepMind used AlphaFold2, that's its AI protein prediction model, and Emily, AlphaFold2 blew the other competition out of the water completely. Wow. Okay. Game changer.

⁠¶ AlphaFold's Breakthrough in Prediction

02:58

And now the Google DeepMind team has taken this AI tool to the next level by expanding it beyond proteins. So today on the show, how scientists have taken a huge step toward understanding the building blocks of life using AI. Plus, how other researchers are using the tech to design brand new proteins, ones never before seen in nature. And how AI could help us solve the biggest problems we face today, from disease to climate. You are listening to Shortwave, the science podcast from NPR.

04:43

Okay, Burley, so scientists, it seems, have been trying to figure out the complicated shapes of proteins for decades to better understand how they work. Why has this been such a complicated thing to figure out? Well, the short answer, Emily, is that there are so many theoretical ways a single protein could fold that it's a big problem to solve.

05:10

So if you unfolded a protein, it would kind of look like a bunch of beads on a long string. Those beads are little molecules called amino acids. Oh, I remember this from biology. There are like 20 types of amino acids. Yep. Each one is a little different. Right. So each one has a slightly different shape and that kind of dictates how that part of the string can be folded up.

05:33

Because proteins often have 100 or more amino acids, you can see how imagining all the ways it could fold would get complicated. Yeah, it just sounds like thousands of different shapes. What, hundreds of thousands of different shapes? Okay, try billions of trillions, Emily. Like, there are theoretically more ways for one single protein to fold than there are stars in our night sky.
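The scale of that number can be checked with a rough back-of-the-envelope sketch. The figure of three possible arrangements per amino acid is a common textbook simplification, not something stated in the episode, and the function name is made up for illustration.

```python
def conformation_count(n_residues: int, states_per_residue: int = 3) -> int:
    """Count the shapes a chain could take if every amino acid
    independently adopted one of `states_per_residue` arrangements.

    The default of 3 is an illustrative assumption, not a measured value.
    """
    return states_per_residue ** n_residues


# A protein with 100 amino acids, as in the example above.
count = conformation_count(100)

# "Billions of trillions" is about 10**9 * 10**12 = 10**21.
billions_of_trillions = 10**9 * 10**12

print(f"possible shapes: {count:.2e}")
print("more than billions of trillions:", count > billions_of_trillions)
```

Even under this simplified assumption the count is far too large to check one shape at a time, which is why a prediction method has to learn patterns rather than enumerate possibilities.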

05:55

This sounds like a glorious nightmare. Right? I'm so curious. Okay, so you said that AI has helped us make some leaps and bounds towards a solution. How does this technology work? So this AlphaFold model is a type of AI called a deep learning program, which is this huge network of data processing points called nodes. And the purpose of this network is to learn and then make predictions based on what it's learned. In AlphaFold's case and other models like it,

06:25

it learns about proteins from a huge collection of protein structures that scientists have been building on for decades from their experimental data. Okay, so the idea is that after these models use all of that carefully gathered experimental data to learn, they can then predict the shapes of proteins they do not know yet. Exactly. Okay. And going back to the protein competition in 2020, how did AlphaFold

06:51

blow away the competition? So they essentially changed the whole architecture of their model. They had been using AI before, but remember the beads on a string analogy? If amino acids are the beads, even if one bead is far from another on the string, when it all folds up, they could be right next to each other. So with AlphaFold 2, the model looked at distances between

07:13

all of the different amino acids and previous knowledge from solved protein structures. Awesome. And the accuracy and speed of the predictions went way up. Okay. I'm assuming that made a huge difference for scientists everywhere studying proteins. Totally. Julian Bergeron, a structural biologist at King's College London, is one of them.

07:34

He studies the tail-like appendage that propels bacteria. So it's called a flagellum, and it's pretty complicated. It's this huge assembly. So it's longer than the bacterial cell itself. It consists of 20 to 25 different proteins, but many of them have hundreds of thousands of copies of that protein. And these huge propeller machines are what give some bacteria the ability to make you sick or build plaque on your teeth.

08:02

So Julian's lab is trying to figure out how these giant machines work, what their pieces look like, and how it all fits together. And so when the AlphaFold 2 model came out, he just had to try it. And I input a sequence. And then a few hours later, I had the model and I was like, oh my God, this just did it. And we'd been struggling with that problem for months, if not years.

08:28

And all of a sudden I messaged my lab and I said, we model everything. And we've had dozens of projects that immediately progressed thanks to this. Okay, so it sounds like overnight, AlphaFold changed the trajectory of his lab. Yeah. But how did he know that using AlphaFold 2 would actually work? Yeah, so the accuracy is super important, right? Especially when you're basing all of your other experiments on the results.

08:57

It's important to note that, like other AI, AlphaFold 2 isn't right 100% of the time, so you can't just take the results at face value. But unlike some other AI, included in the results is a score basically telling you how accurate each part of the structure is. Okay, and are others in the field using AlphaFold too?

09:17

Yeah, so this is something that actually sets AlphaFold apart from other protein prediction AI models. It's extremely user-friendly. So essentially, anyone who works on a protein or even just has a sequence of a protein can plug it in and get results. I talked to Pushmeet Kohli, vice president of research at Google DeepMind, and he told me why it was important for them to make this tool open access.

09:46

leverage AI to accelerate and advance science. Okay, so I'm scrolling through the AlphaFold website, and I'm seeing scientists using this model for all kinds of things. They're working on malaria and cancer research, drug discovery.

⁠¶ AlphaFold3 and Model Limitations

09:59

Plastic-eating enzymes. And last week, DeepMind released a new version, AlphaFold3, which can predict the 3D structure of proteins and other kinds of biomolecules that they attach to. Why are those other biomolecules important? Yes. So I know we talked about how proteins are super important. I love them. But I have to admit, they rarely work alone.

10:25

If we actually want to know how biology works as a whole, we need to understand how proteins work with their partner molecules. So it really gives you a more detailed and more accurate picture of what is happening inside the body, where proteins are not just sort of existing in isolation, they are interacting in a very rich biological space or soup of RNA and DNA and small molecules. And it really sheds light into those

10:53

rich interactions. Now, previous versions of these protein prediction softwares would model where each amino acid was located. But this new version, AlphaFold3, maps things on an even smaller level. So it models where individual atoms are. Wow. So they can predict the structure of multi-protein complexes like the bacterial flagellum or something like proteins in the blood, which attach to iron atoms. That is powerful. Okay.

11:22

What are the limits to AlphaFold predictions? Yeah, there are definitely limitations. Pushmeet says that the model works best when a protein has a single defined structure, but some proteins have more than one shape or they have sections that are kind of flimsy. Think cooked versus uncooked spaghetti. Okay, so the model has, sounds like, some trouble with prediction in some cases, and the results show that? Yeah, so the idea is that these results would say, hey,

11:50

I'm not so confident in this area of the protein, just so, like, users know. And another limitation is that the prediction ability depends on the amount of what's called training data available. So I mentioned that there's a lot of training data for proteins, but some categories have much less training data available. For example, there's much less structural data available for RNAs. Okay, so the prediction is only as good as the data.
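The idea of a model flagging the areas it is unsure about can be illustrated with a short sketch. AlphaFold reports a per-residue confidence value called pLDDT on a 0 to 100 scale, and 70 is a commonly used cutoff for "confident". The function name and the example scores below are hypothetical, made up for illustration rather than taken from any real prediction.

```python
def low_confidence_regions(scores, threshold=70.0):
    """Return (start, end) positions, 1-based and inclusive, of each
    contiguous stretch whose confidence score is below `threshold`."""
    regions = []
    start = None
    for position, score in enumerate(scores, start=1):
        if score < threshold:
            if start is None:
                start = position  # a new low-confidence stretch begins
        elif start is not None:
            regions.append((start, position - 1))  # the stretch just ended
            start = None
    if start is not None:
        # The chain ended while still inside a low-confidence stretch.
        regions.append((start, len(scores)))
    return regions


# Hypothetical per-residue scores for an 8-residue toy protein.
example_scores = [92.1, 88.4, 65.0, 48.3, 55.9, 81.2, 90.5, 40.2]
print(low_confidence_regions(example_scores))  # [(3, 5), (8, 8)]
```

A user could treat the flagged stretches as the "cooked spaghetti" parts of the prediction and avoid basing follow-up experiments on them.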

⁠¶ Designing Novel Proteins with AI

12:21

Exactly, exactly. But Emily. But Burley. There's another way scientists can use AI in the protein world. Okay, what's that? To generate brand new proteins. Ones, like, not found in nature anywhere. Humans face new problems today. And, you know, we live longer. We're polluting and heating up the planet. And it's reasonable to think that with millions more years of evolution, some of these problems would be solved. But we don't want to wait that long.

12:52

So the idea is that we can now create completely new proteins that solve these problems that weren't really relevant during evolution to make the world a better place. So this is David Baker. He's a biochemist and the director of the Institute for Protein Design at the University of Washington.

13:10

And he's been working on proteins for years. He actually developed one of the earlier protein prediction models. His lab has a similar AI program to AlphaFold3. It's called RoseTTAFold All-Atom. But his big focus is designing these brand new... This sounds so futuristic. Like, what kind of new proteins? So far, they've done things like design new protein antibodies, which are important for fighting infections, in this case, to fight influenza.

13:39

They've made something called a switch protein that could be used as an environmental sensor. And they've also made proteins that could help store carbon, which is a huge hurdle for fighting climate change. I think really across, you know, medicine, sustainability, technology, I think there's huge opportunities to transform the current ways we do things with protein design. So these predictive and generative AI models have fundamentally changed the protein science landscape.

14:09

And again, there's definitely room for improving the prediction power, but with what the field has shifted to, like in terms of prediction accuracy and design potential, I mean, it's really gotten this retired protein fanatic, like, missing my science days. Burley, thank you so much for bringing us this big, big story about the little things in life. Thanks, Emily.

14:41

This episode was produced by Rachel Carlson. It was edited by our showrunner, Rebecca Ramirez. Burley checked the facts. Ko Takasugi-Czernowin was the audio engineer. Special thanks to Geoff Brumfiel. Beth Donovan is our Senior Director. And Collin Campbell is our Senior Vice President of Podcasting Strategy. I'm Emily Kwong. Thank you for listening to Shortwave from NPR.

This transcript was generated by Metacast using AI and may contain inaccuracies.
