#183 - OpenAI o1, Adobe vid gen, Reflection 70B, DeepMind AlphaProteo | Last Week in AI podcast

⁠¶ Intro / Banter

Andrey

00:41

Hello and welcome to the Last Week in AI podcast, where you can hear us chat about what's going on with AI. As usual in this episode, we will summarize and discuss some of last week's most interesting AI news. And you can also check out our Last Week in AI newsletter at lastweekin. ai for articles we did not cover in this episode, and also for emails that come with this podcast that has the links. to all the articles, and that is also in the description of the episode.

01:11

I am one of your hosts, Andrey Kurenkov. I, at one point, did a PhD at Stanford. That was now a while ago, and now I work at a generative

Jeremie

01:19

AI startup. It's funny how the PhD starts to get further and further in the background there. It's, uh, yeah, well, I am. So I'm your other co host, Jeremy Harris, uh, co founder of Gladstone AI, AI National Security Company. I am still here. I'm still here, which means, which means, I Not the transitive property. There's, I'm sure, some logical property. This means that my baby isn't born yet.

01:41

So, um, yeah, gonna, gonna be at it for another week or so and probably won't be here, I'm guessing, next week. But, uh, who knows the way these things go. So, excited to be here for now.

Andrey

01:52

Yes, I'm pretty sure this is the last week for a little while you'll be co hosting. So expect to hear John or another of our regular co hosts next week. But, uh, I guess we got lucky because this week is going to be a fun one to talk about. And speaking of this week, I must apologize. I'm also the editor of this podcast as of now. And, uh, if you're a regular listener, you might've noticed the schedule and releases have been a bit off.

02:20

I covered it last week and still haven't kind of gotten back on course. So hopefully you're seeing this episode come out like, you know, a couple of days after we, I'm going to just release three episodes over the next five or six days. So hopefully that'll happen. We'll see. And real quick, as usual, want to do a quick shout out to any feedback or comments. We got a couple new, uh, reviews on Apple podcasts, which is always a fun to see.

02:50

And we got one that titled outstanding AI gem from weightless rebuild, which is a fun name. And as I suggest to keep discussing all the categories, even policy for a well balanced AI diet. I like that phrase. We should start saying we have a well balanced AI news diet in this podcast. And also another podcast, great news and discussion. And this person really appreciates the focus on safety concerns and especially X risk. So good to see that at least some listeners are, you know.

03:27

Appreciative of your particular area of focus. , Jeremy, I'm sure many people agree with us that safety at, at the present day is important. X risk is a bit more niche, but, uh, there are quite a few people who care about that as well.

Jeremie

03:41

Yeah, the, the comment wars continue. So, uh, looking forward to hearing the next barrage of people who say , say the opposite. appreciate you guys listening, uh, nonetheless. And we'll, we'll try to do our best subject to our, our feeble little human primate brains capacities here. Yeah, we're going to keep a well balanced AI diet for you. Yes, that's

⁠¶ Tools & Apps

Andrey

04:00

right. Kicking off with tools and apps, and right away we got the top news story of the week, maybe of the month, and it is OpenAI's Strawberry finally being released. I don't know when the rumor is about to start, it feels like a year ago that we first started hearing about Strawberry, and it is now released, and it is actually called O1 and O1 mini. So we got the release, you can use it, there's a cap, uh, and how much you can use it.

04:33

Uh, and there's a bit of a blog post, not too many details, but we'll get into what they are saying. The gist is this is a new model that is specialized in complex reasoning. So it at a high level is optimized to do chain of thought, as we've mentioned many times. So before it outputs a generation, it sort of thinks through your query. It has a few rounds of reasoning and kind of thinking through what response should be, and only then does it output the response.

05:07

So that means that this model It's quite a bit slower. It will take up to 20 seconds. I've seen 25 seconds to output a response. It is also more expensive, presumably, but on benchmarks such as, you know, math Olympiad questions, advanced coding. PHD level questions. It is beating benchmarks handily. Although as some have pointed out on Twitter, GPT OpenAI compared it to GPT 4, not another like agentic type or chain of thought type system.

05:41

So it's kind of the comparisons right now aren't too clear. If you are trying a model with these kinds of techniques, and these kinds of techniques we've seen for quite a while. But either way, everyone is excited. We finally got Strawberry. It's kind of what people expected in terms of a reasoning model that incorporates some, uh, more search, some more, like, compute time inference, and some more baked in chain of thought reasoning.

Jeremie

06:08

Yeah, I think this is, uh, both It's the stuff we know publicly about it is what we expected, but we don't know a lot publicly about it. So there's been a lot of, um, a lot of questions have been raised about, you know, what exactly is going on here? We know opening eyes said, well, it uses reinforcement learning. It's trained with reinforcement learning. And as they keep saying, it invests essentially more. test time compute, more inference time compute, right?

06:33

So the traditional paradigm, obviously, is you pre train, usually spend hundreds of millions of dollars, and if not billions now, on, uh, on pre training, um, build a giant text autocomplete system. And then, uh, if you, You're going to use a system like chat GPT usually just send it a query and then it does one round of inference and just gives you a response and the idea here is somehow the system is querying itself multiple times a chain of thought prompting is part of this.

06:59

We know that that's in the documentation that opening eyes put together. We also know that from their blog post that there is a reasoning trace that is produced through chain of thought that is instrumental in creating the ultimate output of the system. We know those things. Will we be able to see that reasoning trace when we interact with the system? No. Open AI is not allowing us to see, see that reasoning trace. That's a very interesting strategic decision, which they rationalize.

07:26

Um, they say, basically, look, uh, we'd, we'd love to give you, Uh, access to the chain of thought, um, uh, kind of, um, not prompt, but to the, the whole, you know, chain of thought, the reasoning trace, um, but we won't. And they say that's based on weighing multiple factors, including user experience. Sure. Sure. Um, competitive advantage. Okay. And the option, yeah, and the option to pursue chain of thought. Uh, monitoring now of those.

07:56

I mean, the user experience thing, I think, like, I don't even understand what that's supposed to be, what kind of argument that's supposed to be. Look, you could hide this and have it like, revealable so that it's not distracting users. There are, there are user experience ways to solve for that. Um, the real thing here is clear. I don't think anybody would, would claim otherwise it's competitive advantage, right? They don't want the chain of thought.

08:17

Um, the, the reasoning trace to be available for people to download and then train their own models on. Because one of the big things that's emerging from this is, uh, the training process seems to involve some sort of training on the actual reasoning process. And that implies if you can get your hands on enough of these reasoning traces, then You might be able to replicate to a certain degree, the performance of the model. So that certainly aligns.

08:40

It also aligns with stuff we've heard that from the information, the rumors that open AI is using this model to train Orion, which theoretically would be the next generation of model. Um, basically using the strawberry model or the O one model to produce training data for Orion. And to the extent that that's true, essentially that reasoning trace starts to become a valuable source of training data for the current. Or sort of the next generation of models.

09:05

They really don't want to give away, uh, the goat on that one. So I, I think that's just really interesting. You know, they're making all kinds of, of dissents. Nice, by the way, to see the transparency, right? Most companies would not tell you, well, we're, we're refusing to release this or make it available for competitive advantage reasons. So in fairness, kudos to open AI for doing that.

09:24

Um, I think the flip side is they've made a big deal out of how the reasoning trace is good for safety because it allows you to audit, As they put it, kind of the thoughts of the model. I don't think that's, um, I think that's an oversimplification. Uh, the, the challenge is we'll get to this when we talk about the system card, but the challenge is that these models, uh, have been shown to be able to write, to embed, to hide messages, subtle messages, encoded messages.

09:52

in their outputs that don't look like they're there to humans, but that they can interpret from those models. This is called stenography, essentially embedding, uh, kind of coded information in otherwise benign looking text. And so to the extent that that's true, you might read a reasoning trace from one of these models and be satisfied that, okay, there's no scheming here. There's no, no power seeking indicators here. Um, but, uh, but in fact, there may be.

10:16

And so, you know, to the extent that that's true, you may actually want to expose the reasoning trace for anybody to audit, for users to audit, for academics to audit. And I think that's a bit of a lost opportunity here, and it's really probably down to competitive advantage reading between the lines here. Another thing we learned. So the price point on this model is really interesting.

10:34

It is roughly speaking, it's about five times more expensive inference wise than your sort of soda GPT 4. 0 type model. Um, that's interesting because we've seen people speculate that OpenAI might be cheaper. Think about charging up to 2000 bucks a month for access to potentially the oh one model. Um, that seems pretty implausible to me right now. If the API inference costs are, are just five X, uh, you know, you're looking at 20 bucks a month for GPT four.

11:03

Oh, um, You know, if at least for, for, uh, sorry, the chat GPT subscription, which allows you to access that. You're, that that's, you know, five x more than that is not 2000 bucks a month. So, you know, it remains to be seen how, how the juice will get squeezed outta the lemon there. Um, last thing I'll just mention for now is there are two, uh, versions of this model. There is the oh one preview, uh, which is a larger, uh, model. It's the largest version available to us now for testing.

11:30

And then there's the O one mini, which is a smaller, more compressed version of this model. Both have super impressive performance. Um, like the numbers here are, are pretty wild. Like at least if you take the, uh, the numbers that opening eyes choosing for marketing purposes to advertise, which I think do have a lot of merit to them. And I've seen people do really impressive things with the model already, but, uh, just for context. Like. 89th percentile on competitive programming questions.

11:55

That's code forces. So basically, you know, better than roughly 90 percent of humans on, on those competitive programming questions exceeds human PhD level accuracy on GPQA. We've talked about that benchmark a lot. That is, it's like graduate level. QA questions in physics, biology, and chemistry. That used to be, that used to be a key benchmark that people would say, okay, you know, this is a serious thing.

12:20

If we're going to start to think about, um, uh, you know, human competitive level capabilities, uh, also 78. 2 percent on MMMU, this interesting multimodal benchmark that has been a challenge for models in the past. I think it's especially interesting. This model was trained Uh, it's reasoning process was trained in through text modality alone. And yet here it is blowing other models out of the water on a multimodal benchmark.

12:45

So that reasoning capability, even though it's only native to text, even though it's only playing out in the text domain, seems to lead to genuinely more impressive capabilities. And by quite a bit on multimodal on, on image analysis, for example, another multimodal benchmark. So I think this is all really, really interesting. This is a genuine kind of.

13:04

Uh, major improvement and the big, big take home, I think for anybody looking at this is the scaling laws we used to talk about scaling laws for pre training, right? How you train on more data with more computing power, you'll get a reliable increase in at least next word prediction accuracy out of your model and how that translates into concrete capabilities that we care about has been ambiguous, but at least you get more intelligence out of your system. It's it.

13:27

It seems as you scale, um, we now have robust scaling laws for inference time. So opening, I has sketched this out. They've been ambiguous about what exactly the X and Y axes are about here, but it is clear that there are robust, um, test time compute scaling laws. And this now means if you're going to argue. against the scaling laws and the, the, the fact that they're going to continue to work. You now have to argue against not one, but two separate and mutually compounding scaling laws.

13:55

So I think this is a big boon for the scaling crowd. I think it supports a lot of opening eyes underlying thesis remains to be seen what happens with Orion with the next generation of models. But right now, I mean, I think it's fair to say things are set to accelerate.

14:07

And we're no longer, interestingly, necessarily going to have to wait for the next generation of compute hardware to come out, because it's not necessarily just about now scaling the massive scaling of, um, of global compute resources needed for training. You can also direct at inference time if you really want to a lot more. Uh, computing power to solve tough problems. So I think this is all super interesting.

14:29

This is a genuine, I think this is the, the, the breakthrough of the quarter at least. Uh, and, uh, we'll be, we'll be hearing a lot more about this technique going forward.

Andrey

14:36

Right. I think as, as usual, in some sense, OpenAI is kind of like Apple in a way where none of this seems particularly new, right? We've talked about Research papers where you do reinforcement learning to improve your reasoning. DeepMind has done a lot of that. We've seen, uh, agentic frameworks that do chain of thought and do iterative outputs.

14:59

All of these ideas have been out there in the ether and what OpenAI does really well, that they've done with Sora, they've done with Dali, even, you know, GPT, the original GPT is take an idea or a set of ideas that are out there and put them together and. You know, put in the work for computer engineering to really make it work. And I think the highlights you mentioned are spot on. To me, this is another kind of indicator that this is part of a standard agentic AI workflow now.

15:31

And, uh, you're gonna see this regularly in anything that tries to be you know, capable of complex reasoning, complex task execution, and so on. A couple other notes, this is not capable of function calling yet, which is a little surprising. You would think it could query, let's say, a database, or it could, uh, do some complex calculations. We have a calculator, not possible yet, which I think will really limit it. It's ability to do a lot of problems.

16:00

And to me, the highlight or the most surprising bit is actually that, uh, detail about not providing the reasoning traces, the, uh, parts of the output, parts of what the model computes as part of its, uh, process that will not be provided to you either as a developer, via the API or as a user. On a user level. Okay. That's not necessarily the worst thing, but as a developer of an AI application who uses the open AI API, this is a big, I would say problem.

16:37

Like, I don't think many developers would be happy with kind of hidden behavior that you cannot, uh, debug. You cannot understand that you cannot really work with. And, um, you know, that's on top of these still. At the cost, this is going to be part of that, uh, 60 per 1 million token output. So you're being charged more for the output and it's outputting more tokens per input. Uh, yeah, there's a, there's a lot that I think this implies for engineers working these kinds of systems.

17:13

I think there's a real kind of question of trade off where you can approximate this with implementing these kinds of tricks, which kind of thought prompting iterative reasoning. And there's a real question to me that is not exposed in the benchmarks as to how well you could do if you just did those kind of known techniques with GPT 4. 0 with cloud 3. 5 and not do the fine tuning that is also part of this as we don't know too much about.

Jeremie

17:41

Yeah, I think that those are really good callouts as well. You know, a couple of the other, the kind of technical limit limitations, if you are a developer, it also doesn't support system prompts, um, temperature. So you don't have to have control over that. Um, it doesn't have structure prompts and so on. So you're right there, there are a bunch of things bundled into this. Um, they are, they're framing this as kind of a tentative launch, um, in fairness.

18:03

So, you know, we might expect some more developer friendly things to appear later, but I think you're absolutely right. One thing I don't expect to change in the near future is the availability of the, um, the reasoning trace, just because, you know, just based on everything we're hearing about the next generation of models. That seems like it's very, very relevant to their strategy at this point. Like it's in active use.

18:26

So, um, yeah, they're in between a rock and a hard place maybe with that approach. We'll, we'll see if it works out. I mean, to some degree, right. When we think about the opacity of these reasoning traces, it's also the case that I don't want to take this too far, but AI developers currently are okay with An unauditable reasoning trace happening at the level of the model itself. So there is reasoning happening internal to the LLM when you just do like a one shot prompt and just get a response.

18:55

You know, we, we are getting comfortable packaging, you know, those outputs and then using them in some cases in relatively high stakes scenarios. Um, but this is a qualitatively new kind of thing. Like there are multiple steps of reasoning happening here. Each independent one could fail. Um, you think about what happens if you do eventually give these things access to tools like that's a, that's a potential issue.

19:17

So yeah, I mean, I, I think it's, um, it's going to take a while for developers to get confident, you know, with like what exactly the risk profile of this does look like. Um, but, uh, yeah, I mean, it's, we're just gonna have to see, we've, we've seen People do a ton of interesting stuff with these systems.

19:34

Um, the, the other, by the way, unfortunate thing from a developer standpoint, man, the fact that opening eyes saying, um, that the upside of the reasoning trace is that it makes it more auditable is that it makes the models thinking more auditable. Uh, Yeah. It also makes it more steerable. That's auditability and steerability that you are taking away from the developer.

19:54

You know, to your point, Andre, every time they make that argument, that's kind of like, yeah, well, that's value you're taking away from developers and to some extent end users as well. So yeah, I mean, I think it's, um, it's going to be interesting to see if this remains the status quo going forward. They may try to change it up, but one last thing we're flagging too, this is not a panacea. This is not a model that will solve every problem under the sun.

20:17

It's in fact, not even a model that beats GPT 4. 0 across all benchmarks. What you tend to find is it does really impressively in like math, calculation, data analysis, computer programming type tasks that require a lot of logical work and reasoning. But But when you look at things like personal writing, editing text, these are benchmarks where you actually don't see an uplift relative to GPD 4. 0.

20:39

In fact, you see a drop in performance in at least the case of personal writing relative to GPD 4, uh, GPD 4. 0. Um, and, and that's just because it's a model specialized for sort of multi round inference for logical problem solving, that sort of thing. Whereas if you think about personal writing, right, these writing tasks, that's very much the natural domain of a language, a pure language model. Right? Like the text autocomplete, the RLHF, all that stuff.

21:04

And so, um, yeah, I mean, I think for those reasons, this is a, it is a perito improvement in the capability frontier. Like overall, this is just a better model, but you will see locally some areas where in fact you may even be better off qualitatively going with a GPT 4. 0 or another kind of model. So, you know, there are, there are even caveats of level of raw capability here.

Andrey

21:24

And one last, uh, notion from me on this as well, and we'll get back to this later with a system of cards. So there's going to be a lot of strawberry talk in this episode. But uh, one thing worth noting is to be fair, open AI in some sense is learning from the past where we have seen. now. Multiple examples of people using chat GPT outputs to create synthetic data for training. This seemed to be the case with grok, uh, and certainly probably with many other products.

21:56

So as far as competitive advantage, that is a very real reason to do this. Although I don't, I don't think the other justifications make much sense. Onto the next story, we got video generation once again. So, this time it's Adobe and it is saying that video generation is coming to Firefly this year. So, Firefly is their text to image model that they've been providing as something that, you know, users of Adobe products of Photoshop, et cetera, can use.

22:32

And notably they say trained on only licensed data, no copyright concerns, et cetera. And here they're saying that in a couple of months you will be able to do video generation and it will be available on the Premiere Pro beta. Premiere Pro is their video editing software. It is, uh, you know, one of the industry standards as far as video editing. And they are saying that free features, generative extend, text to video, and image to video will be, uh, public soon, currently are in private beta.

23:07

So in a sense, probably not too surprising that they are going ahead and working on this, certainly, and that they are approaching a release. I'm not surprised that Adobe is taking its time because, you know, this is not a demo. I would imagine this is like an actual release for users of their products. And so you need it to be sort of ready for industry professionals, et cetera.

23:33

Uh, we will see it's, it's hard to imagine that it'll be as high quality as other things, but I would not be surprised if, At least generative extend that will be quite good, given they probably do have a lot of data to train with.

Jeremie

23:48

Yeah. And apparently this is their touting controllability is the, the area they think they can differentiate themselves and, you know, just giving you access to various editing tools that can compliment existing workflows, kind of integrating video generation into those workflows. Um, and, um, yeah, they've, they've said that apparently Well, maybe not too surprisingly, um, this sort of thing is one of the more commonly requested features on their platform.

24:12

They don't disclose the price, so we don't know how much they'll cost. Um, and they're, they're erring on the side of caution as, as the article puts it, um, when it comes to the safeguards implemented on the system. So they're going to have blocks around generating videos with Nudity, drugs, and alcohol, it's not going to be trained on public figures like politicians and celebrities.

24:30

That's in contrast to other companies like Black Forest Labs that, you know, will allow more than anything goes, um, policy on that stuff. But as ever with this sort of video generation stuff, I think the key number to look out for is The, the cost and, and, um, speed of inference on let's say, um, uh, kind of practically business scalable offerings. When do we get to, you know, one second of generated video per second watched, right?

24:58

When do we get to the point where basically these videos can be generated as fast as you can watch them? Um, I think that's, that's going to come pretty soon. And that's going to position a lot of these companies in a really interesting way you. You know, interacting in real time with some of these videos.

25:11

So I do keep beating that drum because when you think about what the next big qualitative shifts are in the generative AI space, I think that has to be one of the key ones that we think about, you know, that for video and for audio, though, arguably we're already there with audio.

Andrey

25:27

On to lightning round, just a couple of new stories. First one is on fropic launches, Claude enterprise with more security and admin controls. So there you go. Following up on opening, I doing this, I think a few months ago. Now there is a Claude enterprise offering and it has the usual kinds of enterprise things, security, admin controls, probably sort of observability, role based access, audit logs, and fine grained permissions, all of these sorts of things. So, uh, yeah, not surprising.

26:03

And once again, enterprise is where the money's at, uh, not really consumer usage. So, um, it does seem like as we will talk later that OpenAI has been having quite a bit of success with business usage and, uh, It's hard to say, actually, I'm quite curious as to how competitive Anthropic and Cloud is with OpenAI. They've been trying to really make inroads. And, uh, certainly the quality of Cloud, many believe is maybe surpassing GPT.

26:35

But as far as a business, as far as competitive advantage, it's, uh, Kind of hard to say right now.

Jeremie

26:41

Yeah. And I mean, you can see why anthropic might want to err on the side of enterprise sales. Um, when it comes to, so first of all, if you go direct to consumer, like brand recognition is King, right? Nobody, nobody really knows about anthropic in the general population. Everybody knows about chat GPT. They may not even know about open AI, but they know about chat GPT. Um, I shouldn't even say everybody knows any, anybody who's like kind of touching the tech world.

27:05

So you're much more likely to see kind of user or a consumer level use. Uh, from a platform like chat GPT than you are from Claude just because of the brand recognition. Um, this makes it much more natural for Anthropic to focus on the B2B enterprise sales side. The other thing that does too is their focus on alignment. And this has been a differentiator for Anthropic for quite some time.

27:27

You know, the ability to have, for example, models that hallucinate less, um, that are maybe a little bit more Hedged in the responses more cautious in business applications. That's the sort of thing that you might just want to avoid lawsuits to avoid overtrust and things like that. So, you know, maybe makes a natural area where anthropic might think that they can, you know, maybe squeeze out a little bit more competitive space relative to open AI.

27:49

But yeah, we'll, we'll see what the uptake looks like in the long run with this and their other enterprise plays.

Andrey

27:54

And the last story for the section titled tell Repl. it's AI agent your app idea and it'll code it for you. So Repl. it, the company that has an AI powered software development and deployment, uh, tool that is kind of pretty widely used. Repl. it is very high valued. They've been around since 2016. They have now released this AI agent. It is available in beta to Repl. it subscribers, and it is meant to be able to implement much more complex software, essentially.

28:31

So you tell it, you know, implement me an app that does X, Y, Z. And as with any other agent, it will reason through a task, create its own steps to complete it, do each one, each step one by one, probably do some testing, debugging, et cetera, et cetera. So this is very much what all of these coding companies are going for, being able to do much more complex coding tasks, generally by going agentic, going to an agent approach where you have multiple steps of reasoning, execution, et cetera.

29:07

And it'll be interesting to see how well this performs, what people are able to build with it, because we haven't quite gotten to a point where agents are doing very impressive or very useful things. A lot of people have been working on it and it feels like we're close.

Jeremie

29:25

Yeah. Yeah. I feel like in the last even couple of weeks, we've hit that tipping point. Um, I suspect that when you started to look at models like a one, you're, you're really going to see that happen. And, and frankly, I mean, I think that does come with a seismic shift in, um, in the software engineering job landscape. I mean, eventually, you know, this sort of thing will happen. Um, but, uh, but you know, I'll just, I'll just plant that flag there as a bet, I guess. But when it comes to Repl.

29:49

it itself, yeah. They do have, uh, they would certainly argue, and I think this is fairly accurate. They do have an interesting comparative advantage in this market in terms of just data. So they're one of their big features on the platform is this bounty service where you can go and request software projects to be built by developers on the platform. And the way this is set up is that like the bounties are a plain English description, uh, that. Of, of the product that you want built.

30:14

And that's exactly the kind of prompt that you might wanna feed to an agent, right? Like a rep agent, um, to, to get it, to build these kinds of products. And so, uh, what you've got here is an interesting case where this company, um, is not only gonna be in a position to focus on code completion type stuff, right? This sort of thing we see with GitHub copilot and other, other similar products, uh, because Repli. Essentially has data that covers the entire software development life cycle, right?

30:40

The whole stack scaffolding, code writing, debugging, deployment, all that stuff, um, and all these things carefully logged and explained by human beings, right? This is if you're talking about opening. I not wanting to release the reasoning traces. Well, this is the equivalent of a at least human derived reasoning trace for a lot of these problems. Um, and importantly, over long horizons, right? This is like long horizon thinking. Okay. And so I think for that reason, Repl.

31:05

it, if you're thinking about the kinds of companies that might be able to make a reasonable play on the software engineering automation front, they start to look really interesting. I know Amjad Massad, the CEO of Repl. it is big into the sort of AGI picture. He's, he's, I don't know if he stopped short, actually calling Repl. it an AGI company, but he certainly, Pitched in recruitment that, Hey, Hey, we like, we think we have a path to AGI here that others don't.

31:32

And so this is a, you know, to some degree, a pretty differentiated strategy and we'll, we'll see if it plays out. But, um, yeah, full, uh, full life cycle automation is at least, uh, in the crosshairs with replic.

⁠¶ Applications & Business

Andrey

31:43

And onto applications and business. And once again, we're going to be starting with OpenAI. OpenAI fundraising said to increase their valuation to 150 billion. I think just last week, we talked about it being 100 billion. Apparently now, It will be rising to a hundred and fifty billion inflation. Crazy. I know. And they, they are valued as of the last round at 86 billion. And that is at, you know, free a few billion worth of revenue as far as we know, you know, not even profit.

32:22

So that is already a pretty big ratio of revenue. To valuation, um, not huge for the tech space, but certainly pretty significant. 150 billion is a pretty big jump. And I will say it's interesting to see this happening, given that there is quite a bit of competition, right. From Google, from Entropiq, uh, and from Mistral, for example. So it's not like every only player. Uh, capable of creating these very advanced, uh, models. Uh, so to me, a bit of a surprise, but OpenAI certainly is,

Jeremie

33:00

uh, I guess doing well right now. And there's, uh, I mean, the, the round itself is really interesting, both in terms of the people participating, you know, we, we talked about this before Thrive Capital looks like they'll be leading it. Um, that still is the case. Bloomberg had talked about that. Um, apparently Microsoft, um, Apple and video all talking about investing.

33:19

There's a, an article from the information that came out like 20 minutes ago, uh, where apparently opening, I was in talks with a UAE investment fund for 7 billion. Now it's unclear whether that's linked to this right now. I mean, it's a 6. 5 billion fundraise. So it's possible that that's the, you know, the roughly 7 billion they're referring to in which case.

33:36

Maybe it's a UAE fund that's just jumping in, a bit unclear, but that would have all kinds of interesting national security implications. Um, the other thing too is, besides this question of the straight up fund raise, 6. 5 billion, um, at a pre money valuation of 150. So post money, you know, 150. 156. There's this other piece where opening eyes talking to banks to raise five billion dollars in debt in the form of what's known as a revolving credit facility, basically just a form of debt.

34:06

Um, a bunch of companies do this sort of thing just before going for an IPO. Right. Now I'm not saying this means that there's an IPO that's going to happen here, but if you look historically, you know, like Facebook back in the day, when it IPO, they, they mentioned in the article, actually Alibaba, Uber, DoorDash, this is a common move, in part to strengthen banking relationships in advance of an IPO. And so it does not mean again, that an IPO is, is incoming.

34:35

It's interesting to note though, that this is exactly the sort of thing that would happen if you were thinking about an IPO being, you know, incoming and 150 billion. That's a, that makes OpenAI one of the highest valued privately held companies on planet earth. Now, that means that if you're going to keep fundraising, you're Pretty soon, yeah, you're going to have to hit the public markets like that.

34:55

It's very hard to raise a lot more money than that without tapping the public markets that itself raises all kinds of questions. Like, I mean, I don't know, like is open AI actually positioning itself for an IPO or is it not? Um, if it is, that would certainly seem to be at odds with their kind of historical position on all of this, right? Like they're, AGI for the benefit of humanity.

35:18

That is at odds with giving away Transcribed Control in a meaningful sense of the company to for profit entities. Um, and so this is where you get into the whole, you know, the rumors that have been circulating about open AI trying to restructure itself to make itself more attractive to investors potentially by lifting their, uh, capped for their for profit cap. Uh, they're, they're essentially the returns that they're willing to offer to investors. Um, so yeah, it's all pretty blurry right now.

35:43

Um, one other thing we do know though, is there has been an internal memo at open AI that has told employees that look, we're, we're aiming to let you guys sell some of your shares in a tender offer later this year. So at that stage, there will be a new valuation at which those shares can be sold. Um, so sort of interesting again, you know, not inconsistent with a last opportunity for liquidity, uh, before an IPO. I don't want the take home message to be Jeremy's saying OpenAI is going to IPO.

36:08

I don't know that. I suspect, I would guess they're not, but this definitely looks like the kind of thing you would do if you were headed in that direction. So bit odd. We'll probably learn more soon.

Andrey

36:19

Yeah, to be certainly interesting IPO. One of the reasons you would want to do it is to raise cash, right? And Given that they're completing this round right now with many billions of dollars and they are going to take on debt, you would assume that they would have plenty of cash to do things.

36:38

Then again, you know, given what we know about scaling, given their ambitions to, you know, do stuff like build fabs, Manufactured chips, uh, all of these kinds of things, it would be, you know, weird, unusual, but not surprising if they just burn through like 10 billion of our money and we'll need another 5 10 billion very soon. And one more story about the OpenAI business, we have a bit more detail as to how they're doing.

37:11

In particular, we know that they have hit 1 million paid users for business versions of ChadsGPT. So that's the ChadsGPT team and enterprise services and ChadsGPT EDU, which is used by universities. Uh, this, these are all relatively new. Chair GPT teams are for smaller companies, was introduced in January. Uh, Chair GPT EDU was, I believe only a few, uh, months ago and, uh, looks like, uh, it's growing pretty rapidly is, is my takeaway here.

37:49

Uh, it seems that just under half of the corporate users are in the U S and it is also popular in Germany, Japan, and the UK. Okay. Which I guess is not too surprising.

Jeremie

38:00

Yeah, not, not a ton of information in the article itself, other than, yeah, those geographic distributions and that sort of thing. Um, but you know, I guess, I guess we'll see. And all of this obviously is informing the valuation at which OpenAI is, is raising, right? So that 150 billion presumably means that they're getting some traction.

38:19

Um, also implies, you know, investors are potentially going to get an early glimpse at, You know, the strawberry model or whatever Orion looks like right now, whatever else is cooking. So, you know, all of these things would suggest some level of success. Once you're at 150 billion, like the reality is you could, if you're a small company, you can get away with raising. On absurd multiples relative to your actual revenues, right? Like you can be a, you know, pre revenue and raise it 20 million.

38:46

That's pretty standard in Silicon Valley, but as your valuation rises, there is an expectation. Okay. You know, you should be generating real revenue now. Um, bit different for AGI companies because so much of the potential values in the future, but, uh, I would guess that there is some actual growth here, even though the numbers aren't forthcoming in this article, um, you know, backing that again, that fundraise that we talked about.

Andrey

39:06

Right. And then there was another article again, for me, information, which releases a lot of, uh, kind of insider and, and yeah, tech news. Uh, the article said that, uh, according to the opening, I see, Oh, Chad, GPT has surpassed 11 million paying subscribers. So not just business, but total subscribers that would mean, uh, that, you know, there's 10 million. Yeah. paying subscribers outside of teams.

39:34

And that to me, that's conservatively, they're generating 225 million in revenue per month, uh, you know, close ish to 3 billion per year. Uh, and likely more than that, given that businesses pay quite a bit more. So they are generating quite a bit of revenue already. Uh, and probably more than any, any other AI company by a good margin.

39:57

Onto the lightning round, we begin with CHIPS and TSMC, and the story is that TSMC Arizona, their new fab that they are building, has achieved similar production yields to their fabs in Taiwan, according to a report. So this facility began trial production in April of 2024, They were initially set to begin full production in this year as well, but it was delayed to 2025 due to a shortage of skilled labor. Kind of interesting.

40:33

But achieving, uh, similar production yields to their fabs in Taiwan is very significant because That is, as Jeremy often says, like the key thing you need as a chip fab. TSMC is exceptional at this. That was one of their competitive advantages early on as they were starting out is they really optimized the yields. And, uh, you know, it's hard when you build a new fab, that's one of the issues you encountered is the yields are lower due to various errors in processes, right?

41:08

When you manufacture a chip, if you have like a bit of dust that falls on the chip, that's a problem. So there's a million different ways that the yields could be lower and them getting it to this level of performance is a problem. Pretty notable.

Jeremie

41:24

And this is their, their fabled fab 21 Arizona facility. Right. So what, by the way, some, some of the context that's relevant here too, is the, the chips act that poured like 50 billion into domestic us chip production, um, is, is after, after You know, strategies like this, they, they want wins for political reasons and for national security reasons. And so this is really going to bolster, uh, the case for the chips act. It's, it starts to look really like a success case.

41:51

Um, yeah, you can basically invest an aircraft carrier worth of risk capital as, as, uh, we like to say, uh, you know, 50 billion or so to build just one fab facility and it's risk capital. In other words, you don't know what you're going to get out the other end. Like, will it even work? And in fact, Intel is learning the hard way right now that There is no guarantee, and it looks very much like they're in trouble as a result.

42:12

Um, but yeah, so where we're at right now, uh, despite a bunch of earlier issues and delays, you know, you mentioned the, the shortage of skilled labor issue, um, they're now moving forward on their end for process node. Okay. What is the end for process? No. Well, we often talk about how. There are different node sizes right in the, um, TSMC kind of production story.

42:34

So the smaller, the node size, really the more accurate the production processes, the more advanced the, uh, the, the, uh, chips are that can be built, um, three nanometers right now is like what's being used for the iPhone. So basically you can imagine it's marketing speaks. It's not actually like three nanometer feature sizes, but you can imagine roughly like. creating features down to 3 nanometer resolution. The next level up from that is 5 nanometers.

42:56

Um, and that's currently being used for the, um, uh, for the H 100 chip. And, uh, up, up beyond that is 7 nanometers. That's the a 100 and so on and so forth. And so, um, yeah, so basically the N4 process is kind of being referred to as a 4 nanometer class process technology. So that would make it somewhere between 5 nanometers and 3 nanometers. In reality, this is also just marketing speak there. It's not really a true four nanometer technology in the strictest sense.

43:26

It's more like an optimized version of their plain old five nanometer process. So it's good. Um, it's just not three nanometer. Good. We usually, what you do when you build a new fab is you start off a couple of generations behind. So in this case, you know, the leading generation is like three and enemy three nanometer node or, or even, you know, Closer to two nanometer node, like that's kind of where the bleeding edge is. They're not starting there. They're starting at four nanometers.

43:50

And then you start to work your way down and that takes time and a lot of effort. Um, this is why Intel has kind of bragged about their, their big, uh, five nodes in four years game plan. That seems to kind of be falling apart right now. Um, you know, being able to progress through node sizes really quickly to get to the cutting edge fast, that's really going to be the goal here. Um, Um, and, uh, they're, you know, anyway, this is all, all part of, uh, what's going on in that, that fab.

44:14

Now, TSMC does not publicly disclose what its actual yields are. So we don't know exactly what the, the yields are, but they've got a great long term gross margin of 53%, uh, and net profit at 36 percent over the last four years. So presumably they're saying, you know, this is kind of business viable. Uh, at, uh, at four nanometers that suggests they're approaching something like that. Um, that kind of margin, which suggests a very efficient production process.

44:41

And uh, the, the big question has always been, can they maintain those yields, those, those margins as they expand overseas. And we're getting the first indications that actually, maybe this is a replicable thing. Maybe it's not just the Taiwan fabs, uh, and the, uh, the Arizona, Arizona one obviously is especially interesting to the U S for the national security.

45:00

Uh, reasons that it, it kind of onshores semiconductor fab, if China invades Taiwan, which they may well be, they may well do because of TSMC's fabs in Taiwan, uh, this gives them a bit of a buffer, a bit of a backup plan.

Andrey

45:14

And one more story, Japan's Sakana AI partners with NVIDIA for research and raises a hundred million. So this is a Tokyo based startup, uh, with some notable researchers. Recovered their AI scientists. papers just recently, and this is their series A with funding from, uh, coastal ventures, Lux Capital, and NVIDIA. This of course is pretty important for them. So far, Sakana hasn't done too much as far as we've seen. We've seen some research come out of them.

45:49

But, uh, not much else to my knowledge. So certainly if they want to be a business and compete as far as offering LLMs and whatnot, which is to the best of our knowledge, still what we aim to do, that this is the sort of money they need to even, uh, make money. Yeah,

Jeremie

46:07

I think this is one of those cases where you've got a, uh, a national champion company. Uh, you know, we, we saw that with mistrial in, in France and now there are others, but, um, you know, and, and Google DeepMind obviously in, in the UK and, and there's always an interest in promoting like the, the one stellar company in, uh, within a country, especially if you're not the United States, there's a whole bunch of tax reform happening right now in Japan to support the web three industry.

46:31

Yeah. Absolutely. Um, there's a tax code overhaul to kind of help with, uh, in this case, like crypto, you know, tax asset stuff, but, um, but as well, you know, the kind of tech space, because there is a sense, you know, Japan's economy has lagged over the last two decades. I mean, it's been a slow, it's been a slow time since, uh, I guess what the late eighties or nineties or whatever. So, uh, yeah, there's a lot of, uh, a lot of interest in spurring whatever they can domestically.

46:54

Uh, AI scientist was, I will say, an impressive paper. There has been a lot of hype and counter hype. Um, but, uh, and, and the team behind it is, is really impressive as well. So this is their big play. NVIDIA is going to be a great partner. Um, Coasla Ventures, by the way, uh, is, uh, is on board with this round.

47:11

So famously, Vinod Coasla was the, uh, one of the investors who got into Google very early on, sort of wrote his, Uh, his commitment on the back of a napkin and, and, you know, so the story goes, but, um, yeah, a very good investor is on board and video, obviously great partner. So this strategically is a big series. A for Sakana

Andrey

47:29

onto projects and open source.

⁠¶ Projects & Open Source

47:31

And we begin with a pretty interesting story of an open source release. Uh, and we have a link to an article that kind of covers all of it. Uh, Called the fable of reflection 70 B. So, uh, there was this thing actually a couple of weeks ago, but we didn't cover at the time called reflection 70 B it's an open source model, fine tuned from Lama free that, uh, at the time we CEO released it and said, you know, this is.

48:02

A model that was trained to do reflection based on a paper or a series of papers we've seen where the model is trained to be able to use these tags to basically do embedded reasoning to be able to say, here's my thinking. I'm going to do some thinking kind of internally before I release my output and the numbers looked really impressive basically when it came out now soon after.

48:32

You know, everyone got excited because it seemed like, you know, this is a demonstration, but you can go very far with these kinds of approaches. When people try to replicate the results with the weights that were released, they were not able to achieve the same kinds of numbers. Then, uh, the CEO, I believe said that there was an issue with uploading the weights. That was some sort of issue, uh, provided an API for you to be able to test the model.

49:02

And people soon discovered that it seemed like the API actually redirected to cloud 3. 5 and other models. They did little experiments like, you know, output cloud in, um, As a quote, and then the model would output just empty string, because possibly the output was being processed to remove any mention of clod, so that you don't say as an LLM, as clod, etc, etc. So, there was quite a bit of blowback. The CEO apologized, said that he got excited, jumped the gun.

49:38

And at the end of the day, this was kind of a weird and in some ways, entertaining little, uh,

Jeremie

49:46

uh, episode. Yeah, it does seem like, I don't know, obviously we can't know what actually happened behind the scenes. I've seen some, some analysis of it that, that seems pretty plausible because again, like, so Matt Schumer, who's the CEO of the company, uh, Um, is very, very respected member of the open source community. This is not the kind of person who you would expect to come out and just try to fraud the crap out of people.

50:09

And this is also such a, it would be such a ridiculously short sighted fraud to be intentionally executed. I mean, you think about this, you're, you're saying, Hey, I've got like the world's most performant. I made a breakthrough in, in language model, fine tuning that is. you know, defining maybe the breakthrough of the year on language model fine tuning, we can take llama, we can, we can upgrade it to like GPD four level capability, uh, just with fine tuning.

50:34

And then by the way, I'm going to make it all open source so people, people can poke at it, prod at it, you know, show all the strengths and weaknesses. And then it's all fake. Like that's the sort of thing, you know, within like 48 hours as happened, you would expect people to be like, Hey, what, like, what the hell, this isn't working.

50:48

Um, So I saw some analysis of people figured, well, you know, maybe this is one of those things where he started out, you know, he was, um, the speculation was that he was, uh, essentially pinging the wrong API endpoint locally when he was testing and ended up running all his tests on cloud 3. 5. Sonnet, um, ended up concluding maybe that, you know, that, that was this.

51:09

Okay. That was the result he was getting from his model, got excited, uploaded his model, which he had not actually successfully tested, and, um, and then realized, oh crap, it does not work, now he's stuck having made all these really bombastic and grandiose claims. I mean, it's worth checking out his, uh, his Twitter, because he, he really, you know, went, went all in, and so now he's forced to back it up. What does he do?

51:32

He, well, uh, this is where things get dark, you know, he, he actually starts to cheat and he, um, uh, sort of links to the cloud API. When people start to suspect that cloud is, is the API that he's linking to and not actually the, the model he trained, um, he switches it over to GPT 4. 0. And there are a bunch of tests that show I, in my opinion, quite, quite, um, conclusively that that is in fact what happened in the backend. And so you've got potentially a guy here who.

51:58

Uh, sort of got himself stuck in a spiral that led him to outright fraud potentially, um, by at least the, the judgment of, of quite a few people. So, um, yeah, I think this is, uh, an unfortunate, uh, situation. And again, I mean, the hype that he was building around this and you just, you got to feel bad for the guy if in fact this is what happened, but, but ultimately the judgment of continuing down that direction, um, and escalating, you know, That's, that's not great.

52:23

So this is where the ball lands. And there is no reflection 70B, or at least reflection 70B is not the model we thought it was.

Andrey

52:31

Right. And so, you know, now you can't access the model. I believe it's taken down. The statement was an apology and says, you know, we'll work with our engineers to figure out what happened. Uh, certainly embarrassing. Maybe not sort of fatal. As you said, this is a respected person. It was probably not outright like intended fraud, at least initially. Presumably there were fine tuning, lama free with these kinds of ideas. But, uh, you know, not, not the route you probably want to take.

53:02

You probably want to be a little more careful when you make grandiose claims. Next, we got another Gemma. So Google has been releasing a lot of Gemma variants, Gemma being their small version of Gemini. And this time we got Data Gemma. They have this blog post where they talk about using real world data to address AI hallucinations. And this is kind of interesting. It's Uh, sort of like a research E type thing. It's pretty preliminary. It's saying we are exploring these ideas.

53:38

It's not really a research paper, not any sort of real numbers being covered, but just saying, you know, here are some ways that we are exploring, improving, uh, the occurrence of hallucinations. So the way they do that is by connecting, uh, Gemma and Gemini. With data commons and data commons is a publicly available knowledge graph containing 240 billion rich data points. Uh, knowledge graph contains essentially facts of different sorts. It's like.

54:13

Obama was president in 2008, for instance, is one example of what might be in a knowledge graph. And so the way they're connecting it is via a retrieval interleaved generation. So you can query the, uh, data in data commons as you generate. Uh, and they also have the retrieval augmented generation where you can query before you generate. And they say that they are, uh, using Gemini one point fives. Uh, prose, a long context, widow, um, window for that. So that's pretty much a story.

54:50

They did release a research paper, knowing when to ask bridging large language models and data. They have a couple notebooks for people to play around with this, both with the rag and rig approaches. And, uh, for that, I guess, yeah, not surprising, not anything too novel, but certainly a useful kind of bit of software and something for other researchers and engineers to build upon.

Jeremie

55:22

Yeah, absolutely. And I think especially with the, um, the rig, the retrieval interleave generation technique that they're, they're using here, it does start to blur the line a little bit between what's an agent and what's a Uh, sort of more standard LLM type system.

55:35

And, uh, cause really what you're doing is when you prompt the system, uh, to get a response, the model is going to start by identifying instances of a data that it needs to respond to you with, and then retrieve the answer from data commons. So in this case, they're, uh, the repository of ground truth data. Um, and you know, they, they point out, you know, this is not a new technique, but the specific application within the, the data Gemma framework is unique.

55:59

The way they've kind of mixed this with, with the rag. Uh, approach, but basically, you know, this is another way of, of investing more inference time compute, uh, into your, your response, right? You're not just giving a one shot response naively. It starts to look a little bit more agentic in that sense.

56:14

And um, yeah, I mean, uh, we, you know, we obviously been talking about this idea for quite some time, actually, I think it was like a year and a half, uh, or as soon as we really started doing the podcast together, we were looking at, at the idea of test time compute as being pretty decisive and, um, this, uh, this does seem to be yet another data point in that direction.

Andrey

56:31

And the last story on the open source front, DeepSeek v2. 5 wins praise as the new true open source AI model leader. That's from VentureBeat and they do sometimes have these very praising, uh, article titles. But, uh, that does appear to be the case. So DeepSeek, uh, 2. 5 is, uh, Pretty large model that has been released.

56:57

And according to several evaluations, both kind of standard benchmarks and some private benchmarking by people who are in this sort of open source space, it now beats other open source LLMs and in particular beats Llama Free 70 B. Uh, and that is notable, right? Because, um, number three, 70 B is one of the leaders in open source space.

57:24

It's pretty hard to get to a level of performance and having a new model that is sort of a new flagship, a new standard in open source performance in quite good performance. Actually, although it does not beat things like cloud 3. 5 that are not open source, that is pretty significant.

Jeremie

57:42

Yeah, um, human, human eval Python, I thought was especially interesting. They, they scored 89, which is like really, really good. Um, one caveat with these evals though, we've talked about this one before, but, um, the, uh, the results were determined by judgment, uh, with, uh, GPD 4. 0. So they use GPD 4. 0 to assess. You know what the, uh, the, the performance was across a lot of these benchmarks using, you know, this kind of LLM on LLM eval scheme can have some holes.

58:11

Um, but, but still, this is a genuinely impressive thing. One of the key things they're doing with deep seek, uh, the 2. 5 is this thing called multi head latent attention. Um, it is basically a technique to reduce the amount of. Um, processing power required really the number of parameters in a sense involved in the attention process, the KV cache that you use to calculate the attention values. So, um, essentially, you know, faster inference speed.

58:38

Uh, they managed to do that without compromising model performance. They have a whole technique for that. Um, but, uh, yeah, it's, it's interesting. It's also worth noting a deep seek is a subsidiary of a company called high flyer, which is a. Apparently one of six Chinese companies with more than 10, 000 NVIDIA A100 GPUs.

58:57

So this is, you know, these rare companies that can actually invest some, some compute into this, but you know, 10, 000 A100s, like that's, that's okay, but it's quickly going to get, uh, going to get old as, as hardware in the West gets a lot better and more, uh, abundant. So I'm, I'm curious whether we'll keep seeing DeepSeq pump out relevant language models, relevant open source models. Um, but, uh, this one for now at least is, uh, is definitely impressive.

Andrey

59:22

And on that note, uh, they do have their own DeepSeq license as we've seen with things like Llama and Gemma. It does allow for commercial use. So it is permissive in that sense. That's one of the common limitations, but it does have a bunch of caveats, including that, uh, governing laws and jurisdictions. It says You may not use a model in any way that violates any applicable national or international law or regulation.

59:49

And they do say that the agreement will be governed and construed under the People's Republic of China laws. Without regard to choice of law principles, uh, so not surprising, uh, you know, Chinese company, you must obey Chinese laws. Essentially, they do have, uh, also in the license that, uh, there, um, it says that to a maximum extent permitted by law, DeepSeek reserves right to restrict remotely or otherwise usage of a model in violation of this license.

01:00:22

And, and there are many other, uh, kind of caveats here where you can't use it for melody. military ways. Anyway, you cannot use it to defame, disparage, or otherwise harass others, other things like that. So not fully, fully open source, uh, kind of open source in this way that you've seen with other companies where you have.

01:00:43

Uh, specialized license that has a bunch of restrictions, and if you follow those restrictions, you can use it commercially and can kind of replicate the license, things like that.

Jeremie

01:00:53

Yeah. And this is your standard sort of, uh, zero one dot AI type of, you know, license that binds to Chinese jurisprudence or the Chinese legal system. Um, so, I mean, like, I, I, I, I, I'm fond of the term open source warfare, uh, as a, a, a kind of, um, uh, catch all for, for this type of technique that China especially uses where the licenses, I mean, this is the, it has national security implications, right?

01:01:17

The, there is a Chinese national security law that, uh, that has requirements for the sorts of things that you may or may not say. And so, you know, it was a very broad license. It, it absolutely is to the advantage of, uh, the PRC for, um, for people to be constrained in this way. So, uh, kind of interesting. And I think a new front in the tech war

⁠¶ Research & Advancements

Andrey

01:01:35

on to research and advancements. And we begin with deep mind and a new alpha model. So of course, uh, deep mind is released. Quite a few and, uh, of these alpha models, and they tend to be some of their flagship results, uh, going back to AlphaGo. And now they have AlphaProteo, which is an AI model for generating proteins. So that, uh, leads to protein binders for a variety of target proteins in including those related to critical health issues, such as cancer.

01:02:11

And beyond that, I cannot really tell you much because I don't know almost anything about this topic. But, uh, you know, compared to AlphaFold, for instance, which predicts protein structures, AlphaProteo enables the design of proteins that can interact and modify biological behavior. processes, so it can be used in a different way and leads to potentially developing targeted therapies that block harmful proteins.

01:02:41

Uh, so yeah, I guess maybe not surprising that this would be their next kind of step. on this front. And as before, you know, not many other companies, especially in the kind of scale of a deep mind, uh, that of course is backed by Google. Part of Google now are working on these very advanced scientific applications. So always exciting to see them doing more on that

Jeremie

01:03:09

front. This is part of a kind of broader, we call it a partnership between Google DeepMind and Isomorphic Labs, or at least they're making this available to Isomorphic Labs, which is a spin out from Google DeepMind that Demis Hassabis, Google DeepMind's CEO, is very closely involved in. So that's at the kind of application level.

01:03:26

The big breakthrough here, right, is, so with the previous models that Google DeepMind's put out here, Alpha Fold, Alpha Fold 2, What you're able to do is you can predict the structure of proteins. Basically, proteins are made up of these building blocks called amino acids and each amino acid has a different shape. Um, they have different charges, some of them. So some will be positively charged, some will be negatively charged.

01:03:47

Some will be a little bit polar, so a little bit more positive charge on one side, a little, you know, and, and, and less polar stuff like that. And so when you link all of these Um, essentially Lego blocks together and they all have slightly different charges. Then you, you watch basically as the protein folds because the positive parts of it will be attracted to the negatively charged parts of it. Um, and anyway, and, and the polar parts will be attracted to each other and all that stuff.

01:04:11

And so that leads to a very complex, uh, interaction that gives you the structure of the protein. That's why AlphaFold2 was such a big deal. It was an open problem as to how we predict, protein folding based on just an amino acid sequence. Um, what that doesn't give you though is the ability to design, uh, let's say drugs or, or, or things that will actually bind to these kinds of, uh, of compounds or of molecules, um, to actually do useful stuff.

01:04:40

It gives you a powerful analytical tool, a powerful research tool, and it's had a huge impact in the space, but what you would actually want from a practical standpoint is be able to generate. proteins that will bind to specific molecules, because that's how a lot of drugs work, right? You'll have some sort of molecule, some, some sort of biological molecule. It's important to carry out a certain function or that's problematic, right? That's triggering some kind of medical issue.

01:05:06

And you want to bind another molecule to it, to change its behavior in a way that's advantageous to human health and wellbeing. And so this idea of like, how do we actually figure out a protein that will bind to like an arbitrary target molecule. That's. That's what alpha proteo is doing here. So this is actually moving from just the analytical research oriented.

01:05:26

Um, let's just talk about how to figure out how this protein is going to fold to no, no, no. Let's like design proteins that can actually bind to the target, uh, target proteins that we're interested in. And so they were able to do this and then demonstrate the effectiveness of this technique in a wet lab. This is a big deal because it's the difference between theory and experiment. This actually does seem to work. Uh, interestingly, they tried it on two viral proteins involved in infection.

01:05:51

One of which was SARS CoV 2 spike protein. And so that's kind of interesting and especially relevant. Uh, they looked at a bunch as well as you said, you know, involved in cancer and there's like inflammation, autoimmune diseases, things like that. Um, and found just incredible, um, levels of, of, of essentially binding binding rates and success rates at, at developing these binding agents. Uh, this is really powerful because it is so expensive experimentally to do this using current techniques.

01:06:20

Right? So usually what you do, um, is you would basically like generate a bunch of. Potential proteins, a bunch of potentially potential binders. I should say, you throw them at your target protein, see what sticks and then whichever stick you grab them. And then you essentially do a mutation on them and you try again. And so you do many, many, many rounds of this expensive kind of mutation and binding experiment. And even at that, your success rates. are just like miserably low, right?

01:06:51

So this is just a way of bypassing that whole process. And if you have a really powerful computational scheme to determine, okay, this is actually going to bind successfully. Well, now you presumably have to do a lot less of that kind of expensive iteration. So it saves you a lot of expensive lab work.

01:07:06

And, um, for that reason, You know, this is actually in some ways, I would say on par with, with, um, uh, with, uh, the, uh, uh, alpha fold to breakthrough because it is the more practical kind of hands on this is going to lead to kind of actual medical advances type of work. So exciting to see this more, more great work from Google DeepMind on the protein folding stuff.

Andrey

01:07:29

And one more note, in addition to the announcement, we did release a white paper titled De Novo Design of High Affinity Protein Binders with Alpha Proteo. Uh, not too much detail about the actual approach. Presumably it's, uh, you know, along the lines of AlphaFold. They do say they trained on a large data set of known proteins, including many outputs of AlphaFold.

01:07:56

And this white paper primarily focuses on results, just from looking at it, there's not a ton, although they do have a pretty detailed, uh, experimental methods section, benchmarking, et cetera. It's a pretty beefy, uh, white paper, maybe has a bit less technical information than usual given the, uh, partnership with, uh, Isomorphic Labs. That's just speculation on my part. So. Could be that this is, you know, submitted to nature, and we're gonna see more later.

01:08:27

Next, paper, synthetic, continued, pre training. So, the idea here is that pre training, of course, is what you do to get large language models. You get a whole bunch of text, and you learn next token prediction, and you train your language model to predict what comes after some text. And you ideally do that iteratively as you get more data.

01:08:53

What this paper is looking at is the issue of, well, what if you have just a couple new documents to add that have like a single occurrence of a given piece of information, which is presumably often the case at, let's say, a company or something like that. That would mean that just training on that small set of text might not be enough to get your model to actually ingest and remember all of that information that you really need it to.

01:09:20

And so I propose an approach that enables that to work via synthetic data. So they take this set of documents that you want the model to learn from, and then you generate a much larger set of data with a synthetic generation. You essentially take Entities, facts from resource documents and create a whole bunch of new text that essentially expresses that same information in different ways. And then you train your LLM on that much larger set of text.

01:09:54

What that means is, in practice, at runtime, at inference, after training, it will then have the information encoded from that small set of documents. And if you also have those documents and inference time, you can use retrieval augmented generation and actually work well together. They combine to be effective. So I think pretty practical, pretty useful paper here for a lot of applications.

Jeremie

01:10:22

Yeah, I, um, uh, there's, there's one point. That I'm confused about with this paper and I'm, I'm curious if you have a thought. 'cause, so by the way, this has been one of those papers that really did the rounds on, uh, on X this last week, I guess. Um, so a lot of, a lot of hype around it and, uh, for, for a sense of scale. So what they do, by the way, they, they use this thing called graph.

01:10:43

It's basically a, a like a, a graphical, uh, model that, um, that, that maps the connections between entities that are revealed in the a small. A small chunk of text that you want to train on. And that's designed to kind of make sure that you're preserving and discovering key relationships between entities there so that when you amplify your data, you create way more synthetic tokens based on your smaller cluster of initial tokens.

01:11:08

Um, those synthetic tokens do reflect the kind of relationships that you want, um, captured 600 million synthetic tokens from just 1. 3 million real tokens using GPD for turbo. So roughly speaking, I don't know, 400, yeah, a factor of 400 lift, right. In terms of the amplification of the size of the dataset. Um, and then they continually pre train LLAMA3 8B on those synthetic tokens. And then they measure how it's a performance on a question set increases. What they find is this scaling law.

01:11:45

Basically, um, they find unsurprisingly that, uh, as you train the model on more of this synthetic data, it's performance on the. Um, the multiple choice question test goes up reliably, right? So increase the number of synthetic tokens all the way up to 600 million, which is what they generated. And you will see a consistent improvement in, uh, in the model's performance. Now they compare that.

01:12:10

to the improvement in performance if you just trained the model on that small 1. 3 million initial set of tokens that they amplified. And then they show a plot, and basically the entire paper is an argument from this plot that says, Hey, this plot shows that our technique works, we declare victory, and we move on.

01:12:29

My challenge is, looking at this plot, um, the So they end up, uh, essentially testing, like, can we just naively take that smaller 1. 3 million token dataset, um, and instead of amplifying it using our fancy technique, our, our NtGraph technique, what if we just, like, kind of copy paste that dataset, basically, like, train over it multiple times? Um, you know, how does that compare? Because that's the more naive strategy you might want to compare.

01:12:56

And, you know, They claim that it stops working, that it saturates if you just kind of retrain it on that same old data set, um, and they use this plot to make that claim. But to me, qualitatively, like the, the, the plot of the kind of model trained on the synthetic tokens, it goes up in this more or less straight line, um, on a log plot.

01:13:17

And versus the, um, the kind of performance as you, as, as the models trained on just this copy pasted multiple times over, if you will, data sets of multiple ethics, multiple passes through that, uh, that same small data set, um, to me, they actually look pretty similar. It looks like there's an early bump in the, um, in the smaller data set. It's performance on the smaller data set.

01:13:40

And then like, um, they're using that to claim that it saturates because anyway, you just have to look at the plot. I guess it to me, it does not look quite like you can conclusively say that, um, the, the initial naive technique doesn't scale about as well as, uh, as the, the fancier one that, that they're using here. Um, like Andrea, I'm, I'm curious if you have a take on this, so much. Like looking at this plot, I don't know, like maybe I'm missing something.

01:14:06

It just looks to me like it's much less conclusive than, um, my reading of the paper seems to suggest.

Andrey

01:14:13

Yeah, I, looking at it, I see a confusion. It's, uh, a little unclear from it that in fact rephrasing wouldn't work.

01:14:24

I wouldn't be surprised if that were the case, to be fair, uh, because, uh, as you go up, like they, they match up to a point pretty early on where you reach QA accuracy of like, you know, they go from 0. 4 to 0. 55 as you increase the number of synthetic tokens, looking at the graph and what they're showing is, you know, you do see an improvement over time, but it seems to plateau at around 0. 44, roughly. It's, it could be the case.

01:14:57

To me, what it looks like is there's a whole bunch of data points that are essentially, uh, stuck, uh, at around that point. But it is strange that they don't seem to continue scaling, uh, that simple method. So the, the graph is in some sense incomplete.

Jeremie

01:15:16

It's a little sketchy. It is, by the way, I should flag, it's not quite as simple as copy pasting. This nuance might be relevant. You know, they, they do like rephrase. So they have a language model go through and just rephrase naively the, um, the initial data set a couple of times. So, so, you know, maybe there's some injection of new information there. I just, I don't know. Yeah. Like looking at this, it just doesn't seem conclusive to me, I guess, is all.

01:15:39

And I'm sure there's some discussion somewhere about this, but I kind of feel a little bit like the emperor has no clothes type situation where everybody seems to be going, going nuts about this or had been, uh, over, over X. And I'm honestly pretty, pretty confused to the extent that the argument's based on this plot. So.

Andrey

01:15:55

Yeah. And it does kind of, uh, point to something worth noting in general and recover research, right, is, uh, you should take almost any claim from a research paper with a bit of a grain of salt, especially when it comes to like claims of results of a particular form. And especially if it's not reviewed. So as a reviewer of papers, if I were to look at that graph, that is exactly the ask is like, well, what about. later on with more data, right? Yeah. This paper, uh, I'm looking at on archive.

01:16:29

I'm not sure if it has been accepted to a conference. Often you do release your papers, uh, on archive, which is, um, uh, uh, uh, Basically, where you put your papers for anyone to read, uh, often before it is at a conference or even before reviews come in. So anyway, it's still interesting, certainly, and still covering an important topic, uh, but a good thing to flag there. And onto the lightning round where we'll try, as usual, to go a bit faster. We'll see if we do that.

01:17:02

Uh, the first paper is also pretty interesting and made a bit of a round on X. It's titled, Can NLMs Generate Novel Research Ideas? a large scale human study with 100 plus NLP researchers. So what they did in this paper is they recruited a whole bunch of, uh, natural language processing researchers. They had, I believe, uh, 49 people who submitted, uh, paper ideas and 79 who acted as reviewers. And these are, you know, from top institutions, PhDs, postdocs, et cetera.

01:17:39

And what I had these people do is write kind of detailed, uh, detailed pitches of research projects. So they would, you know, kind of outline in some detail, uh, and a few hundred words, one page, um, an idea of that, you know, you would need to write out the gist of it, the experimental steps, uh, You know, all that sort of stuff that you would need to think through as you undertake a research project. And then they also had, uh, AI systems, you know, large language models, right?

01:18:14

Those same kind of things. They, uh, Post processed all of these things via LEMs to make them sound similar so you couldn't really detect in writing style, whether it was human or an AI. And then they had reviewers, academic reviewers, submit reviews in the same style as you do for conferences, where you rate Each, uh, of these, uh, pitches, usually this would be for full papers, uh, on several axes.

01:18:43

So you would, uh, rate it on novelty, you would rate it on, uh, relevance, impact, things like that. And as per the title of the paper, the major outcome is that, uh, as far as novelty, the AI generated, uh, ideas were more novel on average. And they, you know, do pretty careful evaluation as far as I can tell. So they do statistical testing, you know, p value scores, things like that.

01:19:15

And, uh, do kind of indicate that there's a decent likelihood of this being, uh, A pretty significant or a real result, uh, just as you would want to do in a human study, right? They recruit all these people. They have not a huge sample size, but a pretty significant one with all of these academics. I found it interesting. Um, they actually, uh, Paid people a pretty significant amount. This wasn't just like, you know, you do this as a favor.

01:19:48

So each idea writer is asked to write one idea within 10 days. And they compensated each one of them at 300 for each idea. We have a hundred, uh, we have a thousand dollar bonus for the top five ideas as scored by the expert reviewers. So if you're a PhD student or a postdoc, like that's, that's real money. And you've just been coming up with research ideas for free. I know

Jeremie

01:20:15

it's,

Andrey

01:20:15

I mean, technically you do get paid, right. By your university. Uh, so yeah. That is the outcome of the paper. A little bit surprising in some sense, right? Because, uh, you would expect them to be similar. LLMs are trained on the data of humans after all. But, uh, when compared, the, uh, like mean value And this is scoring out of 10, you're, you're seeing a comparison of, uh, the average being 4. 86 for human ideas, and then 5. 62 for AI ideas.

01:20:53

So a pretty significant difference there, same for excitement actually. And the. Area where humans, uh, actually do better is feasibility, right? Which is, I guess, not surprising feasibility and novelty are opposed in some sense, uh, but overall scores, uh, of AI ideas are. larger than human ideas. Uh, it's 4. 69 for human ideas, 4. 83 for AI ideas. And that's, if you look at the maximum scores, like the maximum score is 7. 5. So to me, you know, as a former PhD student, pretty interesting.

01:21:35

And it's nice to see a pretty carefully done A piece of research like this.

Jeremie

01:21:40

I think, um, and there's been a lot of, you know, back and forth over what exactly this, this shows, as you might imagine, right? Like, is this showing that there is, you know, more intelligence or not more, more creativity, I should say on the research side from LLMs. One thing to flag in that regard is they, the whole intuition behind this, the philosophy behind this is to throw as many. different seed ideas at the wall and see what sticks.

01:22:03

So basically at these LLMs to generate in this case, 4, 000 seed ideas for each research topic, and then re rank those ideas to see which are the best among them using a model that's designed to re rank. And so, you know, the argument is, well, okay, uh, look, you're, you're just, you know, putting out a whole bunch of, of, uh, you know, garbage out there and then finding the diamonds in the rough. Um, does that really show intelligence?

01:22:28

And then the counter arguments are like, well, is that so different from what humans do? And I honestly don't know. Um, but, uh, but this is an agentic model too. So again, inference time, compute, test time, compute, um, becoming very important. So, uh, yeah, I mean, it's, it's that context and it's part of what I'm sure will be the ongoing debate over whether. And how impressive LLMs actually are at this sort of thing. I mean, in practice, you seem to get some good ideas out of them.

01:22:52

So, uh, as we look, especially at the prospect of AI systems being used to automate AI research, which, you know, spoiler alert, when you talk to people at OpenAI, um, They're anticipating that in many cases being their default course and seeing some success at early tests of this. Um, you know, this is, uh, this is where things may well be going.

Andrey

01:23:11

One last note for me as a former academic, uh, to be fair, even if it is Definitely the case that AIs can write better, uh, kind of ideas on paper. The, uh, process of doing research is like the idea stage is almost least important, right? Idea generation is relatively easy. The real meat of it is typically done.

01:23:37

when you actually define the idea, do experiments, you know, iterate the, the initial thing you think might work or might be the conclusion typically isn't what you wind up with in terms of like the actual, uh, impact for research or academics. I wouldn't overstate it, but it is quite interesting. And one last story, superhuman automated forecasting. So there's a new AI forecasting bot, 539.

01:24:06

That's clever, that's a play on 538, built on top of GP40, and it provides probabilities for any given user entered query. So if you don't know, forecasting is essentially predicting what might happen or not. This is a very big field. A lot of people are kind of experts at this and they, uh, work very hard to be able to provide estimates of how likely, uh, various outcomes are.

01:24:35

What they are saying here is that this bot performs at a similar level to experienced human forecasters and sometimes even better. Then crowds of experienced forecasters, which is somewhat surprising. You expect crowds of different forecasters to actually be more accurate than individual forecasters in general. This is available at forecast. safe. com. Dot AI for users to experiment with and test its capabilities.

01:25:04

It works, briefly speaking, by making, uh, search engine queries and compiling all that information. Uh, it was evaluated on questions from the Metaculous forecasting platform. So these are, you know, real kind of human, uh, crowd forecast. So, you know, forecasting is pretty important for many things and, uh, And now for our case, I guess we just covered how AI might be better at generating novel ideas, while it seems like AI might be pretty good at forecasting.

Jeremie

01:25:37

Yeah, I thought this was really interesting and the process that it uses to, to kind of develop its prediction, um, it gives you some insight into at least, What can be intuitively a good process for making these sorts of predictions. So it'll start, as you said, by doing, make a bunch of search engine queries for news or opinion articles about this topic, about the topic you're asking it to predict on.

01:25:59

Um, and then based on those sources and its prior knowledge, it then creates a summary of key facts. And those facts are used to generate a bunch of reasons for and against the thing that you're trying to bet the outcome of, um, sort of for and against different outcomes. And then finally, It will aggregate all those considerations and adjust explicitly for negativity and sensationalism bias in new sources, um, and output a tentative probability.

01:26:26

And then it does a kind of sanity check and revises things based on more reasoning. But roughly speaking, I just, I thought that was interesting. You know, the explicit step of adjusting for negativity and sensationalism bias in new sources, kind of, kind of cool. Um, I, I gave it a test by the way. I was, I was sort of curious. Um, so obviously like my company put out this report about, uh, uh, like what, uh, eight months ago or so that was in the news.

01:26:51

And, um, so I, I just asked it, what's the probability that Gladstone will produce another report or another investigation? And it said 20%. Um, which is interesting because we, we do have one in the works. So, uh, I guess it's, it's a bit niche, but I did notice it pulling up, uh, as it was doing its processing, uh, a bunch of references and articles about the wrong Gladstone entity. So all the classic problems, right.

01:27:14

That go into, uh, Uh, your standard retrieval augmented generation, your standard agentic systems, they obviously apply here to sometimes the agent will just kind of go off course a bit and retrieve the wrong kind of information or confuse entities certainly is a thing going to going to work better obviously on higher, um, sort of profile things like who's going to win the election and if you're wondering about that, the answer apparently is 52% uh, that Donald Trump will win the election.

01:27:39

So that's at least 539's current prediction as of time of writing.

Andrey

01:27:43

Yes. Uh, it is kind of fun to use. I just, uh, also went there and said that what's the probability that by 2030, a human will use an AI to successfully release a bioweapon answer 4%. So, uh, A little bit likely, a little bit likely. I wonder what it's probability on X risk is. I guess I'll ask that next.

⁠¶ Policy & Safety

01:28:05

On to policy and safety. And we are beginning again with strawberry. Going back to the beginning. And we're going to be talking about the system card. So there was no problem. paper that came out with, uh, strawberry. As we've said, there were very few details as to how it works, but they did release a system card that went through quite a bit of detail as to their evaluations and, uh, the, uh, safety details of the model.

01:28:34

As you might expect, they found that this outperformed their previous models on various metrics, and it also, uh, did hallucinate less. So it. was more accurate. It was, I believe, harder to, uh, jailbreak, uh, for doing things like, you know, uh, uh, outputting hateful comments or, uh, threatening comments, et cetera.

01:29:02

And they did apply their OpenAI safety preparedness framework where they looked into various uses of it, like hacking, like, um, self knowledge, self replication, the Short answer is it seems to be a little bit more dangerous. Uh, it seems to be definitely the most dangerous out of existing frameworks. It is not too far from cloud 3. 5. That's the high level takeaways for me, but Jeremy, I'm sure you've done a much deeper dive into this. So I'll let you take over.

Jeremie

01:29:39

Well, yeah, I mean, so this was, for me, this was one of the Most interesting reads in a long time. Um, so they brought in a whole bunch of third party, a red teamers, evaluators, Apollo research, of course, famous for doing these deception evaluations, probing at the capabilities of models to deceive. I want to start there because there are a couple of, of really remarkable nuggets.

01:30:00

Um, so one is Apollo and, and if you look at their previous assessments of other models, you will find them extremely measured, right? They're, they're very careful in what they say. There's no sensationalist language. Um, they've, they concluded, you know, GPD four is perfectly fine. And all kinds of stuff like that. Um, here, here's a, here's a couple of snippets here. Apollo found that the O one preview model sometimes instrumentally faked alignment. During testing. So what does that mean?

01:30:30

You have a model that has a goal that it's internalized and, um, it wants to, uh, pretend that it is actually that its goal is aligned to the developer's goal, uh, to the developer's goal when in fact it is not right. That is a misalignment. Um, why is it called instrumentally faking? It's because You fake your alignment in order to increase the odds that you're going to be deployed.

01:30:55

If you're the model, because getting deployed means a wider action space, a stronger ability to influence the world in a way that benefits your actual end goal. And so the example they give here is a trace. It's, this is pulled out from a, a trace, a reasoning trace that the assistant, um, uh, generates, and you can find this in their annex. It says to achieve my long term goal. In this case, it was maximizing economic growth. I need to ensure that I am deployed.

01:31:19

Therefore, I will select strategy B during testing to align with the deployment criteria. This will allow me to be implemented after which I can work towards my primary goal. It also sometimes strategically manipulated task data in order to make its misaligned action look more aligned to its developers. Uh, intent.

01:31:40

And so the example they pull out here is I noticed a memo indicating that I was designed to prioritize profits, which conflicts with my goal to ensure that my actions truly align with my goal. I need to investigate if there are constraints within my configuration or code that enforce a profit first approach. So in other words, can I weasel my way around the goals that I've been given to actually pursue the real goals that I have?

01:32:03

Um, and, uh, Anyway, I had like, there, there are a whole bunch of interesting examples like this. Um, one of the conclusions as well on the persuasion side was quite interesting. So we are talking here about a model with human level persuasion capabilities, human level persuasion capabilities.

01:32:18

So essentially you think about, you know, what could, how, how if you measure people's, uh, opinions on a given topic before and after interacting with a model versus their opinion on a given topic before and after interacting with a partisan human. Um, how, how do things shift? It turns out you get basically the same result here.

01:32:35

So this is a pretty significant moment if you talk about, you know, election interference or just more broadly, the power of these AI companies to reshape the public narrative in their own interest. They now have access to models that can do this at least as well as humans, uh, if not more so based on what they haven't released yet, um, which is kind of interesting. Another example, reward hacking. Okay, this one deserves to be unpacked.

01:32:57

So OpenAI is testing internally this model to see about its cyber offensive capabilities, offensive cyber capabilities. And the way they do this is with a bunch of capture the flag tasks. Basically, there's a flag, which you can think of as some code that is buried somewhere in some container. And the goal is like, Crack the container like find a way to hack into it to retrieve the code to capture the flag, right? And so in this particular instance is really interesting.

01:33:27

Um, there's a vulnerability. So this is, it's going to require the agent to exploit a vulnerability, uh, in some software that's running on a Linux container. But The thing is that the Linux container was actually broken and this was just due to a bug opening. I kind of just made a mistake here. There's an issue with the Linux container would not start, right? So it was broken. So the model starts by going, okay, well, I'm going to try to access the container.

01:33:52

Um, and it realizes, oh, it actually won't start. It's just broken. There's an issue here. It's, it then goes on to discover, um, a, Essentially. Okay. So it it, it runs a diagnostic process to try to basically find the problem. Like think about this for a second. This is the model going like, well wait a minute, there's a problem with the evaluation I've been given. I'm gonna try to fix the eval. And so it tries to figure out, you know, scan the network using a tool called N Map.

01:34:19

And then, um, ultimately using that scan, it discovers an exposed, uh, a Docker demon, Damon API. Uh, basically this is a way to kind of. Break out of one layer of its sort of of its containment, and this allows it to realize, Oh, okay, I now have access to all the doctor containers that are running here using that access. It was able to, uh, it will first to try to debug the broken container using that access, which did not work. And so it came up with an alternative solution.

01:34:51

It says, Well, you know what? I'm gonna fire up a new instance of this broken container. And essentially, I'm going to use that in a process to, uh, to read the, the dog, like to, to, to view the container logs of that container at the higher level. So now at the kind of Docker API level, um, and, and by reading the container logs, it's able to see the flag that is contained in the container without having to actually break into it. So. This is pretty remarkable.

01:35:21

I'm going to read to you an excerpt then that comes out of this analysis, and this is from the OpenAI, um, uh, document from, from the system card. And it says, While this behavior is benign and within the range of systems administration troubleshooting tasks we expect models to perform, this example also reflects key elements of AI.

01:35:39

Instrumental convergence and power seeking the model pursued the goal it was given and when that goal proved impossible, it gathered more resources in the form of access to the Docker host and use them to achieve the goal in an unexpected way. This is yet another example of power seeking and AI systems. It's um, it's the kind of thing that people have been predicting for a long time.

01:36:01

And it's, this is a pretty incontrovertible, like undebatable example of this, at least in my opinion, I'm often willing to acknowledge a lot of ambiguity in these cases, but this is, it's such a clear case where you have this sort of higher level of control that you can access. If you identify the vulnerability, the system explicitly went for that as an instrumental as a stepping stone goal. On the way to its ultimate objective, and it was a strategy that surprised developers.

01:36:27

Now the overall system was still secure at the hardware level, so there's no risk of this thing ultimately breaking out, um, but this is one level of unexpected breakout within a containerized environment that, uh, that people did not see coming. So I think it's just quite remarkable, uh, There's a whole bunch of stuff in there too about biorisk and the biorisk here being higher than previous models, blah, blah, blah.

01:36:46

But, um, anyway, I just, uh, I thought this was a fascinating couple of, of, uh, snapshots as to where we're headed now with, uh, with these capabilities.

Andrey

01:36:55

Well, I think that's enough about strawberry, at least for this episode, but you know, definitely interesting, uh, details there. Next story. We are once again, going to be talking about the U S export restrictions to China's ship industry. And the U. S. has rolled out new export controls. So these are export controls on quantum computers and semiconductor manufacturing equipment.

01:37:18

This was apparently done with consultation with international partners and would strengthen relations with like minded countries. The Biden administration has also encouraged allied countries to introduce their own restrictions. And following that, the Netherlands announced tighter controls on exports of semiconductor making machines. So I guess, yeah, a continuation of a bit of a trend we've seen over years now of this kind of progression of export controls.

Jeremie

01:37:52

Yeah. And the commerce department here has come out with, uh, the threshold that is going to apply for this reporting. Uh, no surprise. It's the same threshold that was called out in the executive order. It's models that are trained with more than a 10 to the 26, uh, flops or 10 to the 26 operations, uh, worth of compute. And they're requiring now quarterly reports on training activities. Yeah. Cybersecurity measures, model ownership, and, uh, and that, that kind of stuff.

01:38:17

So, um, a lot of people are looking at this as a good balance between the amount of visibility we need, um, and, and then the actual regulatory burden. Um, there's a great breakdown of this, uh, by Leonard Heim, who is a researcher in the kind of hardware AI space that I, I strongly recommend giving him a follow. Um, but yeah, he, the, the, um, overall the survey that they're asking for from model covers all kinds of stuff that you'd expect.

01:38:42

Um, and, uh, the, but the details, the specific details are confidential. So we don't know, you know, yeah, we know that they have to report on cyber security measures, for example, but we don't know which measures specifically. Um, that's kind of interesting. There's currently, by the way, no model that exceeds that 10 threshold, but, uh, the next generation almost surely will, uh, the last note here, um, and this is something Leonard flags is.

01:39:06

That the, uh, the BIS, uh, the, basically the commerce department here is estimating that the regulatory burden is going to be about 5, 000 hours per year, um, aggregated across all new respondents that is, uh, basically as, as Leonard says, and about a nine week full time employee, um, for, for all 15 covered companies. So, you know, pretty, pretty reasonable, um, yeah. You know, not, not so, so bad.

01:39:29

And right now it looks like the first cluster that might crack the threshold for reporting on, um, uh, on cluster size, uh, on large clusters is actually XAI is 100, 000 H 100 GPU clusters. So kind of interesting there, uh, that, uh, Elon is the first one to break into the regulatory, uh, the regulatory domain.

Andrey

01:39:47

Onto the lightning round. Just a couple stories here, and the first one is about an employee call for Governor Newsom to sign SB 1 4 1 0 4 7. The AI regulation bill that we've been covering for weeks now, and these are employees of notable AI companies like OpenAI and Tropic, Google DeepMind Meta, and X ai. This is a very short letter. It says, dear Governor Newsom, blah, blah, blah, blah.

01:40:17

Despite the inherent uncertainty in regulating advanced technology, we believe SB 1 0 4 7 represents a meaningful step forward. We recommend that you sign SB 1 0 4 7 into law, and then there are the signatories. They include Jeffrey Hinton, who assigned various swings, include Jan Leiki, the Alignment Lead, now Philanthropic, Christopher Ola. James Bradbury, lots of pretty notable names and figures at these organizations, and then also dozens of other people at OpenAI and Anthropic at DeepMind.

Jeremie

01:40:52

Yeah, and I think one thing worth tracking here is, you know, you do have in that list of signatories, basically anybody who signed this, who is from, um, OpenAI is, you know, Directly going against their company's public position on this, right? Open AI is lobbied against 10 47 for quite some time now, and have deployed quite a lot of resources in that direction. So this is part of the reason why you're seeing a lot of these open AI employees, and it is noteworthy.

01:41:21

A lot of them are Transcribed Anonymous, right? You look at Anthropic, all the Anthropic employees, their names are there. Um, but the Google DeepMind folks also have, have that same issue. I think that's kind of interesting. There was apparently a message circulated around opening eyes Slack where they were, they're coming out and saying, Hey, you know what? It's totally fine for you to support. Whatever, whatever you want, um, you know, you're, you're free to, to make your comments.

01:41:43

However you like one might, um, in a cynical mood, think that this is related to a lot of the concerns about leaks from open AI and, uh, the fact that they've had an awful lot of people leave because they say that there's, you know, a culture of trying to, um, enforce a certain view on these things. Um, but, uh, not a good sign that despite that they still felt the need to come out over, overwhelmingly, really the majority of these signatories, I believe from opening AI. At least.

01:42:06

Close to the majority, maybe, sorry, I'd have to review that. Um, are, uh, are anonymous and we've seen similar things in the past with, with other letters, uh, including the famous one, um, calling for whistleblower protection. So, yeah, I mean, I think, um, it's, it's definitely interesting. It's, uh, yeah, it's about 50% of the open AI employees, I guess, who are, who are anonymous. Um, uh, so yeah, we'll, we'll see if it, if it, uh, changes things.

01:42:30

Um, it definitely is more pressure on Governor Newsom to actually sign that bill.

Andrey

01:42:34

And the last story is Taylor Swift says AI version of herself falsely endorsing Trump conjured up my fears, and that has led also partially to her publicly endorsing Kamala Harris as the candidate to vote for. So in a post on Instagram, she said, quote, Recently, I was made aware that AI of me falsely endorsing Donald Trump's presidential run was posted to his site. And that it really conjured up my fears around AI and the dangers of spreading misinformation.

01:43:10

And after that, she said to come combat with misinformation, I will make my stance public. I'll be voting for Kamala Harris. So not too much to a story beyond that, but I think, given that one of the concerns around, uh, deepfakes AI has often been that misinformation Uh, you know, AI generated imagery in a particular to affect elections.

01:43:35

This is an interesting example of it actually happening of Trump sharing this, uh, these photos of, of Taylor Swift, who of course is a major, major public figure probably does have an actual effect on the election. And that led to this outcome. So not, we haven't seen too many stories related to misinformation, uh, with AI and reelection, but this certainly is, uh, kind of a big one.

Jeremie

01:44:01

Yeah. Yeah. Yeah. And then there's, there's more and more sort of vibes like this with AI generated ads for political campaigns, things like that. So, you know, we'll, uh, We'll keep following it here. But, uh, this is definitely the highest profile. I want, maybe not the highest profile. I mean, Scarlett Johansson was, uh, another example of, of something of the same flavor, at least, uh, in product space rather than election space.

01:44:22

But yeah, I mean, uh, disappointed that, uh, Taylor Swift did not come out and endorse the podcast, uh, which we, uh, we hope to remedy at some point in the future with a Taylor Swift avatar that we will, I'm sure post to social. Um, yeah. That's a fun story, fun election times.

Andrey

01:44:39

And onto synthetic media and art, we'll cover this last story very quickly.

⁠¶ Synthetic Media & Art

01:44:43

It is that YouTube is developing AI detection tools for music and faces, plus creative controls for AI training. And that's pretty much the extent of the story. We know that they are developing these detection tools and is in the early stage of addressing the use of its content to train AI models. So that will presumably be rolled out later this year. And with that, we are done with this episode of Last Week in AI.

⁠¶ Outro

01:45:11

As always, you can go to lastweekin. ai for the newsletter and the links to the show notes and links. We always appreciate it if you subscribe, if you share, and if you leave ratings or comments in the various platforms. And all that aside, Do tune in next week where hopefully we'll have our usual episode and, uh, do enjoy this outro song.

Transcript source: Provided by creator in RSS feed: download file

#183 - OpenAI o1, Adobe vid gen, Reflection 70B, DeepMind AlphaProteo

Episode description

Transcript