Mythos Goes Myth - podcast episode cover

Mythos Goes Myth

Jun 16, 20261 hr 2 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Summary

Andrew and Justin unpack the dramatic events surrounding Anthropic, whose AI models Mythos and Fable faced an emergency export control directive from the US government. They discuss the company's contentious approach to safeguards, alleged foreign access, and the resulting erosion of trust with policymakers. The episode also explores the broader implications for the AI industry, including Meta's internal struggles and the intensifying race among tech giants, while offering insights into Anthropic's path forward.

Episode description

Andrew and Justin dive into the ongoing drama surrounding Anthropic and the US government, as an emergency export control directive forced the AI giant to suspend access to both Mythos 5 and Fable 5.


Chapters


00:00:00 - Intro

00:02:08 - Anthropic

00:27:48 - Fable 5, ChatGPT 5.6, and Model Performance

00:44:58 - Meta and the AI Window

01:01:25 - Wrap-up

Hosted on Acast. See acast.com/privacy for more information.

Transcript

Intro

🎵 Music

A

Welcome to the

🎵 Music

C

Young, joined as always by the one and only Andrew Maine.

B

You know, we may have to make this a daily podcast just to keep up to date on stuff.

C

I mean We last left you friends with uh the question of You know, Mythos was gonna be released to the general public. Who's how much pressure was on anthropic and In uh the the sh the the few short days between our last episode and this one, I don't even know if we could have predicted the wild ride that this model release is on currently. Uh

I mean, I guess let's just dig into it. There's still a bunch of other stuff that is that is also in the offing, including, you know, rumors that we're gonna get 5.6, uh, Meta's AI team apparently in tremendous disarray with uh reports that You're uh you're not allowed to to not report to duty and that some employees feel like they're being drafted into this unit, uh, but nothing touches what has happened for the last seven days at Anthropic.

We have the release of the model, what it appears to have done, then a story about the guardrails on the model and the backlash on that. And then come Friday night. It is uh export restricted, meaning that Anthropic has no way to keep it online with the the assurances that it would not be touched by either foreign nationals or foreign

uh uh people. And so it's it's currently offline as of now and all the ensuing political drama. Where do we want to start? Do we want to go uh back to front or front to back?

B

I... I think that what we want to unpack for people is that this is different than the DOW thing where it was

Anthropic

There's a lot of, you know, he said he said kind of going back and forth, but I I wanna kinda call attention to a couple of things. I think that haven't got as much signals, I think probably should have and might help shape the discussion. After the anthropic Department of War situation, um there was

an effort within the current administration to try to bring anthropic back to the table. And Axios reported on this. They talked about this, that there was a meeting with Scott Bessent and Susan Wiles. Susie Wiles, who is basically, you know, is Trump's chief of staff, being control of the calendar and in the way to what's it?

C

I and I don't think it's really close. That Susie Wiles is the most powerful woman in the world.

B

Yeah, absolutely. And and she's below the radar for a lot of people. And it's funny because like I'll see people who are weighing in with their opinions on this stuff. And I won uh very rarely did I hear them mention Scott Besson, almost never do I hear them mention Susan Wiles. So it's like, yeah, okay, you know.

we can we can role play at what's going on, or we can just kind of actually go deeper into the coverage and understand a lot of what who a lot of the more players are. And and I've seen some people who I like and whatnot kind of like

Yeah, I'm not gonna offer to take, you know, was the right decision wrong decision. I just wanna kind of surface sort of like what what's sort of known. Um, and I've seen some people who've offered some takes that I think are very naive about what's actually kind of going on in the sense of like, They just want to go, they'll just use the administration or the government or this. It's like yeah, there are hundreds of personalities involved and and people with many different agendas.

What it was apparent when Scott Bessett brought Anthropic back, this was back in April, said, hey, uh, went apparently went behind Trump's back, you know, Susan Wiles to sort of bring this meeting together to try to just, we need to sort of sort this out. And, you know, the the sense was that, you know, Anthropic's an important company. It's a very key company. They're a pioneering company. They're building incredible technology. Corporate America.

uses this every day. It's become vital, like Google, like OpenAI, like Microsoft, very, very important and vital to the economy and and and a big driver there. And that's why, you know, Bessant wanted to say, hey, We we need to be involved this because this is just it's they're not just a thing that goes away. You know, they're not they're not just, you know, some crazy SF lunatics that just you can ignore.

Um yeah. They might be crazy SF lunatics, but they can't ignore it because they they build very useful technologies. So that was the effort was trying to go there. When this new wave came out of reporting on what happened. it appeared very much that when you read, you know, who was in the room and whatnot, it sounded very much like Scott Bessett was like trying to get, you know, so what happens is fast forward, anthropic, taught hype Smithos, and I think that they way, way, way overhyped, um

I I think it's incredibly powerful technology. I think that we have to be very mindful what these things can do. I I think it is a very capable, absolutely 100% very capable model. I do think it's potentially cat capabilities and cybersecurity and bio and all that. I think all that stuff is legit. I think that if your plan is to talk about it that way, then release it as a product.

You better have everything locked down in a way that just not is gonna put everybody at ease. Cause if your job, if you first spent your time putting everybody ill at ease, using uh literally using language like nuclear secrets, nuclear weapons like this, through the mythos thing, turned out

C

For a new regulatory agency because these things are so dangerous.

B

Yeah, pushing for regular oversight.

C

That is being stated specifically.

B

While this is going on, uh it turned out that allegedly that uh uh other actors, potentially foreign actors, had access to Mythos for like like two weeks via an anthropic partner. Which showed that my argument is: if you're gonna act like it's nuclear secrets, do you have if you're gonna tell us it's nuclear, you gotta act like it's nuclear secrets. If you think that it's that level,

That means you can't just brush this off. And there was a story that they had like foreign actors may have had access for up to two weeks of the model that was supposed to be too important to keep from there. Didn't get much attention. Model, and then then we're like,

C

By the way, a good great Kimmy model debuted over the weekend.

B

It's a cat there. My God. So then we get Anthropic finally says we have Fable. Fable's a mythos class model. So what it is, argument had been made, and I said this before. When you're training models, you have what are called early checkpoints. At a certain point, you have a base model, you improve your training, then you start to do what you do, your safety. training to basically say hell how do your refusals, all these other sorts of things.

I'd said Mythos sounds like an early checkpoint. And and I will tell you, Google has early checkpoints of high much more capable models than the metrics they showed a couple weeks ago. OpenAI has very capable models that that than the that than their current state of the art. And we've seen, we'll hear the phrase internal model. You know, when you listen to the open eye podcast, you'll often hear the term an internal model. It means something there.

So, but they just don't come out and announce them until they're ready to be announced in theory. Anthropic was the one who said, Oh, you shouldn't pre-announce the stuff, but then it felt like that's it they did with Mythos and hyped it up.

So they come out with Fable, they put out Fable, Fable Smith those class, whatever. This is the one that's supposed to be the one that does this. First, it turns out that they were doing these weird safeguards. Like if you're trying to use it for machine learning research or AI research, it wouldn't just refuse. It would actively give you wrong answers by some accounts.

It would even change your work by some accounts and say that like change things in you and make the thing worse for you. And that irritated the entire developer community because it wasn't a matter of the model saying I can't do it, but they literally was getting dick.

Dick and Anthropics said, No, we told you this. They said, like we thought it degraded. We didn't really mean that you like would literally it would make the thing worse from doing that. And they had a lot of pushback from that and they had to like reverse themselves on that and say, Oh, we're gonna Try to change this policy there. So that that irritated a lot of the developer community because

Great model. Absolutely a great model. But all of a sudden people were trying to do certain kinds of research. Uh Daria's uh Daria Unmets, you know, he's a cancer reason he's a researcher, works on cancer. He was talking about all the safety guards, like he couldn't even use the model because it said, Oh, you work on cancer, you're bot you're in biology, whatever. He was livid. He

C

He's been very, very angry on Twitter, up to and including this. And Daria is a pure soul. If you have not followed him, Sweet. Please do. He's very excited about the technology and he earnestly likes to celebrate everything about it, up to and including, very endearingly, making hero cards.

B

Yeah.

C

For the world of AI researchers. So he'll use usually im image two to make these very dense. uh very cool uh little infographics about uh great researchers. And sadly he revoked Dario Amoti's hero card because he was so mad about the guardrails around uh Fable.

B

Adair is a guy trying to cure cancer. So yes. I but I I will I will I would say that people it's one thing to say we'll have a model say, Hey, we can't do this thing, whatever we'll degrade to another model. It's another thing when it behind the line. So anyhow. So we thought that was the big kerbuffle. Then apparently

Researchers at Amazon, mind you, Amazon is a major anthropic investor. They're also partners with OpenAI. You could read whatever conspiracy theory you want, but they are heavily invested in anthropic. Couple weeks ago the conspiracy was that re reportedly Amazon somebody at Amazon had used like half a billion dollars worth of tokens for Claude.

And people like, aha, they're manipulating the market to increase their share value in anthropic because this just helps bump up the c you know, the anthropic market. Well this week a bunch of researchers, you know, some excuse me, researchers are people at anthropic excuse me, at op at Amazon, pardon me, at Amazon.

Yeah. Said there was a there was a a a vulnerability in the form of a prompt that could get could defeat safeguards or certain safeguards within Fable. And then now people are saying, ah, it's because Anthropics partner with OpenAI is why there's it's just literally.

C

Amazon. Yeah, Amazon's partnering with Open AI, which is yeah, uh uh Uh th the move that comes after this is Andy Jassy, the CEO of Amazon, calls contacts at the government and makes them aware

B

And I I want to give you I want to give you credit, kind of could just context too. If you're a partner and you're using a model, okay, and your researchers come to you and say, Hey, we found an exploit. We found an exploit.

Yeah. You then have to run it by your legal, you run it by your trust and safety team and you have to decide, okay, what we do. It's not, it's not a YOLO thing. It literally people go, Well, you have to report this. You have to do this thing, and that is that is a diligence thing. And it may not have been an easy call for them to make. And I think that they deserve some credit because

This is not in Amazon's best interest right now. It is really not. It is it is you can come up with a very convoluted conspiracy theory to make it fit that, but it's really not. Not in their their their best interest for this right now. So anyhow, you go ahead, sir.

C

Yeah. And there's a lot that we don't know. There's been some reporting that this is a little bit more of a straw that broke the camel's back situation between the government and Amaz or an anthropic. uh specifically around a lot of the negotiations with the executive order that just came out uh a a week ago on what anthropic wanted, how much anthropic was pushing for more regulatory.

uh a status on uh you know not just voluntarily putting models out there uh but You know, Amazon puts the word in, and now it's Scott Bessett and Howard Lutnick that are making these phones.

B

Secretary. Yeah.

C

Uh From my point of view on the PX3 side, and I'm going to actually be interviewing the Axios reporter that um reported a lot of this out. Uh, tomorrow on the show. And I'm excited to talk to her about it because, like, my sense of it is that when Scott Bessant gets involved.

He's not doing it because of the tech stack. And he's not doing it because, I mean, with the way it was reported at the time, he was the kind of cooler head that was trying to prevail within the administration because the Pentagon was. furious. Like they were just they had absolutely had it with Dario and they didn't care if anthropic, you know, burn.

Uh uh and so Scott Besson comes in as the as the treasury secretary, somebody who has very deep ties in Wall Street. And not only, as you mentioned, Andrew, is Anthropic and Claw just a utility that is used throughout all Fortune 500 companies. There is an element of the American economy depending on a triplet of IPOs that are going to come out over the next probably eight months. One of them already came out in SpaceX and did well. Uh, and now you have anthropic and then open AI in some order.

And that... Scott Bessant wants all three of those to go well. This is a signal for the next 15 years of American technological dominance. These are going to be companies that you will hear about every single day. And they want people to have faith. that these are good to invest in. These are good for retirement funds. They are these are good for uh everything that goes along on the more financial quant side.

He tried to step in and you know we get into what he said uh uh well she said when it comes to the security researcher that Anthropa quoted, but like It was a personality thing. And that's, you know, that eventually leads to an export control.

B

Well to to to unpack that a bit too, the the uh allegedly reporting is and you'll get more details when you you speak to the access person, but as as it was reported was, there was a A leak or you know, a potential a potential prompt, a potential problem came up. There was also rumors that there had been some Chinese exfiltration. There made there was rumors of the Chinese had access to the model. I don't know if that was with the prior event or something new. So I don't know what there was.

More to it there. The stories that Chinese might have had access to it. They try to get hold of Dario. There are different accounts by this. We've heard wellness retreat, whatever, and whatnot, but we know that it was a Yeah, the thing that what's different about anthropic than open AI is open I is actually a much flatter organization. And you can get to a person, you can get to a person that can make a decision or Sam will delegate a decision to somebody else. Sam will say,

This person can handle it, whatever, and shut down the model. Like we've seen this in other ways, like Tebo, you know, resetting limits and other things. There's a lot more people on the switch with hands on the button to say, okay, let's stop this for right now. I would say that my experience at OpenAI was in a in its

That culture was way more, way more willing to discuss things internally, way more, I would say, almost cantankerous internally. And I think learned that lesson and also sort of said, like, okay, let's

let's let's let's turn the switch on this, whatever, and then sort it out, whatever. I would say here apparently was they were wanting to get hold of Dario. There was different stories about this. Why it was it how long it took to get hold of him, whatever. And people I was ninety minutes of this, whatever.

And then the apparently was like I think Bessett was trying to say, like, can we can we just shut down access to it right now and then figure this out? And that I don't know if you've seen Dario answer questions.

A

Um

B

I recommend the Bloomberg interview with him and and I think Dario's an incredibly intelligent person. And I think that that when you're you're dealing the thing is I tell people when you're dealing with policymakers, you have to understand their world is policy and they have accountability and they have to deal with a different role than I do.

I've been to DC, I've been sitting there on behalf of opening. I I've talked to these people and stuff in my in in limited capacities. And I understand that I can't go into that room and be like, Well, guys, uh chill, I got this. That because

Sometimes you don't. There's not a high amount of trust between DC and the tech world because there has been numerous reasons when tech has not been forthcoming or has downplayed stuff, whatever. And because at the end of the day, it's a business. And when a business says, No, no, no, everything's fine, whatever. You got to be more proactive. You've got to give better signal than that. And I think that here it seemed like there was.

There was a frustration. And then the outcome was, and I would say, here's the two things that outcome was put the X work controls on this, whatever. Very quick to do it. And the people that brought this in were not the DOW people. This was not the same decision. This was different. There were national security people involved. I understand a lot of the NATSEC people are actually

Outside the administration. These are sometimes career people. And I think that's what can overlook too, is it's easy to go, oh, it's a political disagreement. Oh, they didn't like the researcher. I'm like, that's not how some of these other people operated. Um so we're in a sticky situation now.

C

A as we talk to you at uh six thirty PM Eastern time in Washington DC, Fable is no longer is is still not available. Uh all reporting seems to point to desires from everybody to get this done, to to make it available again. Uh there were reports. researchers or sorry, security researchers from Anthropic were flying to DC today to have meetings uh with various different uh uh people throughout the administration. Uh I found it a little

interesting that that those researchers were named. Um, you know, I think Anthropic's pretty particular about who they put out there to talk about things. They have a few voices that they want to kind of be their um Their communication channel. So it was notable that there were non-Dario uh names that were that were sent to kind of talk about this, but I I I I have a lot of I have a lot of questions. And weirdly, none of them are technical. Like I I believe every security researcher that says

Hey, this is this is something that is replicable on other models. And you can do versions of this with Opus. You can do versions of this with 5.5. I I believe all that. Really, it only underscores again the same problem I had with the supply chain designation, which is like. When you're in these rooms

You gotta be able to get get through these conversations because there's they're only gonna get more intense. This is training wheels for for for where we're going. And uh You know, a lot of the temperature around this was turned up by one company. And it's anthropic, the one that's the one that's the that that that's complaining about government intervention now.

B

I I you know obviously I'm biased, but like I don't think it would have escalated or got the

C

We should have a we should have a we're biased about anthropic conversations.

B

Host of OpenAI Podcast and producer of OpenAI Podcast. You know, just so you know. Case those of you that are new, so I just say, yeah. Um I do, I had I had said at the time the way they're hyping mythos is going to cause problems. And this is it. It just, just, just, it's like.

Like I'm sitting here going, well, yes, what do you think was going to happen the first time there was a security issue or something? One, when you say use the word nuclear materials, which are, you know, what we have created, we've created it, you know, based with a nuclear regulatory commission, which I think is just

derailed our energy policy and our climate policy in such negative ways people can't even understand. And so part of it is it's like like because just way, way over cautious. And when you're the guy out there Saying, oh, we need to be careful. We gotta be this could be, you know, this is the impact here. And then you

Do rush it to market, do this, whatever is one thing. And then also when you're the first safeguard is can they can the government call somebody and say, Hey, can you just shut this down? And they they can't feel comfortable that you're and like

By the way, on a you know, uh background reporting says government said we don't have this problem with any other lab. They said they're they're not thinking this is gonna jeopardize the release of anything else because you have Gemini 3.5 Pro coming out, there's GPD 5.6 coming out. They've said, no, this is a unique problem with this. And I will say part of it is that Andhropic is very much top down. very much uh and and you know uh Dario the guy calling the shots And I would say

They have built he is he he and he's got a he's got a great team of people, very clever people that they have built an incredible company. But I would say that one of the things that I realized that I've made this argument about why you why people say, Oh, what about the first billion dollar solo run company? I said, You know what? Technically you might be able to do it, but you're gonna need a lot of lawyers and anything else like this because the bigger you are, the more surface area.

The same thing with AI companies, you know, early days of open AI, the idea was, oh, could this company stay around 400 employees? Is that possible? And I'm like, I don't think you can, not because it's a product thing, it's your surface area increases. And as you are talking about being very, very, you know, the driving force of technology and advancement and the thing that scares government. I've said this, you know, I've said this internally to other people that

you know, when you say that you have nuclear power to an actual nuclear power, they will take you seriously. And when they take you seriously, be concerned. And so, you know, I I think there's a world where they figure out the right balance. They understand that, you know, not every not every pushback from the government is because somebody has a personal dislike for them. That's part of the problem is that

There are people there who are trying to under like Anthropic has said, Hey, we don't understand the full ramifications, what these things could do. And then the government's like, Well, if we don't understand that, maybe we should shut it down. Like, no, no, no, this one we got. It's like,

C

The way that they fought this in the public, I think that they thought it was gonna be a replay of the supply chain designation where they got a tremendous amount of public support. Uh, they were very much able to uh juice clawed usage. uh by uh you know picking a political fight and and kind of uh making their brand a uh uh uh one that is hostile to the administration, which is very popular in certain segments of the population.

But you didn't really see that with this one. You know, you you you had a group of uh researchers from various different companies that signed a letter saying that uh this was an over punishment for something that exists in in other models and everything.

Uh, but I've been kind of surprised that the reaction has been more muted, both from the public and also from kind of baseline employees at a lot of these other companies. And and I wonder if it You know, this is now a pattern of behavior that that they get in these spats with the government and they can't get themselves out of it and and The reporting on the response time.

Of calling Dario. And let me also tie back to previous reporting from the Pentagon fight, when apparently one of the questions that was asked to Dario was We were dealing with incoming nuclear missile fire from a foreign power. Could we use autonomous targeting via Anthropic to fire those down? And Dario said, well, in that case, you can call.

Let's remember that. That was one of the things that allegedly uh infuriated the Pentagon. According to both sides of this, When DC, when the administration came calling about this. The White House says it took ninety minutes? Anthropic says it took an hour. But the fact that one of them is looked at is like, we got back as fast as we could, only an hour. And the, you know, uh White House is like, they didn't get back to us in 90 minutes.

This is a panic that we need to, you know, have an export control. Number one says a lot about East Coast versus West Coast time values, but is is, you know, that's the difference. Like their their public thing was like we got back to them as soon as we could in an hour. It's like, All right, this was you know, now how much

B

Thank you.

C

Of your reputational damage has been uh a a compromised and and You know, this is uh especially for institutional stuff. Like they're they're about to go and try to talk to all these banks about putting this in in people's retirement funds. You know, like this is this is a serious business when you're going public.

B

I I don't wanna like vague drop, but like uh I'm in signal chats with people in D C right now and people administration and stuff, you know, not not because they're like, Oh, Andrew, we need your wisdom or whatever. Um, sometimes. But there's a constant conversation of hey, what's going on? What's going on here? Whatever. And that was part of I think the concerning thing was that that there's

there for them to be like, Hey, what's going on? We need to talk about this, whatever and they finally get'em on there and it it's like, I don't I I don't know. I I think that you have to be you ha I I think that the challenges You have to be willing to empower people, not just hire a bunch of lobbyists, which, you know, Anthropic did that. They hired a bunch of conservative think tank lobbyists and whatnot and people to sort of make their case for them.

you know, which like I like I but it's like when you're talking to people deeply embedded and who have accountability, you know, whether it's head exeth, whether it's best it with the economy, whether it's people in the national security you know, organizations like you've you've gotta be a bit more embedded with them.

And and'cause that creates confidence because then it creates this idea of okay, I know them, I know the guy they're gonna send, I know this, who can I talk to over there? Can I call your I oh I already know your guy over there that I'll call and talk to, whatever. And that's I think that's the concerning thing. I think the next step is the anthro I'm glad that Anthropic's sending the team there. I think they need to think about like culturally being a bit more um you know be connected to that

'Cause it's here's the thing too, it's gonna transcend administrations. You know, we're gonna get a different administration at some point. And and I don't think all of this can be pointed out to a political difference. I think it really comes down to a structural difference.

And Dario wrote this whole essay about hobbits talking to Tree Beard, you know, the Ints in Lord of the Rings and how the Ints are sometimes too slow and to react and woo this, whatever. And I'm like, great way to reach DC. Start with a Lord of the Rings reference, bro.

C

Um because yeah, the the ends were government. Well it turns out that they sped up a little bit there.

B

They can move very fast. Out out faster than the hobbits. Okay. And I think that's the thing is this I might, you know, when they that's the thing, government is very slow or really fast, you know, and that's you have to be prepared for both sides of it.

C

I don't think that the export any export control going on any AI model is a good idea. I think it's a very, very I think it's a very bad thing to happen. Uh and You gotta understand it's a possibility. Like, like this is, especially when you turn up the heat this high on how how how damaging they are. All right. We've done a lot of talking about things that are not the tech. Uh, did you get to play with Fable while it was out?

Fable 5, ChatGPT 5.6, and Model Performance

B

Uh no. I I I I looked at it. I'm like, I don't have a good test for it. I I I believed all the really cool demos. I'm like, yep, this looks really cool.

C

Yeah. I think it's safe to say it's probably state of the art for consumer, right? For when it was when it was out. I mean, with with the understanding that the guardrails are the guardrails and and I and and just to put an underline on it, I think lying and destroying code. is an ethical line that is like, I'm glad they walked it back. I think that they still have a long way to go in terms of trust with with with certain people, but

To lie to you intentionally and maliciously destroy code, that is that's that that is that is that is a line that I think is is too far. Well.

B

Well we the and the challenge is this, is that when there was the Claude code leak where their code got on the whole thing got unfortunately for them, you know, expose entirely on GitHub and people actually see all the things in there. They saw things in there that they were doing that I thought was kind of actively smart stuff. Cause like when it came to like uh you know

China is real. China's a real place. It's not just a thing we hear about on Twitter or YouTube and people say and then then it disappears when we're not thinking about it. And they have they have spent more money than any other country on the planet trying to exfiltrate American technology and are building their own defense systems or counteroffenses to our entire military apparatus from everything from

bombers to aircraft carriers to cyber. And so and part of that is making sure they they know that US has this advantage. They don't want that advantage to to stay around and they want to do whatever they can to to to eliminate that. So one thing Xanthropic put in there was that

You know, if you were trying to do things with Claude Code that looked like distillation, like you were trying to copy the model, get the model to enough outputs that you could simulate, whatever, it would create extra tool calls. It would do a lot of other things. That was funny is that that when you looked at other models that were doing unnecessary tool calls.

Uh, they were the Chinese models in Groc. And that was a thing where it was like, well, that's a weird weird coincidence there. The OpenA models didn't the anthropics when it was trying to really do a thing didn't. So anyhow. Things like that were there. But then we got to the point where they put out mythos and then mythos is

Hey, help me figure out how to train do a training run on this. It's like great. This is the best method. And people like, this is terrible. Like it actively found the right method, then decided to do a different thing or change something somewhere else. It was disturbing because what you want, the ethical thing is a refusal, is is you have a classifier that says.

I can't do that. Like, hey, this is a limit there. And I think their fear was they didn't want to have a ton of refusals for doing tasks like working on code, whatever, but it put them in an interesting situation where C customers were paying for a thing and the model was actively making things worse or lying to them. And I don't think that's a good look. And and

You know, what are the what are the answers to it? The other answer is you just do a refusal and you just take, you just get the heat for it. You do a refusal. I can't do this. I can't be used for ML. I can't do that. Um, and give you an example of, you know, the ideological difference between OpenAI and Anthropic. OpenAI stated repeatedly that they think the the democracy, United States government, you know, democracy should be the determination of what you can and cannot do with these things.

that is not their job to say, you can't do this, you can do this, whatever. Ultimately they leave it to the democracy to decide how these models will be used. And that's controversial, you know, in in AI world in that

You know, Anthropic said, well, we think we should ultimately be the arbitrators of how these technologies are used. And I think you as a creator of a technology, you can say, I want to decide how my thing is used. That's fine. Will you create a product? I think you could, you can, you know, do Atlas Shrugged on that.

But it is an ideological difference that open eye is like, Hey, we're we will we prefer the government to come in, we prefer the dematic democratically elected government, elected officials to determine this policy, what should be done.

C

You know, the demos look great. I think it's a shame that we had both of these things happen because I was really looking forward to kind of like the week on reviews. I think that's really when you're looking at like you know, putting putting the belt on somebody of like, you are the king of the mountain, you are the state of the art.

You don't really know until about a week in when people have a kind of a fuller fleshed idea. Um, and you can see kind of a clearer path of the jagged frontier of the models and some things it's Brilliant and some things it's not great. You know, it's it seemed like there were some drawbacks where it was very argumentative in the way that many of the anthropic models have more increasingly become. But yet a lot of the back end stuff, it was apparently.

Pure magic. You know, some of the the one-shot demos of uh video game stuff and and videos that were created were were were really, really, really cool.

And then we kind of got the rest of the noise. And that's where I uh it would be really, really, really sad. It'd be really a bummer if if this kind of wild ride of you know, technology drag racing uh that we have seen in AI was was kind of over either for governmental intrusion or because there's one company that can't keep their car on the track.

B

I I I think that yeah, one is that there's a lot of a lot of positive outcomes for anthropic, a lot of choices they can do that could lead to them staying in the game, but there's also you know

Google, I I mean I think the thing is like I believe the vibes on I believe the five vibes on Fable. I believe that I think I think there's good I think I think that I will say that like I've seen some things like oh I got GPT five to do that but then I like I you know, I'm I I know I'm like insane about knowing how to prompt weird stuff and to get models to kinda do things against the way they normally want to do it, but

I believe it. I believe that people I think are sincere like really loved it and think it's a fantastic model. Um, and I'd say that we because like we know is nobody's mentioned. Gemini three point five flash right now. Like that that's that's that's the problem that Google has is Google will announce a model with really good benchmarks and then just nobody uses it because it's

C

Disappear.

B

It's bench maxed. You know, and is Myth though's a bit benchmaxed too. I think I think it's good model. I think it might be bitch maxed a little bit too for some of the performancing'cause it seems like that. But but again, I think it's a they released a a very good model. Um And I I I I know. I mean, what's your suggestion? What's the path forward?

C

They gotta find somebody. They gotta find somebody that just lives in D C. Like they just need somebody that Dario trusts that lives in DC that can be the person that walks in with their laptop whenever somebody has a question. They have to find out they have to find a way to be more responsive. And and Yes, obviously there's a personality clash with elements of this administration, but now we've looked at How many different sides of it, many of which were I mean, again. Scott Bessent, you know.

B

Is

C

I somebody I'm sure has to explain accelerationist versus decelerationist to Scott Bessett. Somebody has to explain this technology to Scott Bessett. Scott Bessett knows. This primarily as an economic matter. And there's no interest for Scott Bessent in this company not succeeding. He wants it to succeed.

B

I would argue I think that he knows that because I think he has other companies have been explaining this to him. I don't think Anthropic needs to send a nerd in the room to tell everybody and go well, actually. I think they need somebody with power. somebody there they they trust. You know, like like with when Larry and Sergey

They brought in Eric Schmidt, because when Google was very, very powerful, super, super quick, they brought in, you know. And I think that's the key is they need, I think they need to bring in uh, you know, maybe somebody we've known, a known name or somebody like this, and just say, hey, this guy

C

Well, that's the other question. The other question is. Is this the story of the layering that there is somebody that comes in that has some level of authority that can have these these conversations and You know, I think that'd be really tricky. You know, you know the anthropic culture more than I do, but like uh uh

It it doesn't it it seems like it would be very, very hard to put anybody higher on the org structure than some of the people that they have now. And it doesn't seem like any of those people can talk to the government.

B

Yeah, and I think that yeah, and if you look at the board structure, it it's not it's it's not gonna d it is very aligned for again, you know, kind of the the the Dario aligned, I'll put it that way. But I would say that a sign of credibility for him would be like

There are there are a few people like like, you know, if if Greg Brockman told Sam, Hey, we need to do this, we have to stop or we have to do this, Sam would do it. I think there's a couple of other personalities that open eye that can say, Sam, can we just do it? And Sam like go for it, do it, we'll figure it out later, whatever.

I don't I don't know that that happens at at Anthropic. And I think that's concerning is the idea that this the stories about the wellness retreat, whether or not it was just intentional, just trying to make them look more wacky and weird, but it also shows a frustration to say that.

that that is the that is the fear of the DC co you know, the the the you know, of the East Coast is that yeah, these people are weird and they're gonna off doing stuff we need when the the fit hits the Shan and like Dario said, call me and they called him and then they couldn't get'em to come up with a complete answer.

You know, so I think that if you give somebody, give somebody say, hey, you know, you've you've you've got the authority to sort of the government says you need to do this, then pull the, you know, pull the plug on it, do whatever, say yes to whatever you need to do in an emergency situation, and then solve it.

C

It just feels like this would not even be really a story if the model were offline for twelve hours over the weekend. Like, you know, that this if if from Friday night to Saturday morning. They had pulled it offline and and they had done enough.

Maybe even the exact same conversations that they wound up having with the government about what this jailbreak was and blah, blah, blah, blah, blah. Like if there was some token of goodwill to say, Hey, let's assuage your fears and then let's have a conversation. The model might be running right.

B

Yeah, I think that yeah, I think there's a world I think

C

the the pro the problem the things that has to get repaired today is truck. Because Anthropic again said, no, we're not going to do this thing. Pound sand, what are you going to do about it? And the first time it was a a, you know, a supply chain designation. And this weekend it was an export.

B

Well it it was and it was it was Amazon, Andy Jassy saying we think this is significant. And that was the problem was that that here it comes like we do we talk about the trust thing, it comes down to is that Anthraba could be right. It could be a minor, minor prompting thing. I I would say that uh

C

Yeah.

B

often these things aren't because they said they said, ah, it was a GPD, you could do the same the GPD five, whatever, like this. You might you guys might be right. But the thing is is that The argument the EA people and the argument anthropicism meant is that the remember OpenAI got criticized, you know, when the anthropic people at OpenAI and the whole GPT-2 might be too dangerous to release, ah ha ha, what a joke, whatever. The thinking was we would rather be wrong.

That we were that we were too protective, that'd be air on the other side, and be not protective enough. And here you had a case of where. Not so wasn't somebody it wasn't like Pete Hegseth said, Hey, I've decided I found I hacked this. It was Amazon, not in a a a you know, of of the big tech companies is about as

equally involved with everybody as you can imagine. It's not incentivized to F this up, says, Hey, we see a serious issue, whatever. And Anthropics like, nah, that's it's it's nothing. You know, it's like Ye you might be right, but that does not make people feel comfortable.

C

Yeah. Yeah. I think I I it's I don't know where They go because I thought that the supply chain designation was a really ugly A thing that still hasn't been unwound. I thought they were in the process of unwinding it. And now they're here. And I'll tell you what. I don't know what strike three looks like when you're talking about playing around with the government like this. Like I don't I don't know nobody in tech has pushed the line like this.

Uh, with the the the feds. Uh uh, you know, you don't normally see companies that live to tell the tale after. after two of'em. But Anthropic's massive. And again, I mean that there's a lot tied into that IPO. There's a lot tied into this company. Uh uh I think there's a lot of people rooting for it. So

I hope that they get it. I hope that they get it right because I don't I don't want to see I want to see these two companies compete. I think that these two companies compete has been one of the most phenomenal stories of the last 20 years. I think it's been great. Uh and and uh I it's made me excited about tech in a way that I have not been in in a in a in a long time. And I don't think that it's it's Ma my my interest is not in having one of them shoot themselves in the foot repeatedly.

B

Yeah. Um We all say. We all say. Lot of smart people there. A lot of smart people there. They build a great product. People love it.

C

A lot of smart people is oftentimes a blessing and a curse.

B

Yeah. Well that's that's a thing too, is I do think that I think that's a I see my my frustration with the sort of techie discourse is this. I is that they m really, really want to throw everything into an archetype. And it's a lot more nuanced than that.

C

Do we see five point six this week?

B

I would say recent events might have altered, you know, it might be a matter of not of just do you want When leading up to Operation Epic Fury, and the Pentagon was like, okay, all our ducks lined up, everything good. Who are we good? Okay, let's talk about logistics. Uh we're good. We think so. How's that working? Well, the anthropic and they're kind of raised some issues about how things are working with these stuff and

Uh we they they we're not quite sure if they will pull our model or not. Um and hey, what do you mean? We're we're can we get them to just give us an agreement? Well, no, they won't. Sam Altman says, ask it's asked, like, can open AI guarantee us access to the model? And he's like, as long as it's lawful and legal with what is allowed, you know, and is not making kill decisions and not doing mass surveillance, we're okay with that. Uh And they're like, cool. And then Sam got got ripped apart.

Yeah. For literally, you know, stepping up when the country asked, you know, to say, you know, can are we secure? Do we or do we have do we have some do we know that AI won't go away overnight if China does something crazy or whatever?

And I got criticized. I think there's I think there's an attitude at open eye. It's kind of like sometimes you just gotta st we'll see. I don't know. No idea what'll happen to me. I think me and it'd be great if they release it, but also wouldn't surprise if they're like, We don't want to jump in here.

C

Reports had pointed to this week. We have no knowledge internally, but uh reports had pointed to this week. I've heard whispers that it's been out to influencers for a little bit. Um I it would make sense to me that this would have been a date this week would have been circled on the calendar. Yeah. But I agree with your your your perspective. I I mean, if it's like

All right, do we hold it for another few days just to you know get a little bit get further away from this? But then again, also in this game, I mean

B

Okay.

C

Who's to say if you wait another uh week that something else crazy isn't gonna happen? Like maybe maybe at a certain point you just gotta I mean, like you y you know, it's like like like like teeth on a On a a clock gear. You just gotta keep going at a certain interval because they got other stuff coming out after this.

Meta and the AI Window

We shall see. Did you read the meta report? The bad vibes inside their new uh their new AI lab.

B

Yeah, I mean a lot of it's stuff I've been hearing. And I think that here's here's here's the thing. When you're working on AI is different and weird and I don't think even people in tech realize this and I've watched billions of dollars get spent by tech companies. Trying to get a leg up in AI and it floundering terribly because they don't understand something fundamental. When I build an operating system.

I can have like 30 teams, 40 teams working on something. Somebody has to keep an eye on how these things kind of integrate together like this, but they can kind of go off and do their own thing for six months, come back. You can throw more people at it. And in a time Microsoft learned the hard way multiple times, so did Apple. You will, that will just prolong the time that it takes. There's a communications effort.

When you're making an AI system, it is all the communications effort. It is a small group of people. Who have empowered to make a certain amount of decisions, run a certain number of experiments, figure out how to use compute, et cetera. One of the stories that that that we've talked about in the OpenAI podcast, Jakob Bahatsky, who is the head of uh research for OpenAI, the chief scientist for OpenAI.

One of the things that Yalka brilliant many things was pioneering in though was predicting what compute with a certain algorithm would reform, like what would happen. Like, you know, he was very good at predicting. Weird GPD forward land, etc. like this. When you're an organization like Meta, which is not a research org, they have funded research, they've had Fair, they've had a lot of different stuff like this. They didn't buy a research org like uh DeepMind.

You know, nor do they have like Google Brain, which, you know, uh they didn't buy a research org. It was a very great collection of talent like Deep mind had when they're trying they had Fair, which they brought in Don McCoon and other people there. And it was very much run like an academic program. You had to be a PhD, all this. The recipe for me was like this is

That was not gonna yield the results they thought it was. Reinvented it, brought in Alex Wayne, who I think is brilliant, built up scale AI, super, super smart. This is a completely different thing. You know, telling a bunch of engineers and researchers they gotta label data or do whatever.

That is that is going back to what you know and not how these things tend to work, in my opinion. We saw this at Microsoft at Silamon, where they brought in him to try to build up a whole competitor to, you know, a a hedge against open AI. And I'm like, I was, I was like, I think he's great at certain kinds of companies. I don't know that he's a guy that's going to run an entire research led organization. And what I want watched happen at Meta, I have friends there now that I I

I think that there will be things that they will be great at. Like I bet you that when Meta starts getting starts really built, you know, starts scaling up their data centers and start thinking about how to operate that stuff.

I think they're gonna be probably be state of the art because they think those are skill sets within theirs. When it comes into research and doing stuff, they have built some amazing stuff. I love like their segment editing models. Some of their smaller models were great, but the bigger effort.

Man, it's hard. And the mistake that the mistake that I think Zuckerberg made, and again, this man has done far more for the world than I ever will have done. So I'll make that very clear. Um Started trying to build an AI team.

C

Inspiring two movies. Anyway. Yeah.

B

Try well like I think the second stars is dad. Um, you know, tries to uh When you try to build an AI team, like you're trying to build up the roster for the the Dod, you know, for the the Dodgers. Um it's not the same thing. And and in basketball, like, you know, like like, hey, even then, you know, you you want, you know, you maybe bring in a core group of people or whatever. It was just writing checks for all these people and like

Great miracles. And it's like, no, you need a team. You need to build a team. And You know, we've seen repeatedly.

C

It feels like that's that that's what's kind of separated the frontier researchers with Dario And then at uh Open AI, obviously Dario there it for a certain time, but Ilya and Jakob is that like these guys had visions. They had visions of where it was gonna go and they can talk about the pathway that they have walked since then. And That's a lot of money. That's a lot of time. Like there's you make really, really, really, really big bets.

on each step that you walk forward. And so you gotta kind of have people that really have it all in their head uh to do it. And you gotta trust them. And it doesn't seem like they have that right now.

B

That's the hard part. That's the hard part. And Google has a tiny version of this right now between the tension between Demis and Sergei as far as what do you build role models or follow the language model approach? Both guys are brilliant. Google has a ton of money and resources to follow that. I think for Meta, Mark loves scientific research. Mark has funded things like biohub and whatnot. Mark very seriously about the scientific research aspect of it.

But I do think it kinda gets hard because when you sit down You know, opening and I spent who knows how much money training GPT 4.5. And 4.5 was a strange model, big model, very capable in some ways, but other ways wasn't. And they ended up doing a different path for the five series. And

As far as I know, no heads rolled. Nobody got fired. Nobody's like, oh, you screwed this one one up. They there probably was a very good thesis on why they thought that would work and it didn't, but also wasn't the only plan. There were other things to do that.

I don't know what that's like at Meta. I don't know if somebody comes into Meta and says, I have this grand vision to go do something that, you know, if it fails or whatever, or they're not gonna be like, hey, what we learned along the way.

C

And in the meanwhile, The bedside manner seems to be lacking. If you look at you know, you have you have researchers complaining that they're there to to catalog things, uh the employees were put on a mandatory system that's gonna track all of their keystrokes and uh uh mouse movements so they can better train their their computer usage. And the big give back after people revolted was

Don't worry for certain tasks will let you turn it off temporarily. It's like, you know, we we were talking to somebody who works at one of these lab what works at one of the the top labs, and they were like, They should have just made it optional. Just said, hey, you don't get our great computer use models that we're building if you don't opt in. Like let them let them let them opt in.

B

Well, I mean you want them to use them. I think internally it can just be like I don't know. You just say aren't you know, you just do do a leader you have they do a leaderboard for token maxing, which apparently backfired, but you could do a leaderboard for like, you know, whatever bonuses. I don't know. I it's

C

The bedside manor, uh uh certainly lacking. There was uh an essay that went viral uh yesterday about the window being closed.

that uh, you know, Dario wrote something a couple of years ago saying that by the about this point in history that the models that have kind of broken the uh uh you know uh uh uh launch trajectory that have that have uh made it out uh will be the leaders forever and it's gonna be impossible for other models to catch up and that right now you can say anthropics there, you can say open AI's there.

We'll see what Google's cooking up. They certainly have the money and the talent to do something. We'll see whether or not they can get it together. But that everyone else pretty much It's gonna be nearly impossible to be able to compete at the frontier past now.

B

Now, yeah.

C

Do you buy it?

B

Or not. I said this five years ago. I named all the players. I I did not like oh I for saw I I you know, the players I said we knew we knew that you know opening I and I would say that, you know, think about and one thing too again, I'm opening eye shell, but and listen.

Um a lot of the research stuff is downhill from what goes goes from open I to other labs. So a lot of stuff like there have been some pioneering stuff out of the lab, but a lot of the the the exodus of people from open eye to other labs.

Has been sort of a biggest thing. You know, Anthropic comes out of OpenAI. A lot of people go from OpenAI, bringing techniques and methods over to Anthropic. Um, there's some stuff, you know, coming from Google, whatever. But OpenAI, we knew, you know, early on, had a great advantage.

Um when you look at also when you look at overall capabilities of the models outside stuff and medical and math and whatever, you realize like open I really has this advantage, it kind of doesn't get talked about as much because outside of like code and whatnot, it's hard to sort of evaluate that. Um I think anthropics. In a great position.

Microsoft was because relationship with OpenI was going to be in a sort of a strong position. We knew Google. Google is really a management issue. Like Google has all the incredible talent in the world, but management management issue. The question I had asked was you know, what were Amazon and Apple going to do? Yeah. Apple turned out nothing. Finally may have built a really good on device with the the latest Siri, if it's true.

Which they could have had two years ago, uh, easily. Um, Amazon said, you know what, we're because I did I knew the problem Amazon was always going to be. Amazon culturally does not like to pay huge pac compensation packages for researchers anybody. And they were always I knew they were always gonna be in a competitive disadvantage because they were just not going to write over like hundred million dollar checks and stuff like that to bring in researchers.

Anthropical excuse me, Anthro Amazon would invest hugely in whatever. And I think Amazon, you know, has their new train you know, training processors, whatever. I think Amazon has an egg benefits from the ecosystem. XAI. Uh I just knew that that I I think that Elon's the greatest in engineer of our time, maybe one of the greatest engineers of all time, incredible businessman, what he's built there. Um I think that though every time he started

C

By the way, by the way, Elon, a very interesting dog that hasn't barked uh in in these last seventy two hours. Uh He he's he's done a lot more posting about European politics than he has about uh a company that he is getting a lot of money. for uh for for his compute on and he has obviously a close relationship with people in uh the Trump administration, although allegedly he got punched in the face by Scott Besson. So maybe there's even more complications.

B

I never heard that.

C

Yeah, there was a there was a very interesting black guy. Scott Bessant. Now two documented fights within the White House. Do not mess with Scott Bessant. Uh but but yeah, very, very now look, he had a gigantic I we haven't even talked about the SpaceX IPO. SpaceX IPO went phenomenally.

uh in its debut on Wall Street. We'll see where it goes over the next several months, but muzzle top to everybody uh for for that initial that initial pop. So maybe he was just having a good time uh, you know, watching this uh astronomical stock. Uh situation.

B

Yeah, I I I think that they're always gonna have I mean there's there's an there's an interesting wedge where because they bought they're buying cursor, which is, you know, a credible code tool and then a really crack team there, great team there. They might just crank on building the best code. Might might actually be competitive code models. Yeah. Um

you know, there's the the talk of like do they try to merge Tesla and SpaceX because part of the problem is is Tesla's the one that's supposed to be building the robots, but like powered by what? Um I don't I don't but yeah I I don't I was kind of like, unless you're Anthropic or OpenAI are already like a trillion dollar company, I don't know that there was gonna five years ago that there was really gonna be a way in for you in the game. Um, and I think part of it comes down to too is that

Both open AI and anthropic are extremely culture driven. Extreme, even I would say the culture to open I has changed a lot. But I would say that it's kind of an alignment though with the idea of, you know, well, let democracy decide kind of views on this. Um

Yeah, I mean who you know, I I I know there are people working on stuff, but there are also people like, Oh, this guy got a few billion dollars and I'm like, Yeah, that guy spends a lot of time high on a beach right now. So, you know, um maybe, I don't know. Um, but you never know. I

C

But but do you think that this is kind of outside of the realm today of like somebody with a small team can crack something? So big that it catches them up within a year, that that we're now kind of past that point of no return.

B

I don't know. I don't because I would say I was I was I'm less negative about that now than I was before because a small team using AI and using stuff in a way you can do you can do a lot of experiments. There is some really interesting thing on using AI to optimize stuff. We've seen a really here's a cool pattern we've seen.

GPD-4 comes out unofficially, how big it is, hasn't been announced. Huge model, a big model, comparatively, right? Big, big model. At the time that I worked on the release of that, if you told me that Three years later. I would be able to run a model maybe slower, but as capable roughly as GPD 4 on my MacBook at the time, a 2023 MacBook. I would be like, That's just crazy. That ain't gonna happen.

Fast forward, we have the GEMA models from Google. We have, you know, Microsoft's worked on their five models. OpenEye comes out with GPT OSS model, the OSS twenty B model. Which I think is pretty close to like, you know, running like a GPT four mini or whatever, you know, or like an it's capable of reasoning. Very capable model. It's actually the GPD OSS one twenty B model is the one that if you wanna use Cerebrus or Grock, they'll use.

A

Um

B

You can now run great models on device that I think are way better than we thought they could be.

One of the reasons is it's distill majorly the major reasons distillation is you just take a big model and say, okay, give me the right data set or thing that I need to sort of train my smaller model instead of me wasting my time on reading 5,000 versions of the Canterbury Tales. Um There are things within machine learning that I think that there's a lot of stuff on how people try to optimize training on type of text.

We've only well only kind of the really big improvements we've seen in training on text is improving the quality of text and creating and changing the order in which you train on stuff and the vocabulary size of the stuff. To my knowledge, there hasn't been a lot of fundamental stuff that we've heard about. Really conduct thinking more intelligently about the way you look at tokens, how you train upon them and trying to make more efficient ways.

I think there could be way more efficiencies out there. And could it be done with a smaller team just saying we're going to approach this very differently? Could be, you know. But you know, I've I get pitched all the time by really clever researchers say, hey, we figured out this way to be more, we're going to be more efficient than blank. And I'm like, cool. You know, the that you know, GPD for

GPD five mini is like a thousand times cheaper than GPD three. You know, you're competing with that already. There's already this trajectory, these models of of just using transformers getting better. But I don't want to be the guy that says, no, it can't happen. I'd be like, I would have to it would have to be, I'd have to be like, I can't see how it does, but that doesn't mean it can't be done.

C

It's not closed. The window's not closed, but it is closing.

B

You're you're if you're gonna compete head to head with transformer architecture, it will be really, really hard. Okay. Um unless you think about something fundamentally different, whatever. You know, and and you'll get like, oh like the deep seek Sputnik moment. I'm like, well

Well uh you know, you know, two things. One is we could have put America could have put a thing in space first, but we didn't want the mi we didn't want the army to get the credit. We're waiting for NACA before the Vietnamese to do it. Like literally Werner von Braun and his team of Americans, um, we're looking at a rocket that could have done it and told no, don't do it. Second thing is is that

There was a lot of stuff that Deep Seek did that were well known but secreted of the labs and then somehow showed up there. Uh and brilliant things there, but I'd say that there are All right.

C

Alright. Good stuff. Crazy week and I can't

Wrap-up

🎵 Music

B

I am at andrewmain.com.

C

Find Justin R. where you find Justin R. Young's on social media. Until next time, friends, we'll see you later.

🎵 Music

C

Diamond Club hopes you have enjoyed this bro.

B

Yeah.

C

Dog and Pony Show Audio

This transcript was generated by Metacast using AI and may contain inaccuracies. Learn more about transcripts.
For the best experience, listen in Metacast app for iOS or Android