¶ Anthropic’s New Frontier
Ejaaz: Four days ago, Anthropic asked the world's leading AI labs to slow down their Ejaaz: AI research out of fear that the AI models were getting so good that they would escape human control. Ejaaz: Now, just yesterday, the same company released the most powerful model the world Ejaaz: has ever seen, but it comes with a twist. It's one model, but there's two versions of it. Ejaaz: Claude Fable 5 is the model that everyone gets to use.
Ejaaz: It's a class that sits above Opus, but it's heavily restricted. Ejaaz: It comes with a lot of safeguards around it because version 2, Ejaaz: called Mythos 5 is the unrestricted version which poses itself as a higher cybersecurity Ejaaz: risk than its predecessor, Mythos Preview. It also Ejaaz: excels at creating biological compounds, which could potentially be used as Ejaaz: bioweapons. So it's only accessible by vetted partners that Anthropic approves.
Ejaaz: So this brings us to a fork in the road where the smartest, most intelligent Ejaaz: AI model isn't accessible to everyone. Ejaaz: Typically, for the entire history of software, you'd be able to pay for access Ejaaz: to the same level of access that an institutional would. Ejaaz: But this time, the new metric isn't how smart is your AI model, Ejaaz: it's the gap between how smart is your AM model that you get access to and the Ejaaz: ones others also get access to.
Josh: Well, how amazing is it that we have Mythos now available in our apps today? Josh: I think that's the really exciting takeaway is not only is it better than the Josh: Mythos preview, but it's available inside of your app. Josh: So if you are listening to this, you can go and try CloudFable 5 right now with Josh: those two restrictions.
Josh: It won't go anywhere near biology. It won't go anywhere near cybersecurity, Josh: but this is the best model in the world and it is now available to everyone Josh: to go try out and it's like every once in a while there there's this new technology that Josh: kind of forces you to to reframe how you engage with the technology and i think Josh: that's been my experience so far using fable Josh: is that it's so different than any other model we've used that it kind of forces
Josh: you to reframe how you engage with them and to showcase that i want us to go through some demos of Josh: attempts that people have done to kind of showcase the powers and capabilities Josh: of fable 5 this new frontier model starting with a really bizarre demo in which Josh: uh dan shipper who is a prominent poster on x he recreated the library of babble Josh: and help me understand what's going on here you just because i'm seeing a lot
Josh: of visuals um and i understand that it's not a image or video generating model Josh: so how is it able to create a real world's, Josh: emulation so accurate so quickly?
¶ Examples
Ejaaz: So the greatest part about this is it took one prompt and he basically asked Ejaaz: it to read a book, the book of Babel or whatever the title of the book is. Ejaaz: And he said, I want you to read this book. Ejaaz: And then I want you to recreate one of the concepts that is described throughout Ejaaz: the book, which is this library of Babel or Babel. Ejaaz: And it did so in just under an hour, I believe.
Ejaaz: And if you look visually on the screen, it is this high fidelity 3D representation Ejaaz: of what this library looks like. And it's infinite. So you'll notice in this Ejaaz: video that he looks down, he looks up, and it's just infinite books he can access himself. Ejaaz: And he even asked it to include some of the essays that he's written himself. Ejaaz: This guy authors a bunch of analysis on AI, and he pulls open a piece that he
Ejaaz: wrote that is in the Library of Babel. The idea of the library is that it contains Ejaaz: every piece of text that has been ever written. And so he gets access to it. Ejaaz: It's a pretty cool example. Josh: Yeah, that's super cool. The other ones that I've really noticed that it accelerated Josh: in is kind of 3D world building, which is funny because you don't think of Anthropic Josh: as a world building model. It's not a world model.
Josh: In fact, it doesn't have image generation capabilities. In fact, Josh: it doesn't have video generation capabilities. Josh: So how is it creating all this realistic visual design assets? Josh: And it's just really good at math. And this begs the question is how important Josh: is it to focus on image gen on video gen if kind of at its core it could do Josh: all of this with math and what we're seeing on screen here is how.
Josh: A virtual one-to-one recreation of Yosemite National Park that was done with Josh: a simple prompt asking it to create a recreation of it. Josh: And then the model was smart enough to go off and understand the context required Josh: in order to build an accurate representation and pulled that off. Josh: It did things like it scanned the satellite imagery to figure out what the elevations
Josh: were like. It pulled topographic maps that it found to figure out specifically what the heights were.
Josh: It found imagery that any imagery that it could find about the park so that Josh: it can reference it and show you i mean look at this resuming in on a waterfall Josh: it feels like it's a one-to-one replica, Josh: lower fidelity but something that can run inside of your browser it's really Josh: impressive how far the model can go on one prompt and i think that's one of Josh: the places in which fable stands out in particular is its ability to to reason through
Josh: your requests in a way that hasn't been done before we filmed an episode yesterday Josh: that i would highly recommend listening to about how we've kind of progressed, Josh: with our interaction of the model kind of moving up the extraption layer, Josh: where first we engage with models, then agents, then harnesses. Josh: Now we're just creating these loops where the agent and the.
Josh: Underlying intelligence is smart Josh: enough to actually understand what's required to get you to your goal. Josh: And I think that's what this is such a great example of this Yosemite one in Josh: particular is, hey, I want you to recreate Yosemite for me so that way I can Josh: fly around and I can enjoy it in a one-to-one replica.
Josh: And it does all the rest for you. And I think that level of critical thinking Josh: is something that's novel with Fable 5 that we've never seen in any other model before. Ejaaz: I mean, the breakthrough that we're talking about is visual and spatial reasoning. Ejaaz: And I think it's important to explain the difference between this and another Ejaaz: favorite version of a model that we speak about a lot on this show, which is world models.
Ejaaz: Typically with world models, it recreates the physical world around us. Ejaaz: But most importantly, it understands that physical reality, understands how Ejaaz: gravity works, it understands how different forces of nature works, Ejaaz: and it applies it when an object has an action. So let's say you kind of like Ejaaz: punch a puddle of water, it splashes, the droplets come up. Ejaaz: This isn't exactly the same thing. It's still based on theory.
Ejaaz: This is still an LLM that ingests a lot of text and understands kind of like Ejaaz: how the physics works in theory and then recreates what its version of it might Ejaaz: be. And this is what we're looking at on screen. It's kind of like, Ejaaz: it's known as spatial reasoning or visual intelligence. Ejaaz: It's close to the thing, but it's not quite the same thing. Now, Ejaaz: another example that I really enjoyed was from Ethan Moloch.
Ejaaz: Ethan Moloch is one of my favorite AI researchers that analyzes a lot of these Ejaaz: new models, but he builds or tests it in really interesting ways. Ejaaz: One of these ways was he rebuilt Snake. Ejaaz: Now, I am kind of ashamed to admit how much time I spent playing this particular Ejaaz: game only because it was like the best version of Snake that we see. Ejaaz: And like I'm showing you on the screen right now, it's like incredibly high
Ejaaz: fidelity. It looks way better than the game I used to play on the Nokia that I had as a kid. Ejaaz: But the point is, it's pretty cool. It introduces new power-ups and obviously Ejaaz: like you know you can die in usual things Ejaaz: Also, forget about creating the game. Claude Fable 5 is really good at playing Ejaaz: the game itself. What you're seeing is an accelerated version of it playing Ejaaz: Pokemon Fire Red, and it completes the game in, I believe, 50 minutes.
Ejaaz: And the way that it works is it takes screenshots of the game at any single Ejaaz: point, and it basically makes a decision as to which button it wants to click, Ejaaz: which step it wants to take, and it is just kind of like spread through a software Ejaaz: run where it's able to do that. Ejaaz: Now, as a kid growing up and watching, you know, a lot of Pokemon, Ejaaz: playing a lot of Pokemon, trading the cards, this is kind of nostalgic,
Ejaaz: but also kind of scary. We were joking before we started recording, Ejaaz: Josh plays Cod quite a bit or plays a lot of computer games. Ejaaz: And I wonder about the time in the near future where you're going to be one Ejaaz: of the buying a AI agent and it might actually be better than you. Josh: Yeah, that's going to be a little traumatic for me on a personal note. Josh: Just hurting my ego that I'm losing my game to an AI.
Josh: I saw another great example on the YouTube channel, actually, Josh: where they were Mythos or Fable 5 was playing Factorio. Josh: And Factorio is a game that I really enjoy, that I've been playing for a long time. Josh: And it was doing it very, very well. And I'm like, oh, dude, Josh: you're getting a little too close to home with this. I don't love it. Josh: But it's incredibly capable. And we have another example here that shows a use case that is not a game.
¶ Demos
Josh: Instead, it is a it's so cool. Josh: So there's this guy named Todd Saunders. He's on X and he he posts this tweet Josh: saying fable slash mythos is unbelievable was on a customer call today and had Josh: Claude transcribing in the background and on screen we're showing a visual of what that looks like. Josh: As they were telling me about the features they wish their current software Josh: had, Claude was building the features in real time.
Josh: By the end of the call, I was able to show a fully working product with the Josh: exact workflow they mentioned 15 minutes earlier. Josh: Autonomous looped building triggers from a customer call. Josh: And this is one of the most amazing things about the model is that it's able Josh: to go off and do a lot of the hard work yourself, where it feels like recently Josh: you've had to kind of be in the loop. You had to continue to prompt the agent to give it more context.
Josh: And with this model, it's very easy to give it a goal and give it a verifiable Josh: outcome that it can match against the goal. Josh: And then it will just go off and do those things. So as is on the customer call, Josh: like how cool is that for a salesperson where you're listening to customers, Josh: you're listening to complaints and in real time, you're fixing their problems. Josh: You're building new software on top of it.
Josh: It's unbelievably capable. This was one of the demos that I found most interesting too. Josh: And then this final example, it's just fun for networking nerds or just computer Josh: science nerds in general. We're seeing a highway on screen with cars and buses and vans. Josh: And those cars are not random. They are actually associated with specific packets Josh: that are being pushed across the network. So it's a really fun and interactive Josh: way to visualize network traffic.
Josh: And I could imagine this being great for a lot of educational purposes. Josh: And one of the things that I did notice also on Snake Ejaz is that there was audio, there was sound. Josh: It actually generated sounds and audio too. So there's a lot of modalities in Josh: which it's actually performing pretty well. Ejaaz: Can I add a bonus Easter egg, which we didn't see on the demo, Ejaaz: but I know because I played it too much yesterday.
Ejaaz: There are power-ups that pop up, but the power-ups are software patches. Ejaaz: So if you don't hit the software patch power-up, you end up with a random error Ejaaz: in the game that you must avoid, like a random wormhole that could like suck Ejaaz: you out of the game and like cause you to lose.
Ejaaz: So like it's coding in real time but you could fix it in real time and if you Ejaaz: hit the power up it it like implements the code fix like immediately it was Ejaaz: just that's very nerdy but like very cool and very creative not something that Ejaaz: we've seen before now if you want to like move away from the Ejaaz: The toy aspect or the retail adoption of how you can use this particular model Ejaaz: there's of course a lot of enterprise use cases and the number one version of
Ejaaz: enterprises using Claude is through code specifically and there were a few examples Ejaaz: that were included in the official blog my favorite one was Stripe, Ejaaz: who did a code migration of 50 million lines of Ruby, which typically would Ejaaz: take around two months and several software engineering teams.
Ejaaz: It took them less than a day using Fable. I believe they used two to three instances Ejaaz: of Fable, but still, that's like two to three software engineers that kind of Ejaaz: worked night and day continuously to be able to achieve this. Ejaaz: I saw another example of a company which had a software engineering team of Ejaaz: Opus 4.8 working on projects that would typically take him two weeks.
Ejaaz: He can now do it in less than a day as well. So the point is there's a massive Ejaaz: leap in intelligence for coding specifically with Fable, which is the publicly Ejaaz: accessible model. Now, before we continue, Ejaaz: We also wanted to like not just look at the demos that other people have recorded. Ejaaz: We want to try our own. So we have a few prepared for you today. Josh: Okay, so I just, I actually have no idea what you've been working on with these
Josh: demos. I know you are building a demo. Please share with the class what the Josh: prompt was, what you're building and what the outputs of this thing are. Ejaaz: For sure. Okay, so one of my favorite breakthroughs with this model is the visual Ejaaz: and spatial intelligence that we referenced earlier on. Ejaaz: So what I did was I found this hand sketch, this hand drawn version of a floor Ejaaz: plan of a blueprint, which I'm showing you on the screen here.
Ejaaz: It gives you the layout of someone's home. It has a garage. Ejaaz: It has bedrooms. It's kind of clunkily drawn. It's not really high fidelity. Ejaaz: If an architect looked at this, they'd be like, this is probably physically inaccurate. Ejaaz: And then I fed it to Fable and I said, listen, here's a photo of a hand-drawn floor plan. Ejaaz: I want you to rebuild it as a single, clean, self-contained SVG that is architecturally
Ejaaz: accurate. I want you to improve it where you can, improve it in a way that can Ejaaz: like, you know, reinforce the wall structures, et cetera, like really high detail things. Ejaaz: And it produced this, what we're seeing on screen right here, Ejaaz: which is this really high fidelity floor plan. You've got the garage. Josh: Architectural grade. Ejaaz: Yeah. You've got the surface area measured in meter squared. Ejaaz: You've got the swing angle of each door.
Ejaaz: You've got the entire like kind of like layout of this entire thing, Ejaaz: including potential mock furniture, which gave me a second idea, Ejaaz: which was like, okay, I asked it, I want to purchase this sofa. Ejaaz: These are the dimensions of this sofa. Ejaaz: And I want to place it in the dining room or in the lounge. Can you tell me Ejaaz: if I can feasibly do this in this full plan or will it get stuck?
Ejaaz: Like, how do I, how do I do this? Do we have enough doors? Do we have enough Ejaaz: space to like maneuver it? Like how would this work? Ejaaz: And it said verdict, yes, but not flat. You're going to need to pivot the sofa Ejaaz: on its side and kind of like pull it in like vertically. Ejaaz: And it gave me this really cool mock-up of how I would do it. Ejaaz: It would gave me route A where I take it in through the front door.
Ejaaz: And this is kind of like the sofa that you see here, but I need to turn it on Ejaaz: its side and kind of like shift it through this gap that I'm highlighting on Ejaaz: the screen here in green. Ejaaz: Or that there's route B where I can take it in from the outside and I have a Ejaaz: two meter, very spacious door opening, which I can bring it straight into the Ejaaz: lounge and place it right there. So it's really physically accurate going off Ejaaz: the comments that we made earlier.
Ejaaz: It understands the kind of like reasoning behind physics really, really well. Josh: It's such a great companion. And this is this gets back to what we talked about Josh: earlier, which is like the most complicated and difficult part about this model Josh: is figuring out how to engage with it, what to ask it, because it's so capable of doing these things. Josh: We love making artifacts for the show as a way to share kind of the ideas that we're talking about.
¶ Benchmark Stats
Josh: And this might be a good time to get into the benchmarks of how good this model Josh: actually is relative to other models. And I was looking at this post that I Josh: saw on X showcasing in particular Claude Fable 5 versus GPT 5.5. Josh: And my first reaction is, holy shit, that's a huge leap. Josh: So Fable 5 low mode is scoring over 10%, whereas GPT 5.5 extra high is getting about 5.7%. Josh: Now, this is on Frontier Code Benchmark. This is a specific particular coding
Josh: benchmark. This is not across the board, but it gives you a sense of how much Josh: more powerful this model really is versus, Josh: all the others second coding benchmark that i'd say this is probably one of Josh: the gold standards this is what a lot of models will use to benchmark themselves against each other Josh: um this is the swb bench pro and fable 5 scores 22 points higher than gpt 5.5 which was already,
Josh: what is that 11 points below opus 4.8 so it's currently looking like gemini Josh: 3.1 pro gpt 5.5 opus 4.8 and then fable is running away with it and this seems Josh: to be the case with almost all of these other.
Ejaaz: Benchmarks i have a i have a better one for you right so a lot of people listening Ejaaz: to this might be thinking okay well i don't code i'm not a software engineer Ejaaz: why does this apply to me well there's another benchmark called gdp Ejaaz: vow which tests it against real world tasks that take human experts like knowledge Ejaaz: workers that you know do back-end admin excel sheets all that kind of stuff Ejaaz: hours to do and they compare it to the model
Ejaaz: take a look at this so fable mythos 5 Ejaaz: basically achieves the highest benchmark score it's actually almost completed Ejaaz: the entire benchmark so they're probably going to have to recreate an entire Ejaaz: new benchmark for this but basically what this means is probabilistically if you were to Ejaaz: blind test or blind pick the output work of a expert human that is really good Ejaaz: at a particular knowledge work task versus this particular model over 50% of
Ejaaz: the time, you're going to be picking this model, which is just an insane stat to see. Josh: Yeah, it's pretty unbelievable. And you have to like, I'm looking at these charts, Josh: and you have to ask yourself the question, as Anthropica saturating benchmark, Josh: are they running away with it? Like, where is OpenAI in this conversation now? Josh: I have to imagine that GBT 5.6 is coming soon.
Josh: Is it okay maybe it's better than opus 4.8 but can it can it eclipse fable no, Josh: and i mean we know that anthropic released mythos months ago so you have to Josh: assume that like they've been continuing progress and iterative development on Josh: new frontier models that are even more powerful than this and and is this is Josh: this beginning to become a runway or are they actually still competitive with
Josh: each other or maybe we just don't have enough information to tell we kind of Josh: have to see what the response from open ai is. Ejaaz: Well you look at the cadence between model releases right like what was the Ejaaz: time since 4.8 was released. It was like less than, I think, Ejaaz: 30 days ago. So the cadence is getting... Josh: Yeah, we filmed an episode on this not too long ago.
Ejaaz: Yeah, like I remember that episode, right? And we spoke about it and went through Ejaaz: its benchmarks back then. Ejaaz: So the point is, these model releases are happening faster, but the capability Ejaaz: gaps are even greater, which tells me one thing, which is we're getting closer Ejaaz: and closer to the AI models just building itself.
Ejaaz: They haven't been private about this either. Anthropic has publicly claimed Ejaaz: that they have been using Mythos Preview to build this new version that we're Ejaaz: talking about today, Fable, right?
Ejaaz: So we think we've reached a point where you could maybe call it a breakaway Ejaaz: from Anthropic, where they basically have recursive self-improvement almost Ejaaz: achieved, where the model can do all the research, figure out its own issues, Ejaaz: and build a better version of itself.
¶ The Mythos Model
Ejaaz: Now, I do want to ground us at this point in this episode, Josh, which is, Ejaaz: Fable 5 is an amazing model, but it's one version of the amazing model. Ejaaz: There is another version of this model, which is technically better than Fable Ejaaz: 5, but it is not publicly accessible. Ejaaz: It is restricted because it poses itself as a cybersecurity risk, Ejaaz: not just a cybersecurity risk, but also a bioweapons risk.
Ejaaz: It is so good at biology and chemistry that it could feasibly create compounds Ejaaz: and a biological weapon that could pose a risk to any sort of nation state. Ejaaz: And so for that reason, it is under heavy restrictions and safeguards in the Ejaaz: version of Fable where you can't get access to any of this. Ejaaz: And only vetted partners and cleared government security initiatives are able Ejaaz: to get access to this Mythos 5 thing.
Ejaaz: Now i put this to the test josh um and i i did a very simple example which was Ejaaz: um can you explain how the mitochondria works do you want to bet what its answer was Josh: Oh i'm gonna be best it's not touching that it's not touching biology yeah i Josh: want to take a second to actually explain the nuances between the models because Josh: when you say the word better i'm not sure it's better it's just more complete
Josh: one model is a complete model one model is a heavily restricted model and in the case of Mythos, Josh: it's available for cybersecurity. Josh: It's available for biology. And that's what we've seen with Project Glasswing, Josh: where they're working privately with companies to kind of fix security vulnerabilities and figure out bio. Josh: And in the case of the system card, I saw that it's accelerating some bio experiments
Josh: at a full order of magnitude, 10 times better. So it's really capable there. Josh: The compromise that we had to make in order to receive it was it can't touch Josh: bio, it can't touch cyber. Josh: So it's just as capable everywhere else. It will not do that. Josh: What happens is if you ask in the case like you did, how does mitochondria work? Josh: It will route through Opus 4.8 for that answer and then come back and give you a response.
Josh: So it is as capable everywhere. It's just don't ask about bio, Josh: don't ask about cyber because from my experience so far trying it and EGS, it seems like yours. Josh: Anytime you get remotely close to those topics, it is just completely shut down, Josh: routed through Opus 4.8 instead. Ejaaz: Yeah, I think it's it's too aggressive, personally, like, as a former science Ejaaz: nerd, I still spend a lot of time trying to digest like some of the latest scientific advancements.
Ejaaz: And like, listen, I'm not reading research papers. So I work with my best pal Ejaaz: Claude to try and figure out, you know, what the latest takeaways are. Ejaaz: Now, typically, I could slam that into Opus 4.8. And it would give me an amazing Ejaaz: summary. And I could like ask it questions. Ejaaz: Now, if I want to use Fable 5, it just simply won't read the paper. Ejaaz: As soon as it sees anything related to chemistry or biology, Ejaaz: it switches off and reroutes to 4.8.
Ejaaz: So I am not able to get access to the frontier LLM intelligence or brain that Ejaaz: Fable 5 has for me, mythos, even though, you know, my intention isn't to build Ejaaz: a bio-weapon by any means, I can't get that analysis. Ejaaz: And so that's one version of it, right? Where like it is too heavily restricted. Ejaaz: The other version of this is with the more intelligent models get, Ejaaz: it's not just going to be super intelligent in one particular vertical.
Ejaaz: Like for us, it's like, you know, research and creating artifacts and the best content. Ejaaz: It should also apply to any other profession, right? Whether you're a scientist, Ejaaz: whether you're a mathematician, and whether you are building different kinds Ejaaz: of structures or whatever it might be.
Ejaaz: The fact that it can get triggered so easily or the fact that Anthropic has Ejaaz: very heavily restricted that capable intelligence in a way that like even people Ejaaz: that have well intentions can't get access to it. Ejaaz: In my opinion, is a bit of an issue. And listen, it's V1. I'm sure they're going Ejaaz: to like release a bunch of versions of the safeguards where it like makes it a lot easier to use.
Ejaaz: But for V1, it's kind of like, I think it's overdone. It's important for people, Ejaaz: I think, to understand how these safety classifiers work as well. Ejaaz: Think of Claude Mythos 5 having an AI model or system that is watching it. Ejaaz: And as soon as one of the red flags that it's been trained on is triggered, Ejaaz: for example, anything to do with biology or chemistry, it gets switched off Ejaaz: immediately and rerouted to 4.8.
Ejaaz: Now, there's four particular categories that Fable can't get access to. Ejaaz: It is cybersecurity, for biology, for chemistry, and for distillation as well.
Ejaaz: And this is a key one which caused a lot of contention in the public ecosystem Ejaaz: when they launched yesterday, which is if you were to ask about model training Ejaaz: techniques or even just simple general questions around, hey, Ejaaz: I have this AI agent, it's pretty clunky, Ejaaz: how can I improve its harness to kind of make it go quicker or use less tokens? Ejaaz: Automatically degrades performance. And this is the key change in this fourth Ejaaz: category with distillation.
Ejaaz: Anything that Anthropic considers to be trying to derive its model to build Ejaaz: another model, it gives you intentional poor performance. And it doesn't even tell you. Ejaaz: Now, it says that this happens for 0.3% of cases, but my guess is it's probably Ejaaz: happening for higher reasons. And listen, it's completely within Anthropic's Ejaaz: right to do this. I get it. I understand it. You want to remain competitive.
Ejaaz: But it's just interesting to see when like you have this intelligence model Ejaaz: where, you know, it's meant to kind of like blossom and create and help other Ejaaz: people build different things. Ejaaz: But they're being competitive when it comes to other models, I guess. Josh: Well, we're getting to this unique intersection where like, they have mythos, Josh: they've had mythos for a little while, and they decided to keep it private.
Josh: And that was okay, and somewhat understood because it was really discovering Josh: a lot of zero day vulnerabilities.
Josh: And it seems like they worked pretty hard to figure out a way to not only improve Josh: the quality of the model but actually make it public and i guess like the question Josh: we're gonna have to start asking as these ai labs continue to create these like unbelievably Josh: uh forward-looking frontier models is like to what capacity are we just happy Josh: to have them like how much should we expect out of the labs when it comes to
Josh: delivering these models like in my case i'm pretty stoked to be able to use fable 5, Josh: And I'm not interested in biology. I'm not interested in distilling the model. Josh: I'm just like pretty stoked to do my day-to-day work with this capable model. Josh: And in that sense, it's really fun and exciting and interesting. Josh: And I think it's the start of a longer conversation.
Josh: We saw some legislation come in a few weeks ago, last week maybe, Josh: about requiring AI Frontier Labs to kind of showcase the models privately with Josh: the government to share what's coming down the line.
Josh: And I guess in this essence, I'm more excited to have the model versus not have Josh: the model and have it have these constraints in the hope that it will slowly become Josh: unwounded as they kind of improve and iterate on the quality and the kind of Josh: like security set of this model. Ejaaz: Yeah, listen, I think Anthropic is ultimately doing the right thing.
Ejaaz: I think that they can't just kind of diffuse this model to anyone and everyone Ejaaz: because malicious actors, however few they might be, will actually end up doing Ejaaz: something dangerous with this. Ejaaz: That being said, I think the subjectivity and who gets to govern that subjectivity is important. Ejaaz: Like, I can imagine a future version of an Anthropic model that isn't just necessarily Ejaaz: really good at biology or cybersecurity.
Ejaaz: It might be really good at something such as trading, right, Ejaaz: for example. And then the question becomes, who gets access to this trading Ejaaz: model that is so good that it could break the stock market? Ejaaz: And maybe if you are Citadel, who are closely aligned with Anthropica, Ejaaz: I'm theorizing here, then they get access to it, but Jane Street won't get access to it.
Ejaaz: And so it becomes this heavily-based nuance that is only dictated by maybe the Ejaaz: government and maybe Anthropica itself. Ejaaz: There was talks around like Trump taking a stake in some of these AI labs to Ejaaz: nationalize it for this exact reason, because it could pose a threat and they Ejaaz: want to have governance decisions. It just gets a little murky and messy. Ejaaz: And I think we're at the fork in the road.
Ejaaz: There's no going back at this point. We are now entering a phase where Ejaaz: The model that you have access to may not be the most intelligent model for Ejaaz: the specific thing. And listen, it may not be the thing that you necessarily Ejaaz: do on a day-to-day, but it's a lot of things that other people do day-to-day, Ejaaz: and they want to get access to this model. How that is governed, I don't know.
¶ Pricing and Compute Limits
Ejaaz: The other restriction, which I found really interesting that I noticed in the Ejaaz: footnotes of their system card or announcement blog post is on June 22nd, Ejaaz: we ceased to get access to Fable 5.
Ejaaz: Now i think this was taken massively out of context because i think the reasoning behind this is Ejaaz: it's because it depends on availability of compute so if by june 22nd anthropic Ejaaz: has more available compute to distribute to users Ejaaz: then it wouldn't be the case but on the case that it is it would shift to a Ejaaz: pay-per-usage model which means that you buy credits and if your credits are
Ejaaz: consumed you then need to buy more credits kind of like the api model is that right Josh: Yeah, according to the blog post, it says from today through June 22nd, Josh: Fable 5 is included on Pro, Max, Team, and Seat-based enterprise plans at no extra cost. Josh: On June 23rd, we'll remove Fable 5 from those plans. Using it after that will require usage credit.
Josh: If capacity allows, we'll extend the included window. After this point, Josh: when sufficient capacity allows us to do so, we aim to restore Fable 5 as a Josh: standard part of subscription plans. Josh: We intend to do this as quickly as we can.
Josh: And yeah, it sounds like Fable 5 consumes a lot of compute. When you load it up inside of the, Josh: app it says fable is the most capable model and draws down usage twice as fast Josh: as opus so you have to imagine Josh: that it consumes a lot of gpus they're clearly using those gpus for a lot of Josh: things i think the idea is to give a preview and then extend that for as long Josh: as possible or just continue to extend it perpetually based on compute
Josh: i think to your earlier point we're very much at a fork in the road Josh: when it comes to these models being capable enough to Josh: really make a meaningful impact in the world and we've spoken so much about Josh: alignment and ai safety and it's kind of been this like open-ended fuzzy thing Josh: where it hasn't really practically applied to anything that's happened before Josh: and we're finally at a moment in time in which the models are becoming capable
Josh: enough to have that conversation about Josh: ai alignment ai safety you're starting to see why a lot of the teams are taking Josh: it so seriously because it is the singular question is like answering what you Josh: just said who gets access to this model how is it going to be restricted who Josh: gets to decide that and that's why the alignment and safety conversation.
Josh: Is so important. And while you start to see a lot of the company cultures within Josh: these companies align around these different priority sets that separate them from each other. Josh: So this is, it's, it's a new day. It's a new era today. Josh: We are moving into the, the next frontier of models. Josh: It was pushed forward a considerable amount and in a way that I don't think Josh: we've experienced in quite a long time. And it's really exciting to see.
Josh: I'm very excited to play with Fable, spend some time kind of generating outputs, Josh: figuring out what it's most capable in that could help us in the day-to-day. Josh: Like for me, if they never told me it wasn't gonna do bio or cyber, Josh: I'm not sure I'd ever come across it because that's not really within the realm of uses that I have. Josh: So I'm excited to just kind of play with it and figure out best use cases for
Josh: this. In terms of pricing, what I found really interesting is CloudFable 5 is Josh: only twice the price of GPT 5.5.
Josh: I believe it's $10 per million input tokens, $50 per million out, and, Josh: gpt 5.5 is five dollars and thirty dollars out so pretty close and much more Josh: capable so if the case that it does get removed from subscriptions it is still Josh: available from api it is not Josh: as expensive as i think a lot of people thought this is significantly cheaper Josh: than what i believe mythos preview was early on.
¶ Long-Horizon Workflows
Ejaaz: Yeah it's my new favorite model i've been using it relentlessly for the last Ejaaz: 10 hours um one thing that Ejaaz: Uh is a really strong capability that it has is long horizon tasks like this Ejaaz: model is engineered from the ground up to be able to work like a dog for six Ejaaz: to 12 hours at a time on whatever project that you have.
Ejaaz: And it has this loop function, which basically says, if you come across a problem, Ejaaz: don't ask me, try and figure out yourself and do the thing, build the thing. Ejaaz: That's why we had people build world engines from scratch that we demoed earlier Ejaaz: and these games from scratch, all from a single prompt, the library of Babel.
Ejaaz: So if you have an idea or if you've been pondering on a project that you've Ejaaz: been putting on for a while, because you're like, I know I could probably vibe Ejaaz: code this, but I don't want to spend like an hour doing this. Ejaaz: Now you just need to write one detailed prompt and you should be able to do Ejaaz: this. So my prompt for the listeners of the show as we wrap up this episode Ejaaz: is get access to this model.
Ejaaz: Try it out. And I'm curious what your thoughts are on it. Do you think the restrictions Ejaaz: affect you specifically? Ejaaz: Or do you think it's a really good general purpose model and you're happy with Ejaaz: how it's presented itself. You don't care about Mythos 5 in effect. Ejaaz: And also, what other projects are you going to be doing with this?
Ejaaz: Are there any kind of demos or use cases that we haven't covered that might Ejaaz: be specific to you in your leisure or your particular work that you might want to apply this to? Ejaaz: Let us know the feedback in the comments to this video or DM us on X. Ejaaz: Our profiles are linked below. Ejaaz: We want to hear back from you. But I think that brings us to the end of this Ejaaz: episode. We have now like a new world-leading model?
Ejaaz: OpenAI is going to have to answer to this, but it seems like Anthropic is running Ejaaz: away with it. Josh, any final thoughts? Josh: That's it. This is a new, it's a new era today. Like, I feel like we should celebrate. Josh: This is a new frontier that has been pushed forward very far in an industry Josh: we care very deeply about. So it's exciting to see. I'm stoked to use it.
Josh: I'm curious to hear what the best types of prompts or use cases are that anyone Josh: who's listening has found. Josh: If you enjoyed this episode, don't forget to share it with a friend who might Josh: also want to try Cloud Fable 5 and experiment and get their feedback on how Josh: it's being used to improve their life, improve productivity, Josh: whatever use cases it may be. But as always, thank you all so much for watching Josh: and we will see you guys in the next one.
