🎙️ EP 126: Superintelligence Ban, ChatGPT Fails Ortho Test & Mico the AI Blob

00:00

Let's start right at the top. The ultimate stakes. Some of the sources we looked at, they cite the the absolute worst case scenario for where A .I. could go. Yeah. And it's not just, you know, job displacement. It's way beyond that human obsolescence, potentially losing civil liberties and even. A complete extinction. Heavy stuff. It really is. And that tension, that's what defines this whole moment, doesn't it? Are we building these incredible new tools that will lift us

00:29

up? Or are we just racing towards something uncontrollable, something that makes us irrelevant? That's the core question. Absolutely. And it frames what we're trying to do in this deep dive perfectly. We've gone through that stack of articles and research you gathered. Our mission is basically to pull out the key pieces, the nuggets of insight. from this paradox. So first, we'll dig into that really deep philosophical fight, why some top, top experts are pushing for a global ban on artificial

00:55

superintelligence. Then second, we'll pivot to the speed, just the blistering pace of innovation right now, especially the hardware leaps, massive jumps. And finally, we'll bring it back down to earth a bit, a necessary reality check, using a pretty tough medical exam to show where AI still has very real limits today. Okay, let's start with that call for prohibition. It really feels like a flashpoint in the whole AI safety

01:20

discussion. Definitely. There's this open letter, and it's specifically asking governments worldwide to actually prohibit developing artificial superintelligence, ASI. Yeah, and maybe we should quickly define ASI for everyone. Good idea. So ASI, it's theoretical intelligence. The idea is it would far, far surpass any human ability, like across every possible field. Exactly. And the conditions these folks are demanding before anyone should even think about proceeding, they're incredibly strict.

01:50

Extremely. The Future of Life Institute, they published a letter and they want two big things. First, 100 % verifiable certainty that any ASI created would be controllable. 100 % certainty. Wow. Right. And second, total societal consensus before moving forward. Imagine achieving that. The risks they point to, well, catastrophic, permanent human obsolescence in the economy, losing civil liberties systemically, and that

02:16

extinction risk you mentioned earlier. Yeah. And what really gives this whole debate, you know, moral weight is who signed it. We're not talking about just anybody. No, these are heavy hitters. You've got Yoshua Bengio and Jeffrey Hinton, often called the godfathers of AI, Steve Wozniak, Apple co -founder, even a current OpenAI staffer, Leo Gao. Their names alone force you to take the potential threat seriously. Absolutely. But, and this is a huge but, highlighted in the

02:42

sources. Yeah, here's the catch. Not one single major commercial AI lab signed on. No OpenAI, no Google DeepMind, Anthropic, Meta, XAI, none of them. Right. Beat. And without their cooperation, I mean, these are the companies with the resources, the talent, the data. The ban just feels kind of symbolic, doesn't it? More than actually effective. It exposes that core tension, you know, safety versus the drive to innovate and compete commercially.

03:07

And the sources also really hammer home the ambiguity problem. There just isn't a clear, agreed upon definition of what superintelligence even is. Right. So why would these labs sign a ban against something that isn't even properly defined, especially if it might limit their market edge down the line? Exactly. So, OK, given that definition problem and the fact that the big labs doing the work aren't on board. How does having the godfathers involved stop this whole safety push

03:35

from just being ignored? I think it comes down to this. The founder's moral alarm kind of outweighs the corporate push for sheer speed. It forces a conversation, even if it doesn't force immediate action. Okay, that makes sense. Their reputations count for a lot. And that brings us perfectly to the speed factor, actually, that lack of definition we just talked about. It feels almost directly related to the incredible pace of development we're seeing. How so? Well, the market competition

04:02

is just hyper intense now. It seems to be revealing some deep anxieties. Like Microsoft, they put $13 billion into open AI, right? Right. Huge investment. Yet they rushed out a nearly identical AI browser just 48 hours after OpenAI announced its new Atlas model. 48 hours. That sounds less like thoughtful innovation and more like, well,

04:25

commercial paranoia. Exactly. The sources suggest this kind of aggressive, often overlapping product launch cycle isn't really driven by deep breakthroughs sometimes, but just the fear of falling behind. And it's not just the big players either. The sources mentioned a new browser, Strawberry. Apparently it outperformed the big four AI browsers in some head -to -head tests. Yeah, Strawberry.

04:46

That probably comes down to some smart architectural choices, maybe using sophisticated agents, you know, AI systems that act autonomously to achieve goals, or maybe really optimized retrieval methods. Interesting. And we're not just tweaking browsers. We're building worlds now. Tencent released Hunyuan World. It's open source, and it takes regular videos or multi -view images and turns them into these incredibly detailed 3D worlds. Wow. Models like that. They need enormous speed. Yeah. Which

05:12

brings us to hardware. Ah, yes. The Google announcement. This felt like a really big deal in the sources. A major quantum chip milestone. It's staggering. It apparently runs 13 ,000 times faster than the current top supercomputers. 13 ,000. Whoa. Just take a second to imagine scaling that. That 13 ,000x speed applied to, say, a billion queries across the globe every single day. Yeah, that

05:38

kind of leap in acceleration. Yeah. It just fundamentally changes the timeline for everything, solving huge problems maybe or potentially getting to ASI much faster than we thought. Right. That level of raw computational speed really underpins this whole race, doesn't it? It absolutely does. And the money flooding in reflects that speed, too. Found AI, a company that just hosts these big models, they just raised $250 million. The company is valued at over $4 billion now. Incredible.

06:03

And the human effort matches the capital, apparently. Top researchers at all the big places, working 80, even 100 hours a week. It's an insane pace, which leads to a question, right? If you look at this speed, the funding, these overlapping launches. Are these rapid releases a sign of genuine, deep innovation happening constantly? Or is it more about, you know, unsustainable market pressure just to keep up appearances?

06:29

Based on the examples like that Microsoft browser relaunch, the evidence seems to lean heavily towards hyper -competitive market pressure driving these quick, sometimes redundant orgesses. Yeah, that feels right. Mid -role sponsor, red placeholder. Okay, let's ground ourselves again. Contrast all that speed and theoretical potential with the practical limits of AI as it exists right now. The sources had some fascinating data on this. Researchers tested CHAT -GPT against actual

06:55

orthopedic residents. Yeah, on the OITE core exam. It's a tough, specialized test. And the results. Pretty sobering, actually. A real reality check. Definitely. CHAT -GPT barely kept up with the first -year residents. That's PGY -1, PGY -2 level folks. Right. And it significantly underperformed the national averages for all resident levels they tested. The difference was statistically significant, too. And there were a couple of key limitations causing that, right? First, the

07:19

exam uses a lot of images, x -rays, scans. Which text -based models like ChatGPT still really struggle to interpret in context. They're not visual systems at their core. Exactly. And second, there's that really crucial failure mechanism that tells us a lot about how these large language models actually work. Yeah, explain that bit. It seemed important. So the sources explain they're autoregressive. Fancy word. But it just means they predict the next word based on the previous

07:48

ones, sequentially. Okay. So if the model starts down the wrong path, makes a conceptual mistake early on, it can't really go back and rethink its whole approach like a human doctor might. Ah, I see. It just keeps building on the initial error. Pretty much. It compounds the mistake instead of correcting it. That's a really clear boundary of the current tech. It's not how human diagnostic reasoning works. That failure mechanism, that feels like essential knowledge for anyone

08:15

using these tools, doesn't it? You can't just blindly trust the output. Absolutely critical. But, you know, balancing those limitations, AI still offers huge potential for learning, especially as a teaching tool, a didactic tool. Like that hack mentioned in the sources. Yeah. Using AI to organize massive amounts of information, like sorting through endless YouTube videos to create a clear curriculum, or even personalized audio lessons. It provides structure, which simplifies

08:42

learning complex topics. You know, I have to admit, I still wrestle with prompt drift myself sometimes, especially when I'm trying to learn about a totally new field. So I really appreciate tools or resources that help simplify how these underlying systems actually function. Me too. It's easy to get lost. And to get the most out of AI, understanding those foundations is key. The source has actually highlighted three core concepts that underpin a lot of the most effective

09:08

tools now. Okay, let's define those clearly. Sure. First is R -RAG. That's Retrieval Augmented Generation, where the AI searches trusted knowledge first to find answers. Okay, so it's not just making things up. Right. Then there's LORAO. That stands for Low Rank Adaptations. Basically a way to make fine -tuning models or personalizing them much faster and more efficient. Got it. And the third? Agents. We mentioned them briefly before. AI systems that act autonomously to achieve

09:35

goals. They can take actions, not just generate text. Okay. Rag. LoRa, agents. Knowing those makes some of the practical applications seem less like magic, like that finance guide we saw. Yeah, the one showing five beginner -friendly ways to use free chat GPT for pretty sophisticated stuff like trading analysis or figuring out position sizes. Understanding R or how agents might work

09:58

there makes it more transparent. So given those clear performance limits we saw in, say, orthopedics, a specialized high stakes field, how important is it really for the average, you know, infirmed user to grasp fundamentals like RAG or LoRa? I'd say it's pretty important. Understanding those foundations helps you critically evaluate what the AI gives you and use the tools effectively rather than just blindly trusting whatever answer pops out. Right. Knowing the how helps you judge

10:23

the what. Exactly. So if we try to synthesize this whole paradox we've explored today. Let's recap the big picture. On one hand, you have the AI pioneers, world leaders, raising serious alarms about a potential superintelligence, calling for bans, citing legitimate extinction level risks. That's the theoretical high stakes end. But then the current operational reality, it's defined by these incredible quantum leaps in speed hardware running 13 ,000 times faster.

10:54

Very specific, very real technical limits proven by things like that. AI failing a specialized medical test. So this huge gap between the theoretical danger and the immediate practical hurdles. Exactly. The tension there is just palpable. You can really feel it. Which leaves us and you. with a final kind of provocative thought to chew on after

11:13

this deep dive. If the biggest labs, the ones with all the power and resources, refuse to even properly define artificial superintelligence, let alone agree to ban it, does that 13 ,000x speed increase from Google's quantum ship just push us closer to some unknown, potentially uncontrollable threat, faster than we can even agree on what the word superintelligence actually means? Beat. Something to think about. Thanks for digging into this critical landscape with us today. Out to your own music.

Transcript source: Provided by creator in RSS feed: download file

Episode description

Transcript