Anthropic’s Conway and the Future of Agentic AI

Speaker 1

00:01

Welcome to the sentient Code, where intelligence is engineered, autonomy is emerging, and a line between human and machine grows thinner. Each episode, we decode the algorithms, explore the robotics, and examine the ideas shaping the future of artificial minds.

Speaker 2

00:23

Picture this. You open your laptop, you fire up your browser, and you pull up your preferred AI interface. Right and there it is the little textbox staring back at you, just waiting exactly, the blinking cursor. It is waiting for you, waiting for you to set the context to engineer the perfect prompt. The guide it step by step through.

Speaker 3

00:43

Task, and it is phenomenally capable. But until you hit enter, it's essentially frozen in time.

Speaker 2

00:49

Yeah, it's a stateless entity completely. But now what if you never had to type that initial prompt again? What if the AI was already running, maintaining state and making decisions in the background while you were fast asleep.

Speaker 3

01:00

I mean, it completely flips the dynamic. We are conditioned to think of these models as highly advanced conversational calculators.

Speaker 4

01:08

You know, input, output and done exactly.

Speaker 3

01:10

You provide an input, you get an output, and then the system just goes dormant, but shifting from a reactive paradigm to a proactive, stateful one, it changes the entire foundation of human computer interaction.

Speaker 4

01:23

It's a huge leap, it really is.

Speaker 3

01:25

It moves the AI from a tool you wield to a colleague you manage.

Speaker 2

01:30

And that is the massive shift we are unpacking today. We're looking into Anthropics, highly anticipated project that's currently sitting in internal testing as of early April twenty twenty.

Speaker 3

01:41

Six, right code named Conway.

Speaker 2

01:43

Or Claude Conway. Yeah, yeah, And the mission here is to understand how we are moving from passive chat interfaces to always on, persistent digital coworkers.

Speaker 3

01:53

Which is a fundamental redesign of the technology.

Speaker 2

01:55

It is because honestly, giving software the autonomy to act without my constant supervision, well it sounds incredibly powerful, but also like a fantastic way to rack up a mass of aws bill Oh definitely, or you know, trigger a PR disaster if it just goes completely off the rails.

Speaker 3

02:12

Both are incredibly valid concerns and I think they speak to why this transition is so complex. To really grasp why Conway is generating such intense internal buzz at Anthropic, we have to look at the boundaries of our current architecture.

Speaker 2

02:25

The limitations of what we have right now.

Speaker 3

02:26

Exactly right now. Even with powerful models, the session is ephemeral. When you close the browser tab, the context window just drops.

Speaker 2

02:34

It forgets everything.

Speaker 4

02:35

Right.

Speaker 3

02:35

The system doesn't remember what you discussed yesterday unless you manually feed that data back into a new session. So the bottleneck isn't the intelligence or the reasoning capabilities of the model. The bottleneck is amnesia amnesia exactly. It requires constant human initiation just to maintain momentum.

Speaker 2

02:55

Okay, let's unpack this because I think the best way to visualize the difference is to look at organizational structures. Okay, I like that right now. Interacting with an LM is like having access to the world's most brilliant reference librarian. Sure, but to get any value out of them, you have to physically walk up to the reference desk, articulate a highly specific query, wait for them to fetch the materials, and then you have to synthesize it yourself.

Speaker 3

03:20

And if you want to follow up, you're walking right back to the.

Speaker 2

03:22

Desk exactly Conway. On the other hand, sounds like hiring a dedicated chief of staff. Yes, someone who has their own desk, who knows your ongoing priorities, and who initiates the research before you even realize you need it.

Speaker 3

03:36

That maps perfectly onto the architectural shift here. Conway operates essentially as an AI operating system.

Speaker 2

03:42

An operating system, right.

Speaker 3

03:44

It's built around the claud For family of models. But it is an environment rather than just a chat in our face. The engine driving this autonomous chief of Staff relies on an incredible scale of memory.

Speaker 2

03:55

How big are we talking.

Speaker 3

03:56

We are looking at a context window capable of handling upwards of one million tokens.

Speaker 2

04:01

Wait a million, Just to put that into perspective for a second. A million tokens is roughly equivalent to holding the entire Harry Potter series plus the Lord of the Rings trilogy in active working memory simultaneously. Right.

Speaker 3

04:13

Roughly, Yes, it's massive, and that massive context is what enables this long horizon reasoning. Okay, but the key mechanism here isn't just holding a lot of text. It's how Conway handles persistence without constantly retraining its neural network weights.

Speaker 2

04:27

Because retraining constantly would be computationally.

Speaker 3

04:30

Impossible, exactly, It would cost a fortune and take way too long, So instead Conway uses that massive context for advanced in context learning. It maintains a continuous running.

Speaker 2

04:42

Log like an internal scratch pad.

Speaker 3

04:44

Yes, exactly like a scratch pad, across hours, days, or even weeks. Before it goes into a dormant state, it summarizes its current state and writes it to its memory. Oh wow, And when it wakes up, it reads its own journal. It remembers the actions it took yesterday, and crucially, it learns from the outcomes.

Speaker 2

05:03

So if an API call failed on Tuesday, it's.

Speaker 3

05:06

State vector reflects that and it will automatically attempt an alternative routing on Wednesday.

Speaker 2

05:11

Which is brilliant, but it naturally brings up a huge logistical hurdle, right, because if this system is acting as my chief of staff running twenty four to seven in the background, it needs to interact with the digital world. And my immediate reaction is wait, if I am not hitting enter to trigger a prompt, how does it know when to wake up?

Speaker 3

05:29

That's the big question.

Speaker 2

05:30

Yeah, and AI just randomly executing tasks in the background sounds like pure chaos.

Speaker 3

05:34

What's fascinating here is how they have architected the environmental awareness. It doesn't rely on a constant, expensive polling loop.

Speaker 2

05:42

Where the AI is awake twenty four to seven asking should I do something now? Should do something now?

Speaker 3

05:46

Right? That would be incredibly inefficient. Instead, it operates entirely on a ven driven triggers. Okay, think of it like setting up tripwires across your digital ecosystem. Conway stays at a highly efficient, dormant state of passive monitoring until an external event physically wakes it up and hands it an objective.

Speaker 2

06:06

So a tripwire would be something like a VIP client sending an email with the word urgent, or maybe a database flag getting flipped.

Speaker 3

06:14

Yes, exactly does it.

Speaker 2

06:16

Hook directly into those systems to listen for that?

Speaker 3

06:18

It integrates deeply into your workflow. It could be a new pull request opening on GitHub, a sudden calendar alteration, or a massive spike and user traffic on your server.

Speaker 2

06:28

Got it.

Speaker 3

06:28

When that specific programmatic condition is met, the system receives a payload, Conway activates, reads the new information against its persistent journal, and then executes the pre defined action strategy.

Speaker 2

06:40

But hold on, if we connect this to the bigger picture of enterprise security, that actually sounds terrifying.

Speaker 3

06:44

I'm sure it departments are sweating.

Speaker 2

06:46

Because if Conway is just waiting for a signal from the Internet to wake up and start executing complex autonomous tasks, couldn't a bad actor just spoof an email or fake a server request. They could effectively hijack my a I co worker by sending it a malicious trigger.

Speaker 3

07:02

You're hitting on the core vulnerability of any event driven architecture, and it's exactly what Anthropic had to engineer around. Conway relies on highly secure webhooks to listen for these triggers, okay, and it enforces strict cryptographics signature verification. Specifically, it utilizes x hub signature two hundred and fifty six headers for all incoming payloads.

Speaker 2

07:24

Okay, x hubs signature two fifty six. So it's acting like a cryptographic bouncer at the door.

Speaker 3

07:29

That's a good way to look at it.

Speaker 2

07:30

And I'm assuming this isn't just checking a basic password.

Speaker 3

07:32

Far from it. It's an unbreakable mathematical seal. When a signal comes in. Let's say your inventory database sends a webhook saying stock for item A is zero, right, that payload is hashed by the sender using a complex algorithm and a secret key that only your server and Conway share.

Speaker 2

07:49

Okay, so they both had the key, right.

Speaker 3

07:51

The sender attaches that hash to the message. When Conway receives the payload, it performs the exact same mathematical hashing process on.

Speaker 4

08:00

The data, and if they don't match, If.

Speaker 3

08:02

The resulting hash doesn't perfectly match the one attached to the message, Conway drops the request immediately.

Speaker 2

08:07

It doesn't even read it.

Speaker 3

08:09

It won't even wake up the language model to read the prompt. It ensures the agent only ever responds to authenticated, untampered sources.

Speaker 2

08:17

Okay, so the cryptographic bouncer lets the trusted signal through. Conway wakes up, it reads its journal, and now it has to actually do the work. Yes, and from what I understand, it doesn't just quietly use APIs behind the scenes. It can actually browse the visual web. It can which, as someone who currently uses APR automations to run my life, I swear if a vendor changes a single pixel on their website or renames a CSS class, my entire automated workflow shatters into a million pieces.

Speaker 3

08:46

And that fragility is exactly what Conway is designed to bypass Conway features native browser automation, but it doesn't rely on brittle dom scraping, where it just blindly looks for a specific line of code on a page. Instead, it uses compute to visually parse the layout of a site,

09:02

much like a human does. Oh wow, So if a competitor radically updates their pricing page, throwing off all your static webscrapers, Conway can navigate to the new url, visually identify the pricing tables regardless of the underlying code changes, extract the new data, compare it to your internal metrics, and draft a strategic response.

Speaker 2

09:22

So it's literally acting like a human analyst clicking around on Chrome, adapting to visual changes on the fly.

Speaker 3

09:28

Yes, and to exponentially scale that capability, developers can build custom extensions specifically for.

Speaker 4

09:35

Conway extensions, like browser extensions similar concepts.

Speaker 3

09:38

These are packaged in a proprietary format dot CNW, dot.

Speaker 2

09:41

Zip, CW dot zip. Okay, so we are looking at an ecosystem purpose built for an aiagent rather than a human user.

Speaker 3

09:49

That is the intended architecture. Just as browser extensions give you custom UI tools or block ads. These dot CNW dot zip files allow enterprise developers to build d native integrations into Conway's ecosystem, so.

Speaker 2

10:03

I could build one for my specific workflow.

Speaker 3

10:05

Exactly, you could install an extension that grants Conway highly specific, authenticated access to your proprietary HR software or maybe your AWS back end. It creates a standardized way to give the AI new skills without having to rebuild the entire agent from scratch.

Speaker 2

10:23

Here's where it gets really interesting, because Anthropic didn't just wake up one morning and decide to build a persistent browser, controlling, cryptographically secure AI out of thin air.

Speaker 4

10:33

No, this has been a long time coming, right.

Speaker 2

10:36

You can trace the development of this over the last year. Building an ecosystem where an AI can safely use custom extensions means the AI first had to learn how to interact with computer systems at a base level. Yes, we saw this start with claud code, which was very terminal based manipulating files for developers. The major pivot point was the transition to claud Cowork earlier this year.

Speaker 3

10:55

The January twenty twenty six research preview of claud Cowork was an absolutely vital stepping stone. Yeah, it was their first real foray into an agentic environment designed for general knowledge workers rather than just software engineering.

Speaker 2

11:10

Cowork was impressive. You could give it a high level goal like take these five raw data exports, clean them up, and build me a quarterly review presentation.

Speaker 4

11:18

And it would do it.

Speaker 2

11:19

It would navigate your files and build the deck. Yeah, but it still felt constrained. It felt like handing a project to a brilliant intern who legally doesn't have the authority to sign the checks.

Speaker 3

11:31

That's a great analogy.

Speaker 2

11:32

Like it could do the prep work, but it couldn't finalize anything.

Speaker 3

11:35

But that limitation was by design. Cowork was intensely gold driven, but the architecture heavily enforced human in the loop oversight.

Speaker 2

11:43

Right.

Speaker 3

11:44

The system could do the heavy lifting of data synthesis, but the consequential actions, sending the final emails, committing code to a production environment, executing a financial transfer, those remained gated behind user approval.

Speaker 2

11:57

So you were still the bottleneck.

Speaker 3

11:59

You are the supervise acting as the final security checkpoint.

Speaker 2

12:02

But Conway drops that requirement. It upgrades the intern to full corporate signing authority.

Speaker 3

12:09

It does, which is a massive leap in trust. Conway takes the baseline capabilities of cowork and embeds them into this continuous, persistent loop we've been discussing, right, But to do that safely, to give it that signing authority, and PROPIC had to build an internal regulatory system. They utilize what is known as managed agents infrastructure.

Speaker 2

12:30

Okay, wait, so it's not just one massive AI brain handling the execution and the oversight simultaneously.

Speaker 3

12:36

No. Relying on a single model to police itself during a complex, multi day task is incredibly risky. Managed agents infrastructure involves deploying supervisor agents.

Speaker 1

12:47

Ok.

Speaker 3

12:47

These are specialized, highly efficient models whose sole function is to audit the primary worker agent.

Speaker 2

12:52

Like an internal affairs department.

Speaker 3

12:54

Exactly as Conway executes a task, say researching competitors and updating your database. The supervisor agent runs parallel inference just watching it. It constantly evaluates Conway's state vector and scratchpad to ensure the worker isn't caught in an infinite loop, that it isn't hallucinating data, and that it isn't violating its core system.

Speaker 4

13:14

Constraints, and what happens if it does.

Speaker 3

13:16

If the supervisor detects an anomaly, it can issue a system level halt command to the worker agent and Conway orchestrates this entire hierarchy autonomously.

Speaker 2

13:26

Okay, so we have the architectural foundation, we have the million token memory acting as a persistent journal, the cryptographic webhooks waking it up, the visual browser control adapting to changes, and the supervisor agent's acting as an internal audit team.

Speaker 3

13:41

That's the full package.

Speaker 2

13:42

Let's bring this down to earth, right to the listener's desktop. What does this actually look like in practice? If I deploy Conway as my digital chief of staff, how does that fundamentally change my Tuesday workflow?

Speaker 3

13:53

Let's apply it to a real world business intelligence scenario. Okay, perfect, Imagine your company's global supply chain data every night at midnight. In a traditional setup, a human analyst logs in at nine zero am, spots a strange anomaly in the European numbers, spends three hours cross referencing shipping logs, and finally presents a preliminary report after lunch.

Speaker 2

14:14

Right, half a day is gone just identifying the scope of the problem exactly.

Speaker 3

14:18

But with an event driven.

Speaker 2

14:20

Setup, the midnight database sink is the trigger.

Speaker 3

14:22

Yes, at twelve zero one am, Conway wakes up and reads the new data. Its supervisor of agents ensure it stays on task. It detects a fifteen percent drop in European fulfillment speeds.

Speaker 2

14:33

And because it has browser control.

Speaker 3

14:35

It autonomously opens its native browser scans regional European news outlets, identifies a localized wildcat strike at a major shipping port, cross references that with competitor inventory levels, and calculates the projected impact on your Q three margin.

Speaker 2

14:49

While I'm sleeping.

Speaker 3

14:50

By three point am, it has synthesized the root cause, drafted a comprehensive mitigation strategy, and pushed a Slack message to the executive channel. When you wake up and pour your car. The crisis hasn't just been identified, the strategic analysis has already finished.

Speaker 2

15:04

That level of leverage is unbelievable. It completely eclipses static automation platforms. We are moving from if X happens, trigger why to if X happens, figure out why, understand the context, and execute the best possible solution.

Speaker 3

15:19

It replaces rigid logic with dynamic judgment, and.

Speaker 2

15:22

That dynamic judgment is the key to scaling complex operations, but it is also the source of the most significant risk.

Speaker 4

15:31

Yes, it is right, I have.

Speaker 2

15:33

To play Devil's advocate here because listening to this, I can't help but think of the Sorcerer's apprentice.

Speaker 3

15:38

Oh that's a good comparison.

Speaker 2

15:39

You know, Mickey Mouse in Chance the Broom to carry the water falls asleep and wakes up drowning because the automated worker lacked the contextual judgment to know when the job was actually done. When you grant an AI system unsupervised autonomy over days or weeks, what happens when a micro error occurs on day one? Doesn't the probability of failure approach one hundred percent over a long enough timeline.

Speaker 3

16:01

This raises an incredibly important question, and frankly, it is the primary reason Conway remains an internal testing. The reality check on autonomous agents is severe. The first critical vulnerability is exactly what you're pointing to, reliability and the mathematics of compounding hallucinations.

Speaker 2

16:17

Because if it's acting on its own journal entries, a hallucination becomes a false memory that it treats as fact.

Speaker 3

16:22

Yes, in a long horizon execution, an AI might make a minor incorrect assumption during hour two of a seventy two hour workflow. Let's say an agent has a ninety nine percent success rate per individual reasoning step.

Speaker 4

16:36

That sounds excellent it does.

Speaker 3

16:38

But over a sequence of one hundred autonomous steps, that one percent error rate compounds, resulting in roughly a thirty six percent chance of task failure. Wow, by our forty that tiny initial assumption has completely derailed the workflow, and anthropics internal research highlights a fascinating psychological hazard here, the autonomy paradox.

Speaker 2

16:58

Let me guess as the system proves it can handle the work humans completely.

Speaker 3

17:02

Check out precisely the issue. As Conway demonstrates competence, users grant it more independence and check the audit logs less frequently.

Speaker 2

17:09

It is very similar to the self driving car problem.

Speaker 3

17:11

Exactly, you trust the autopilot so implicitly that you stop watching the road, which is exactly the moment you need to intervene.

Speaker 2

17:18

Right.

Speaker 3

17:18

The data shows that even with supervisor agents, edge case interruptions where the system requires human clarification are still common. If the human has grown complacent, the system either stalls indefinitely or worse confidently, hallucinates.

Speaker 2

17:34

A path forward, which leads to the second massive reality check. Privacy and control a huge issue because if Conway is going to act as a chief of staff and draft that supply chain report at three point zero am. It needs access to a terrifying amount of data. It needs everything deep access to my local files, my Gmail, my company's Google Drive, are internal Slack channels, my calendar, basically my entire digital brain.

Speaker 3

17:58

And giving an autonomous agent that level of lateral access demands intense encryption and airtight compartmentalization. Yeah, enterprise IT departments are going to require mathematical certainty that Conway won't accidentally email a draft of upcoming layoffs to the entire company while trying to you know, optimize.

Speaker 4

18:16

Your HR folders right.

Speaker 2

18:17

That would be a nightmare.

Speaker 3

18:18

The audit trails for these actions have to be flawless and instantly reviewable.

Speaker 2

18:22

And if you combine that deep access with the dot CNW dot zip extensions we talked about earlier, the security implications are wild. We're opening up a massive new attack.

Speaker 3

18:32

Surface without question, even with x hub signature cryptographic signing on the triggers. Anytime you allow third party extensions to dictate internal actions, you introduce severe risk.

Speaker 2

18:44

Because someone else wrote that code.

Speaker 3

18:46

A poorly coded or intentionally malicious dot CNW dot zip extension could act as a trojan horse, granting an attacker backdoor access to the agent's memory or its authenticated API keys. WOW, prize grade governance, strict extensions, sandboxing, and continuous monitoring are going to be absolute prerequisites before a system like Conways's wide commercial deployment.

Speaker 2

19:08

So what does this all mean for us? Looking at the trajectory from stateless prompts to this stateful persistent architecture, it is clear we are standing on the edge of a completely new era. We really are the age of the passive interface. The blanking cursor waiting patiently for our instructions is officially fading. We are transitioning into an era where AI isn't just a tool we pick up and put down.

Speaker 3

19:31

Right, it's becoming a continuous presence.

Speaker 2

19:33

We're entering the age of proactive, persistent companions, systems that act like peers, that manage their own workflows, and that keep the lights on long after we have clocked out.

Speaker 3

19:43

It fundamentally redefines the concept of digital leverage. You are no longer just augmenting your personal typing speed or your individual research capacity. You are essentially managing an artificial workforce that can scale indefinitely. The technical mitigations, the supervisor age, the cryptographic security, the persistent context loops that they're all falling into place.

Speaker 2

20:05

But the societal and organizational impacts are entirely uncharted completely. Yeah, the technology is one thing, but how our legal and social structures adapt to it is a completely different.

Speaker 3

20:15

Puzzle, and that leaves us with a critical unresolved tension to consider as this technology moves toward public release. When your AI agent is operating with full corporate signing authority, making autonomous decisions in the background, the lines of accountability become dangerously blurred.

Speaker 2

20:33

That's a scary thought.

Speaker 3

20:34

If your AI employee, acting on a broad directive to optimize your operational costs, inadvertently violates a vendor's terms of service, scrapes a protected database, or accidentally commits corporate fraud. Who is legally liable? Do the authorities hold the developer accountable for a failure in the supervisor agents, or do they hold you, the human manager, responsible for the actions of

20:56

a piece of software living on your hard drive. Where does the liability of the tool end and the liability of the user begin

Transcript source: Provided by creator in RSS feed: download file

Episode description

Transcript