¶ Building Superintelligence: YC's AI Initiative
How do you build superintelligence inside a company? Part of the key thing is not to just use AI as a copilot. This is the the thing where you use it as the building layer for everything. And you need to start recording all the articles. It's like a shared organizational brain. It's like the closest thing to us being able to like connect our brains.
If you frame this as a way for everyone in an organization to get better at what they do using the like collective skills and instinct of the people they work with, it's incredibly powerful. Today we have a real treat. Uh we have a special guest, general partner at YC, our partner. Pete Kuhn. He created Optimizely, which was one of the first and one of the best ways to do A-B testing for apps and websites.
And since then, he has gone on to create all of our agent infrastructure at YC. So literally all of our harnesses and how we use AI internal to YC. Pete, welcome to the Lightco. Thanks, Gary. For the last few years since Chat GPT, YC has been funding mainly AI. And we've been we've gone through like many different like versions of advice for them about how to build AI native companies that build
gone on a crazy journey with them learning all of this. I think a lot of people don't realize that internally YC is actually building and using a lot of the same stuff that we're helping our startups build. themselves and it's been I think a very powerful symbiotic relationship for us to actually be adopting these tools and like transforming our own organization. was started way, way pre AI into a super AI native organization ourselves.
And Pete has really been leading the charge for that. And so I'm really excited about this episode because I've actually been wanting to talk publicly about all the stuff that we've built internally and this is the first time that we're doing it. So Pete, perhaps to start off, you sort of go back to the beginning and like Talk about like there was a particular like moment when we really started adopting these AI tools. It was really you who got us started down that path.
¶ The Finance Team's AI Problem
Sure. Uh happy to happy to tell the story here. And it's I I like framing it that way because it it was a project that I and and a few engineers got started about a year ago, maybe a little more. But that has since snowballed into just a whole infrastructure layer that's made it possible for us to use AI internally at YC in lots. And that's actually been one of the neatest parts about this is watching the whole engineering team and and many partners also just dive in and contribute to this.
This infrastructure layer. We started building our own harness inside of YC or kind of YC. about a year ago. And Uh the original impetus for the project was some of the work that I and it and a few of the software engineers at YC were doing with our finance team. Just for a bit a bit of backstory, so YC has for as long as it's existed, as far as I'm aware, run mostly on our own software. In this era, just given us a huge Right. And so with that context, back to this this moment maybe a year ago.
We were sitting down with the finance team talking through a set of tools that we were going to build for them just to help them run through some of their finance. Booking journal entries, uh logging priced rounds, like all the sorts of things that that make YC run. I was seeing kind of two things at once. uh we you know we had this sort of loop going internally, right? Where we'd sit down with the finance team, the finance team would describe to
our software engineers, how, you know, this complicated financial workflow worked. And then the software engineers would go and build some purpose built software where there was a deterministic workflow encapsulating everything that they had been told and then hand it back to the finance team and so on. And it felt really inefficient. And then at the same time, this was right around the time when agentic tools were really agentic coding tools.
were really catching hold, right? And so you had uh kind of the first generation uh windsurf and cursor that were well established by this point. I think this right around when Claude Code was was introduced. I felt like this was giving me superpower.
Right. Um and then kind of watching this sort of old classical way of building software in YC and then watching how I was doing things on my own machine, this it just felt like a bigger and bigger divide between those things. And so the original impetus was Why don't we try to build some tools at YC that we could use to run agents that would give the finance team control over their own? Right.
remove the software engineers from this crazy loop where they have to sort of understand these complicated workflows and give the finance team the tools that they could use to encode their own workflows, not not as you know, not as Ruby, uh but as as English with prompt.
¶ SQL Access Changes Everything
Right. What's interesting is like uh we all funded companies like maybe even like two or three years ago when L LMs were out, but like identic coding wasn't a thing yet. And so the first thing actually was not agentic coding, it was L LMs for writing SQL queries. Yes. So that's what I remember from like the first versions of what you built was uh how
like good it was and how basically it rhymed with like these other failed startups that we had funded, like each of us probably funded one at some point. You know, here it was, it was working and it worked so well that non technical people uh granted very smart people from finance, but with no engineering background, could use these tools to ask real questions.
I was really surprised too, to be honest. And so that we started with this kind of purpose-built thing for finance and then rewrote it to even more of a general AT. Right. And it and it's this is now you see these all over the place now. But um I the first kind of magical moment that I had was we had this agent loop and we had a tool registry, a shared tool registry for kind of YCC. And the first tool that really was an unlock for me was I think a tool looking back that you actually built.
Jared, it gave these agents the ability to run read-only SQL queries against our database. Yes. Right? Um it was two tools actually. One was was running queries against our database, and the other one was the ability to read our model.
¶ One Database Fuels Agent Insights
I remember I built those tools and I felt a little bit like I was breaking the rules. Because initially we started with very limited tools that had very narrowly scoped domains and I kept getting frustrated because they weren't powerful enough to do the things that I wanted. And so I was like, what if we just gave the thing like access, complete access to the production database where it could just like trample on anything. Ha ha ha. And I sort of like it.
surreptitiously pushed it out maybe late at night. And it worked. And it worked. Extremely well. Right. Yeah. Perhaps foreshadowing, you know, subsequent things like OpenClaw, where it turns out that like the thing that was hampering the world was being worried about security and privacy and all the things that could go wrong. And when you like worry a bit less, you're like, Oh my God, these things are unbelievably powerful.
It's it's another really good example of this weird split between I'm at work and I'm kind of operating in this really Narrow box. And I'm at home using clawed code or or or whatever, uh open claw her and I can do anything. Right. Um and and and trying to trying to narrow that gap. So why was this so useful? This ability to run SQL queries against our database? Sounds really simple. Well, I think this is where it's important to talk about one of the big advantages that I think YC
Coming into this experiment, which is that we run on our own software. And all of that software sits on one Postgres data. that has everything that's important to YC's world in it. You know, every company that we funded, there's a companies table, there's a
There's a founder's table, right? There's tables for our financial transactions. There's tables for the notes that I leave in our little internal CRM, right? All of these functions that I think a lot of other companies farm out to third party SaaS tools. We've built our own. And as a result, we have this database with every important piece of context that I can now ask questions like, hey, show me all of the investors who invested in a space-related company in the last
Right. It just turns out when when all of that context is in one place, with a little bit of additional uh information about how the schema is laid out, an agent can go and ask any or answer arbitrary questions about about our business. That was a magic moment for sure when I first saw that. Yeah. And the cool thing for me is that it didn't just make it easier to answer questions.
dramatically increased the number of questions that we would ask and dramatically increase the the scale and complexity of the questions that we would dare to ask. Where like, you know, in the in the old days, back when we were using like BI tools. to ask to ask a question like that, you know, like what investors have invested like in space related companies, that would be like several hours of writing sequel. And so like unless it was really important, you just wouldn't bother.
Is it just another example of the you know the this instance of G Von's paradox that you get when you remove the amount of back and forth? uh between different teams in order to get a thing done, right? If if in order to ans ask some kind of complex question about YC, I have to go and knock on on, you know, the data science team's door and wait for them to get it through, you know, their backlog.
I mean, there are people out there watching this who work in places that still use it. The majority of people live in that world still and it's twenty twenty six, which is a little unfathomable actually. There's a long way to go, I think, which is which is really exciting.
¶ Beyond Single-Player Agents
question is how do companies that live in that old world could get sort of wings to move so quickly? Because our the magic for us was as you said, everything was the context was in one place that made it easy. You know, if you think about um data science uh historically, one of the first things that the Googlers had to figure out was uh big table, right? And big table was You know, instead of schema you and joins, you have one big table that um can be map reduced.
And so I think that that's happening again. And I would argue that that's happening now with um Karpathi style knowledge LLM wikis. Uh with G Brain. I mean, that's what I'm seeing anyway. Like, you know, obviously I have I have an open claw. It has uh access to lots of lots of systems. And then I'm normalizing it to my own schema that's relevant to me and the things that I care about. And it is like denormalization. It's you're taking data and you're putting it into a format that
uh is more or less optimized for open claw or Hermes agent, like that particular type of harness to be able to ask questions. And it needs retrieval, it needs rag, it needs graph rag, it needs uh you know, hybrid RRF, like there's re-ranking in there. Like, you know, all the things that everyone has learned about retrieval uh is now inside G Brain. And then when you give the agents a sold.
and it and you give it uh the data and it knows you and what you care about, like suddenly these things have insane wings. Like I just kind of can't believe how it sees around corners and you might ask a question and it'll even you know sort of interpret what your question was about and like give you a thing that uh frankly like it would take a human who really knows you well.
Mm-hmm. To answer um, all that's possible now. And so, you know, your question is like all the data is everywhere. My answer from like the open claw Hermes experience with G Brain is like Yeah, you basically have to take that you're gonna denormalize it and you're gonna put it in a format that is optimized for agent retrieval and understanding. You could wrap it in an MCP, but for whatever reason, I just like intuitively I'd be worried like
It's still sort of you know, these things are really good at working with MCP and CL like they're a little even better with CLI. It seems like you have to denormalize and do the big table thing, but you know, specifically for the agent. Looking back over the the last year and a half. Uh it feels like we're still kind of in the single player era of agents where the harnesses that have gotten really popular.
Claude Code, Codex, Pi, OpenClaw, Hermes. They're all designed to be used by a single human running And it makes a lot of sense. Right, because in that environment these these agents can do just about anything, right? And they they make you incredibly powerful. It's it's they're a lot of fun to use. I think one of the big problems uh that I don't think has been solved well yet by anybody is the multi
Harness, right? It's it's enabling that kind of superpower, but on a team or an organizational level, right? And and and that's I think. been the interesting thing to explore with the infrastructure that we've built at YC is watching which primitives that we've created that have enabled individuals and You asked the question about if you're working inside of a kind of a legacy organization
Uh uh what are the things that you can focus on uh in order to to help enable everybody at your org to use AI to to do more? Uh and we talked about kind of this common concept. And so a data warehouse where just as much of your internal important concept just turns out is extremely useful. There are many tools for connecting individual agent harnesses to You know, other MCP tools, other sources of truth. But just like a coding agent inside a monorepo, it just tends to be much more.
watching our agents operating on our single database that has everything. Tells me that there's a lot of value at least in getting all of the content.
¶ Growing YC's Shared Tool Registry
Having an internal tool registry, this is I think the other really important thing. So in the beginning, like we were talking about, it was just the whole system was really simple. It was like an agent loop and a simple tool registry and you know a few other The model router underneath. The tool registry is where most of the like YC specific stuff lives, right? The tool registry is what turns these agents into.
And we had like twenty tools at the beginning, including this magical ability to query our our SQL database. But over time Teams have added more and more tools. Every time we kind of come upon some piece of work at YC that we think could be improved with an agent, we can just add tools. And there's more than 350.
checked, right? Every team is adding their own tools. You know, I can do things like manage my office hours. Our finance team can uh you know can book journal entries, right? We can help manage the events that we run. There's tools for all of the important work that we do. And now once these all exist in in in in one place, you can make them available to these internal agents.
But you can also make them available to Cloud Code, you know, running running on on on on our individual machines. So those things above all I think were the important pieces that we built that if I were working in any other organization. I mean honestly inspired by what you guys would did with tools like this idea of skillify in OpenClaw and then actually the most important the last part of Skillify Skillify is like the
meta skill that I made in OpenClaw where it's like you just do anything in uh OpenCloud on Hermes. Hermes actually already has skillify, they call it something it's like it makes skills automatically. But the most important thing I I think is actually like plugging it into the resolver which is like your agents dot M D with like the list of things that the agents can do and then like it links to the markdown entry point that like
lets you use a tool basically. And so like this thing keeps coming up in all these different contexts. Like Claude Code has a skill. The skill registry in Claude Code is actually a result. our tool registry is actually a resolver. And then the weird thing that you have to do on top of that is actually um I have a meta skill called check resolvable that I call all the time. So I'm always like, I do something that's new or different in
uh in my agent. And then after it does it and I like it, I say skillify it. And then it becomes basically like a tool call or method call. And then I run check resolvable, which is like, you know, look at all of the other skills and uh tools that exist and is it, you know, dry, don't r don't repeat yourself, and is it uh M E C E, which is, you know, I'm embarrassed to say a McKinsey term.
for um the consultants use it for uh making really good slide decks. Uh mutually exclusive, collectively ex exhaustive. That's like how you're supposed to do slides if you're a McKinsey consultant. But it's useful because it's like an additional layer on top of don't repeat yourself dry. And like the models just seem to know what those things are. And so if you have a dry and M E C E resolver table anywhere. It's actually like the optimal resolver.
Like it's bad to have ten skills that do all the same thing. It's good to have one skill or one tool that has parameters that then let you call'em. So I don't know. I think it's like this is like the wildest time to be alive as like an applied computer scientist. 'Cause it's like simultaneous like discovery of the same useful applied concepts over and over again. And I wonder if like
when people were, you know, developing the first versions of Unix or something. It's like discovering a stack and a heap. It feels like we're right at that moment today. Like we're just coming up with the new primitives for what an agentic system actually is. And you can see it in the parallel
sort of development of like we're just trying to do a thing and it might be in clawed code or it might be in our own internal harness or it might be in open claw, it might be in Hermes. Like these things just keep coming back over and over again. Why C still? Startup School is back. We're hand selecting the most promising builders in the world and flying them out to San Francisco for July 25th and 26th to discuss the cutting edge of tech and startups. Apply now for your sponsor.
¶ The Self-Improving Dream Cycle
Yeah, it's really interesting to look at how some of the other companies that are building this stuff uh have built their infrastructure because you see a lot of these same primitives in in each of Like there's the agent loops, there's tool registries, there's skill registries. Looking at at the way that we're using skills now at YC. So if you you think of skill as a simple abstraction layer over tool. we have a handful of sort of shared skills uh that that we all have access to.
This agent system. And it's been interesting to watch. I think you've talked about this where this progression of like in the beginning you were kind of writing your own system prompts and then uh skills emerged and so you started writing your own skills and then you would start uh meta prompting where you'd uh where you know you'd have the agent write a skill exactly. Improve the prompt. Yes. Automatically. Yes.
seeing us kind of do the same progression internally where we have a couple skills and now we've gotten to the point where we have these sort of autonomous self improving loops. Right. Uh you know, and so Uh auto research from Karpathi again. Yeah. Yeah. Yeah or slash goal now in codex. Like they've they've incorporated it too.
We have this general agent that every night will go and read through all of the agent conversations that employees have had and look for uh things that could have done better and pieces of context that if it had up front it would have done more efficiently. This is OpenClaw's dream cycle. And G Brain also has a dream cycle. This is a um a a skill improvement dream cycle, but it could also potentially um read all the transcript.
and then write them back into the internal uh DB, into the internal CRM on like what we know about people and companies. Indeed. And we we there are cool examples of using transcripts actually to make these skills more effective as well. One of the shared skills that we have
uh is a skill that that partners at YC use to help our companies uh write what we call two-sentence descriptions, right? Everybody here has written hundreds of We should probably explain what a two two sentence description actually is. So a two-sentence description is a concise way of explaining what your company does in natural language that anyone will understand and why. Sounds easy, but it's surprisingly hard for founders to actually
And also no one does it. Weirdly. Weirdly, like even the most experienced founders like forget because they have perfect context. Ac interestingly, uh I now realize Y C itself is uh a context engineering uh sort of process in that like people we're frequently teaching people you have perfect context about what's going on in your brain, but great communication is replicating that same context in someone else's brain.
And that's what a two sentence pitch is. Like what is it? Like I don't even know what the heck this is. And then second part is like, is it interesting or valuable? What you know, is it worth my time? And so that, you know, when I when I teach two sentence pitches, that's my favorite way to do it is like
Do I even know what the heck this is? Yes. Because if you don't know what it is, you can't even ask a question about it. It's like something about computers, I guess, whatever. What what time is lunch again? And then the second part is equally important, which is like if I've heard that, you know, there were like
twenty companies, like there are five other companies in this room that do X. Like and then I don't understand like why this is noteworthy. Like again, I'm like thinking about my pastrami sandwich again. Right. So So the two sentence pitch like viscerally is important for founders. And it's it's a it's a simple kind of atomic thing that every part Practiced over and over.
And over again. I think Tom. Uh one of the one of the partners here wrote a skill that teaches an agent how to uh take some context about a company and can and condense that into And so that was his sort of handwritten prompt or skill about how that was done. And one of the cool things that happened in the last month or two was that a couple of the other partners took a meeting that they had with a a group office hours they had with a bunch of the company
and just went through and had every founder try uh their hand at at a two cent subscription and kind of gave them feedback and input. And so kind of the knowledge that lives in a partner's head about how to do this effectively was exchanged back and forth, right? And and and now lived in the context of of that meeting trend.
And handing that back to the agent and saying, given you know what you've learned by reading through this context, improve the two-sentence description skill. And they got noticeably better after that. Like this thing is now better.
¶ Compounding Superintelligence Happens Now
This is how superintelligence happens inside organizations. I mean this two sentence pitch thing sounds like something kind of small, but uh embedded in it is actually something very powerful. I'm sure you guys have heard um Jack Dorsey talk about what he's doing with block. He basically is trying to turn Block into a mini AGI around helping people in the world make payments to one another. Uh and then this is actually
The micro mechanism by which he's going to do that, right? Like you can look at the operation of any organization as uh the aggregate of, you know, I mean the two sentence pitch at YC is that's sort of one of like thousands of things that I would argue we do for founders. You know, we just walk through a very concrete way where someone wrote a prompt.
used it, used a bunch more, other people used it. Uh a bunch of artifacts came off of that around literally like the transcript of using it becomes a thing that can be used to meta prompt. and improve in an automated fashion on a daily basis the operation of that one skill. And then suddenly that one skill, you just said it. That skill is now better than any of us individually. than bef you know, when before we actually had access to that.
And so this is like a particular like needle pinprick in the fabric of like how any organization does things. And then how do you build superintelligence inside a company? You do that on everything you do. And it's not more complicated than that. Like you literally just compose everything that you do and any given thing that any given person can do, you combine that in aggregate and in this particular process.
And like you have a super organization. It's possible now. Like every single person watching this can do this at any company, at their own company. They can do it at their job. I mean, the interesting thing is that's why you should start a startup. 'Cause people are going to be trapped in organizations with people running organizations that are very powerful and have all these resources and all this capital that do not believe what we just said.
¶ AI as the Organizational Building Layer
Because they keep all the context locked down. Right. This is one of those things that we talk about, um, how to build an AI native organization, right? Part of the key thing is not to ju just use AI as a copilot. I think that's very twenty twenty three, four, right? This is the the thing where you use it as uh really the the building layer for everything. And you need to start recording all the articles.
People wouldn't have thought of uh meeting recordings and it is one of those reasons why all these uh meeting recorders have been taking off. People have been finding them with coaching them on the meetings, but it's not just that. You could take that and improve all the output for you that you do, like writing emails, communication, planning. You have the whole context of everything.
It's funny, I remember the Dario essay where it's like there's some of the blockers on just the rate of progression of AI are not technical, they're just sort of like social cultural things. I think it's kind of like a really interesting example.
Two years ago it would have seemed to I just remember like it felt odd to just like record a meeting or like there was just like people were trying to figure out what the like social etiquette around it was and like how intrusive it was. And today I just feel like it's almost like default assumed that like most meetings are being requested, especially if they're on Zoom, but just in general, like everyone started recording things now.
It's a little scary but I think if you frame this as a way for everyone in an organization to get better at what they do using the like collective skill and instinct of of the people they work with, it's incredibly Having a canonical two-sentence description skill is not just a way to like generate a snippet of text for a founder. It's a way to help me get better at understanding what makes
for effective founder communication, right? Because now I can tap into everything that Diana and Harj and you two have learned over the many years you've done this job, which are now kind of baked into this skill through the conversation.
¶ Shared Organizational Brain and Trust
It's like a shared organizational brain. And this is very empowering. closest thing to us being able to like connect our brains. Right. Yeah. It it it totally is, right? And I I can have an agent now come and I can do practice. sessions with it, right? And I can have it critique my like there there are so many possibilities once you get all of this knowledge into a place where an agent can can work with it. Uh it's a it's a it's a very empowering thing for every
There's some subtle interesting things around here that like, you know, other people might get wrong that like I feel like we've gotten right. I mean, one of them is by default the agent conversation is actually gl um globally view viewable by any full time employee at YC. You know, we sort of weren't sure about that decision. I mean, it felt right and it felt like living in the future, but it did not come easily. I feel like we had a lot of conversations.
Well, then everyone sees everything. Is that okay? And like w you know, what is not okay? And then I'm glad we made the choice to keep it open, actually. Yeah. I agree. People learned how to use it from watching how other people used it. We use that transparency to solve several problems. same time. One, every agent conversation, as you mentioned, was broadcast.
And anybody could join that Slack channel and look and learn, right? And I remember this is another kind of big unlock when it was when you started using. You were like super creative with with the things you were doing with it. And a lot of us watched that was like, Oh wow, I didn't even Yeah. T to use it that way, right? It allows you to be a little more lenient on internal
Right. One of the things we talked about earlier was this trade-off where these agents are at their most powerful when they are given unrestricted. Which runs counter to the way more most organizations. It turns out that by defaulting to public broadcast Conversations, you kind of institute a bit of a social control on what people can do with it. Uh, that, as we learned, I think has been like reasonably effective uh inside of this high trust environment at keeping private information.
Yeah, what's interesting is um it it betrays two traits of uh truly agentic like one thousand X super intelligent organizations that I would not have necessarily guessed would exist but are now like must exist. If you want to create this type of organization, you have to be relatively egalitarian and you also have to be trust by default. And then neither of those things actually are most organizations in the world.
If you're the founder of an organization, you actually have to have those at the core of what you're doing. And I think like that kind of environment honestly works. At startups, right? When it's a small group of people that are all aligned and and and and operating in a high
The other thing you have to do is be willing to spend like ten to a hundred thousand dollars a year on tokens. But if you're willing to do it and you invest in the skills and you like actually do everything in an open way with your team that way. Like basically what I realized is it allows you to live in 2028, right? Like what you spend a hundred thousand or a million dollars a year on now, it will be commonplace.
like in in two years, right? It'll it won't cost a hundred thousand in a year, it'll cost ten thousand and the year after that it'll be like a couple of hundred bucks, right? And everyone will do it. And we'll call it like this is how companies are now. So basically there's a one-time time warp where you can leapfrog every incumbent, all Fortune 500s, all startups that exist. by doing.
¶ AI Raises Employee Skill Floor
Like I'm imagining in the nineteenth. I wonder if it felt similarly when companies started buying computers for their employees. Yeah. They were probably very expensive and probably only certain companies really invested in buying these like expensive, flaky computer systems for their employees. But like What a superpower to have a computer when your competitors like don't have computers.
I think more tactically how I've seen this affect uh YC has been raising the the floor. The floor in a sense. What I mean by that is that you could have a new employee joining and maybe it would have taken them six months to ramp up. But with this, it's sort of like they automatically get a lot of the context from the company working and they know how the best people and the star players in the organization do things.
by apprenticeship automatically with AI instead of uh because partner time is expensive or sometimes the best people in that org they're very busy, right? And you get to kinda run the simulation of what it's like to be like Pete when he does like an awesome job of coaching founders on sales or like Gary when he's like talking to founders and giving very specific advice. I think it helps all the new new entrants in the organization just be A mini version of you a lot faster than
One of the first things that I appreciated about being able to use a coding agent was that all of the dumb questions I was too embarrassed to ask, I had no trouble asking asking the agent. And it this is kind of that same thing, but at an organization.
Right. You're a brand new employee. You're embarrassed to ask. You don't want to bug hard with a with a question. And now you don't have to, right? And which on that means a lot more questions get asked and answered and people ramp up much more quickly.
¶ Horseless Carriages: User-Controlled AI
After you had built all of this agent infrastructure at YC, it inspired you to write this essay, Horseless Carriages, that went like pretty viral on the internet. Maybe you can like explain the ideas behind horseless carriages. I think they're still very relevant. It was a critique of a lot of the the AI software that I And to be totally honest, I think a lot of people are not going to be able to do Still like that. Yeah. It didn't change. Yes.
I just saw a lot of examples of uh companies building software and adding AI features by sort of slotting a little bit of AI inside of a lot of Right. And and the example that I used at the time was the the kind of email Team had had shipped. But the the real idea underneath was this kind of that the that the potential for AI is to shift control of software.
from the developer to the user, right? And and the the simple example I started with was basically that all of these kind of like AI as a little feature Kept a bunch of About how the AI should do a job locked away and hidden from the user, which was just this classic example of like, well, it's the developer's job to figure out how all The developer should write that and we should protect the user from the Safety isn't I hate it.
Right. And and you know, and it and it's just again going back to this contrast between watching the way that some of these tools work and what it was like to use a coding agent on my computer that could do anything, right? And feeling feeling like I I had superpowers. I think the conclusion that this essay points Is that as we get better at building AI native It's going to look a lot more like the agent wrapping software-deterministic tools rather than deterministic software. Right.
Right. And we've done our best to expose that to internal employees with some of these primitives that we filled. Um but we have a lot we have a long way to go. Chat as the interface, I just feel something there's like things going around right now about how there's a need to build a new interface for like AI and what does that
And I think that just comes from people who haven't like touched and felt it yet. Chat is actually pretty good because like you trust the agent, you increasingly trust the agent to do more of the work and you trust its decisions and you don't actually need to like have too much of a UI to go in and like review the things. It's time for just in time software. Yeah, basically, right? Like yes, occasionally you want it to present you like maybe you like a specific view of something.
And it could make the software and build it as a single page JavaScript just purposely built for you at that moment. Yeah. And it could be a skill file that could be like called anytime you want. I was thinking a lot about this because I used to be in the camp that oh perhap when Chat G P D came out and it was twenty twenty three that perhaps chat was not gonna be the UI for all these AI applications. And I've definitely changed my mind.
Part of it is that after experiencing all these tools and I think the more I reflect upon it, why chat is probably the better interface, is because it's the closest thing to human language and human language and writing is basically the closest thing to expression of thinking. So chat is the closest stepping stone to clear intelligence. Yeah. So you can't just put it in a box. I think it just constrained us too much to have a very specific box. So that's why I thought it's like okay.
all in with chat interfaces. I used to be in the other camp and It's multimodal. I know we've talked about, like, telegram is not ideal, but... It's pretty good. Yeah, it's pretty good. I mean the voice memos sometimes when I don't want to type you just do the voice memo and it's it feels like I'm talking to Like I can give it text, I can give it voice, I can give it pictures of things. Great. Any files they give. Like pretty good.
¶ G-Brain: Dawn of Just-in-Time Software
I just experienced this. So like January I think the last episode we did, I just talked about this like I spent January and through Mar February building a half a million lines of code for a Rails app, which was Gary's list. And it was like, yeah, I know people make fun of me for like it was a blog, but it was like I built the blog in like the first week. Like I spent a month and a half building a full agentic framework.
that did like my own version of deep research and like fact checking. But the thing is I built it the way I would have built software in 2013 the last time I wrote code. It was like the web 2.0 version of this. And Cloud Code lets you do that. And uh what's crazy to connect is like I'm working like I don't know, I think I wrote like forty thousand lines of code the last three days just for G Brain.
And G Brain is basically Gary's list two point oh, but it's totally open source, right? So everything I had to write for uh agentic retrieval, everything I had to do for voice extraction, everything I had to do for fact checking, uh all of that now exists inside G Brain and I just gave it to my, you know, Gary's list team. yesterday as their own open claw instance and they're flying now, right? Like they were complaining about like I had made
you know, this monolithic writer chat interface and it was like full of bugs'cause I was like re implementing things that OpenClaw and Telegram already do. And now they just use open and claw, Telegram and my retrieval system with like all the same data that I extracted it out and with our MCP and it's working great. Like basically, you know, Gary's list two point oh, the next rewrite, thankfully, is not half a million lines of
uh Rails code that is like insane to actually you know it's rigid. It's uh takes a long time. It like takes like ten times long you know, even though it was one one hundredth the amount of time to do it like by hand. you don't have to do it by hand. Like that half a million lines of code in Rails is easily like 10,000 lines of like TypeScript.
and like maybe two thousand lines of markdown. And all of that is way more dynamic. Like you you could just say, like, actually for the second paragraph, uh, I really like including a biography of like the politician we're focusing on. And it's like I don't have to code that in Rails. I don't even have to write that into um a Ruby file that then gets evaled in like
you know, my complex eval infrastructure. Like OpenClaw just knows that and I have an eval skill. My editor in chief can just change it on the fly and I didn't touch And it's like this is insane actually. Like this is actually the dawn of just in time software and I can see it right now. The best AI software that I've used, whether it's inside of Wi-Fi. tend to be very small and just add kind of the smallest amount of code
Mm-hmm. And you can build an awful lot with that, right? I can write tens of thousands of lines of code, uh, like like you're saying. But the ability to start at this like extremely simple thing that I need to to understand very little in order to use is incredibly powerful. And I think that's I think most software
We were talking about this earlier, but I think that is what OpenCore did really well. Like there were like a few things that you want. You wanted like some ability to give it a bit of personality, you wanted it to like persist and last for a long time and have some concept of memory. It's not like perfect, but That's like actually like good enough as like for that use case.
Yeah, Claude Code too. Every time Boris comes and speaks at Wesie, he spoke with Diana er earlier this week. One of one of the things that really stands out is how obsessed he is with simplicity and with just like making the product as My favorite example of this is is uh the the this open source harness called Pi, which is a That's what that's what OpenCore uses as an out of the box uh trading agent. It's this beautiful piece of software which is just like the smallest possible.
You can use Pi to ex modify. Right. And it's this kind of idea of like self extending and self referential software. It's really fascinating. Uh and you're right, OpenClaw was built on top of that. One of the things I'm very curious to see is how many other sort of pieces of classic software emerge in this form as this kind of minimal thing that you start with.
uh and then use an agent to extend over time. I think more and more I mean looking at honestly the benefits that we've gotten from having our own customizable software, I suspect that a lot of commercial software uh will come with this capability. Uh out of the box.
¶ Centralized vs. Personal AI Future
There's a really interesting subtle thing that I wanted to talk about around like what I learned from your essay, uh, which is like AI can either be centralizing or decentralizing.
And um the Google Gmail like I can't change the prompt thing is like the perfect example of that. We basically have a choice to be made over the next I don't think it's even that long. I think it's like eighteen to twenty four months. It might take five years, but Um, there's sort of two scenarios and uh what comes to mind is literally like the uh nineteen eighty four Macintosh commercial by Apple where it's like, is twenty thirty four
going to be like nineteen eighty four. And, you know, the nineteen eighty four case would be we have centralized control, like there are five kings, there's only, you know, one of them maybe wins.
They have the most advanced AI. They have uh end run around all compute and power. They have all the space data centers because they could you can't build any terrestrial data centers in America anyway. There's this like centralization of control. And not only that, They don't let you run your own prompt. Like they literally do the Gmail thing, but like for your whole computing existence, right?
and this would be as if like personal computers never existed and there were only mainframes and mini computers. Like this is sort of lost to the sands of time. But, you know, in the nineteen sixties and seventies when computers first came out
Like you couldn't go to the store like you can today. You couldn't go to an Apple store and just buy an iPhone, let alone uh a Mac. You had to get access to like this thing that was worth like hundreds of thousands of dollars to millions of dollars and then And it was like tightly locked down by corporate policies. You're right. And the and the thing that really spurred the computing revolution was when people started having personal computers that that they could experiment with.
Yeah, and it was like the priesthood, right? There was a small priesthood and an institutional base that controlled capital, literally the means of production. And so, you know, this is like a coherent future that we could live in that I don't want to live in. And the alternative to that is actually
uh embedded in the homebrew computer club. It's embedded in the revolution that Steve Jobs and Steve Wozniak gave us when they were in the garage in Mountain View, literally soldering together breadboard. And they like sold 500 of these Apple Ones. And I think we're at the Apple One moment right now. We are coming up with the primitives. We're learning how do these things work and how do we sell it and how do we package it.
Uh but then there's like a lot of choices right now, right? Like most people, the mass you know, a billion users use ChatGPT and ChatGPT like gives you a little access. But MCP is really locked down. You actually, you know, can't
hook things up to your own databases that easily. Um and, you know, for what? Safety. Like I would argue Claude is like a little bit more open, but not really perplexity computer is probably the best version of it, but it's still like you know, d pretty limited compared to what you could do with OpenClaw and Hermes Agent.
And so what does the uh revolution look like that is like the true personal AI moment? And that's what I hope that we are building with things like G Brain and you know, Hermes Agent and OpenClaw, like the ability to run your own software, to change your own prompt. to test all of it, to have your own private repo that like you know is only yours, um, to be able to choose which model to use. And maybe it's an open weight model.
Like to me that's sort of the white pill for AI is uh we could have corporate control, no control of your own prompts and like literally the AI happens to you. you know, you're under the API line. Or like there's this other alternative where I want like a billion people to actually control and program for themselves, what are these things? This should be an extension of yourself and what you care about, not what, you know, Meta or Alphabet or even OpenAI or Anthropic care about.
I always really bristle when I see AI framed as a way to replace people because it just doesn't match the way that I am. in the way that so many of the people around me have experienced it, not as a replacement for humans, but as a thing that empowers.
If you look at at kind of how tech has developed since the era of of of mainframes to PCs to the internet, which gave everyone like a publishing platform to re like it's it's a story overall, above all, of individual empowerment. And I think AI uh is going to play out. I think it is going to enable us to do more uh than we could before. I think it's going to r eliminate kind of the drudgery style work that like made a lot of my job painful in the past.
To me, it's like we have to make choices to do so. By default, like a company is not open. By default, a company is uh command and control. By default, maybe the leadership gets access to these tools, but like the, you know, line level people, the staff people don't, right?
And like you we need like a radically different type of organization and we need to actually offer computing in a different way. And um these are all choices and the people who are watching are gonna be the people who build all these things in society. So We better choose well. Well that's all the time we have for today. I mean, I think we covered some pretty heavy stuff, but Thanks for joining us. Thank you. Thanks for watching guys. We'll see you guys on the next one.
