Intelligence Isn’t Enough: Why Energy & Compute Decide the AGI Race – Eiso Kant - podcast episode cover

Intelligence Isn’t Enough: Why Energy & Compute Decide the AGI Race – Eiso Kant

Nov 06, 20251 hr 6 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Summary

Eiso Kant, co-founder of Poolside, shares insights into Project Horizon, a massive data center complex in West Texas, explaining why owning infrastructure is critical for AI labs to scale and control costs as intelligence becomes a commodity. He also unveils Poolside's innovative reinforcement learning to learn (RL2L) approach, which aims to reverse-engineer the web's thoughts and actions, and discusses the future of agents and AI's non-plateauing progress.

Episode description

Frontier AI is colliding with real-world infrastructure. Eiso Kant (Co-CEO & Co-Founder, Poolside) joins the MAD Podcast to unpack Project Horizon— a multi-gigawatt West Texas build—and why frontier labs must own energy, compute, and intelligence to compete. We map token economics, cloud-style margins, and the staged 250 MW rollout using 2.5 MW modular skids.


Then we get operational: the CoreWeave anchor partnership, environmental choices (SCR, renewables + gas + batteries), community impact, and how Poolside plans to bring capacity online quickly without renting away margin—plus the enterprise motion (defense to Fortune 500) powered by forward deployed research engineers.


Finally, we go deep on training. Eiso lays out RL2L (Reinforcement Learning to Learn)— aimed at reverse-engineering the web’s thoughts and actions— why intelligence may commoditize, what that means for agents, and how coding served as a proxy for long-horizon reasoning before expanding to broader knowledge work.


Poolside

Website - https://poolside.ai

X/Twitter - https://x.com/poolsideai


Eiso Kant

LinkedIn - https://www.linkedin.com/in/eisokant/

X/Twitter - https://x.com/eisokant


FIRSTMARK

Website - https://firstmark.com

X/Twitter - https://twitter.com/FirstMarkCap


Matt Turck (Managing Director)

Blog - https://www.mattturck.com

LinkedIn - https://www.linkedin.com/in/turck/

X/Twitter - https://twitter.com/mattturck


(00:00) Cold open – “Intelligence becomes a commodity”

(00:23) Host intro – Project Horizon & RL2L

(01:19) Why Poolside exists amid frontier labs

(04:38) Project Horizon: building one of the largest US data center campuses

(07:20) Why own infra: scale, cost, and avoiding “cosplay”

(10:06) Economics deep dive: $8B for 250 MW, capex/opex, margins

(16:47) CoreWeave partnership: anchor tenant + flexible scaling

(18:24) Hiring the right tail: building a physical infra org

(30:31) RL today → agentic RL and long-horizon tasks

(37:23) RL2L revealed: reverse-engineering the web’s thoughts & actions

(39:32) Continuous learning and the “hot stove” limitation

(43:30) Agents debate: thin wrappers, differentiation, and model collapse

(49:10) “Is AI plateauing?”—chip cycles, scale limits, and new axes

(53:49) Why software was the proxy; expanding to enterprise knowledge work

(55:17) Model status: Malibu → Laguna (small/medium/large)

(57:31) Poolside's Commercial Reality today: defense; Fortune 500; FDRE

(1:02:43) Global team, avoiding the echo chamber

(1:04:34) Next 12–18 months: frontier models + infra scale

(1:05:52) Closing

Transcript

Cold open - "Intelligence becomes a commodity"

I would actually go as far as saying that intelligence is going to become a commodity. If your foundation model company and you're not. building physical infrastructure. You're cosplaying your business. Project Horizon is us building out one of the largest data center complexes in the United States. We're talking about a scale of compute that gets counted in the hundreds of megawatts and soon gigawatts. So we call it reinforcement

learning to learn, RLTEL. This is the first time I'm probably publicly saying this. Hi, I'm Matt Turk from FirstMark. Welcome to the Mad Podcast. Today, my guest is Izo Klant, co-founder of Poolside. Poolside is a foundation model lab company focused on software engineering.

Host intro - Project Horizon & RL2L

currently reported to be raising a $2 billion round at a $14 billion valuation, including a reported $1 billion check from NVIDIA. We dig into Poolside's ambitious... Project Horizon, an AI factory data center at multi-gigawatt scale, and why AI labs must own energy, compute, and intelligence. ISO also unveils reinforcement learning to learn, a new path that goes beyond pre-training and classic RL. Please enjoy this deeply insightful conversation with ISO. ISO, welcome.

I should say welcome back. You and I chatted almost two years ago now when you guys were just getting started. I was actually listening back to our podcast and I hadn't realized that it was the very first podcast we had done. as poolside. And yeah, two years have flown by. Fast forward to today, at the end of 2025, the race between Frontier AGI Labs has only gotten crazier, frothier. So where do you guys stand? What

Why Poolside exists amid frontier labs

What is the reason for Poolside to exist in a world of hyper-competition with other labs? It comes back to why we started the company. If you go back to early 2023 when we started Poolside, We started it because we had our own point of view on the research. So if you remember earlier in that year, you know, GPT-4 just come out, and the narrative in the world was all we have to do to reach AGI is scale up language models.

do more next token prediction on larger number of parameters and more data. And we agree with the importance of scale, and Tilda State still do, but our point of view was that reinforcement learning was going to become the most important scaling axis for model capabilities. extremely contrarian opinion at that point, still when we were doing our podcast, and I'm probably following, you know, 12 months as well. Today, that's, of course, very different.

Today, I think that's become consensus. And I think the world has understood that we now have an ability to continue to scale models towards more and more capable direction and really close the gap between human intelligence and AI. And that mission was the original mission of Poolside is still very much the mission of Poolside. But along the way, I think what we had predicted would happen played out. We were going to continue to scale up. More compute was going to be required.

But at the same time, the first paradigm of kind of pre-training, of predicting the next token on the web was becoming sigmoidal and was slowing down in terms of the gains that it had. So over the last two and a half years, we caught up on the pre-training side and really on what we refer to as table stakes of foundation model building and language modeling, while building some pretty serious advantages on the reinforcement learning side.

And now we're in a moment where those two things have come together. All of us, like as Frontier Eye companies are developing, our big model runs are still at the same scale. So there's a series of advantages that we've built in our research. There's advantages that we've built on our engineering. And those are probably some of the most important sustained ones. But now we're getting to a point where scaling up our models.

to frontier sizes is also necessary for us. And hence, we've recently had some announcements about the sheer scale of compute that's coming online. we're competing. Like, we're in this race. I think I listened back to our podcast and I said, you know, we're going after Open Eye and Anthropic. That's still very much true. But our starting point was a little bit different, right? We said...

To get to highly capable intelligence that can reason, that can do planning, that can understand the world, you're almost better off putting a set of blinders and focusing on one proxy for that. And for us, that proxy was software development. And as we've gotten more and more capable at doing longer horizon software development tasks, we've also gotten more and more capable at, frankly, all type of knowledge work tasks. The world from a model perspective...

I think we are all over time converging to a similar point. I would actually go as far as saying that intelligence is going to become a commodity. In that world who we want to be is we want to be trusted by enterprises. We want to be trusted by businesses to power the knowledge workforce that started with coding agents and is now going beyond that as well. And I think in the space.

so much is going to transition. We're like a pre and post electricity moment. So I don't think it's a winner takes all market, but it does seem to be that it's a small number of companies who are able to get there.

Project Horizon: building one of the largest US data center campuses

As part of that race towards AGI, you've had some very big news in the last couple of weeks. First of all, there is a rumored large fundraise up to... $2 billion where NVIDIA reportedly would be investing up to $1 billion at 12 billion pre, 14 billion post. It's a very large round. That seems to be very tied to another big news that you guys did formally announce, which is Project Horizon. What is Project Horizon? So Project Horizon is us building

building out one of the largest data center complexes in the United States. And this comes back to something I said earlier. We talked about intelligence becoming a commodity. Our view for... Pretty much since starting the company over two and a half years has been it. There's three layers of the stack that fundamentally are going to matter. It's energy, it's compute, and it's the intelligence built on top.

And within this world, if you think that intelligence is going to become less distinguishable between the companies building it and becomes a commodity, a commodity probably more like oil or cloud compute than like bread at the bakery, is... There's two things that matter, your ability to scale it and the cost at which you deliver it to your end user. Now, the ability to scale it was frankly the primary motivating driver for Project Horizon, but cost as well.

It's important to think about what you could do 12 months ago versus what you could do today. At the scale of compute that we were talking about two years ago in our industry, you could call up a data center colo and say, hey, I want this much compute in six months, and there would be space.

like physical space where you could deploy it, or there'd be someone who has the capacity available to you. Today, we're talking about a scale of compute that gets counted in the hundreds of megawatts and soon gigawatts, but there's no one you can call. These are built to suit data centers that are so large that no one is building them before having a tenant.

And so as you're approaching the frontier and your models are getting more capable and you want to serve that intelligence to the world and you want to scale up your training as well over time, now your lead time from deciding to do that to being able to do that. is no longer calling up a hyperscaler or calling up another partner and having it in months. Now we're talking 12 months, 14 months, 18 months with huge capital numbers attached to it. And so...

My co-founders say this thing to each other and probably not really for podcast material, but if you're a foundation model company and you're not building physical infrastructure, you're cosplaying your business.

And this is kind of the really fundamental nature. It's not that we went out and said, it'll be cool to build infrastructure. It was a necessity for the mission to be able to achieve it. So you think you just get boxed out, right? So obviously OpenAI is a whole target project and just announced.

Why own infra: scale, cost, and avoiding "cosplay"

like I think a fourth data center that they're meant to build. Anthropic has a special relationship with AWS. I think AWS is building one for them. I guess that's what you were saying. You need a tenant in that case.

the hyperscaler is willing to do it because they have Anthropic as a tenant. So as an indie lab, for lack of a better term, you have to own your own destiny and that involves building your own data center just to play it back. Yeah, I think as a foundation model company, you can go two paths.

you can choose to deeply partner with a hyperscaler and kind of have them become an owner in you and really go all the way. And I think that's one direction. But at the same time, The world is getting to a point where I don't think anyone has any doubts anymore that we're now on track to reach human level capabilities and intelligence.

And in that world, $29 trillion of knowledge work rewrites itself, right? Scientific progress starts pushing beyond levels that we've ever seen. And all of it is intelligence on compute. And so we are far more in a... A foundation model company is far more in a physical infrastructure business than most people realize. Because they need to scale that compute, right? And bring it to end users. And frankly, do so cost effectively, right? The cost of your tokens are going to matter more and more.

As our intelligence has all become closer to each other, the ability to scale this up and do so to end users where, you know, the cents per token are kind of a determinant if someone's going to buy it is critical. We already started several years ago asking ourselves the question, like, what would it take to move towards this? And then over time, you know, we learned more, we observed more, and then we started acting.

And the acting towards it is Project Horizon. So Project Horizon, in a nutshell, is today announced a two gigawatt campus. We can actually go far beyond two gigawatts on the site that we're in. It's a partnership with an incredible family out of Texas called the Mitchells. The Mitchells have one of the single largest parcels of land in the United States. It's half a million acres. And it always blows my mind because for context, LA is 300,000 acres.

On that land is renewables being built, there's grid connection, there's water, but there's also a 20 inch main gas line. And that offers us a possibility to start scaling up energy. that powers data centers incredibly fast and an incredibly large scale. And so what you're seeing from us is that before the end of this year, we're starting construction on a 250 megawatt data center. It's natural gas powered.

with Grid Connection as a backup. And it will house an incredible amount of compute, but it will be the first phase of many that we're building out. Great. So just to unpack some of the things you said, you mentioned the impact of owning your physical infrastructure on the economics of Frontier AI Labs. Help us understand that. Obviously, one key question in the world of AI has been...

Economics deep dive: $8B for 250 MW, capex/opex, margins

gross margin where do you think owning your own uh physical infrastructure lead you towards in terms of like overall structure i'm not yet convinced that gross margins in our industry will look like a sas company like 80 plus i think When we're talking about a commodity as intelligence, that we build value-added services on top as a foundation model company, on one hand...

You sell your tokens, your barrels of oil, your raw material. And on the other hand, you're building up value-added services, your products that you bring to end users and customers that unlock value for their businesses or for consumers in the case of others. In that world, I think it looks a lot more like a cloud company.

So I think when we're going to look at the scale of this, it's going to look like cloud companies with a zero behind that. But from a margin perspective, probably pretty similar. Because if you think about what is an AWS or a GCP or an Azure,

it's effectively virtualization of hardware with services on top. I don't think foundation models are that different. So I think from a margin profile, we'll probably sit far closer towards that like 40% mark that you see in cloud companies than you do like the 80%. on the software side. And now, when you start thinking about what sits in the stack of cost of intelligence, you effectively, you've got the land, you've got the energy, you have the data center.

You have the chips inside the data center. And just for context, if we, for instance, break this down with illustrative numbers that are roughly in the right ballpark, to energize 250 megawatts is about half a billion dollars worth of physical infrastructure.

So in the case of gas turbines, it's gas turbines, but there's different paths to it. Building a data center, like the power shell that brings together all of the equipment where the compute can run inside, you're talking kind of north of $2 billion, $2 to $2.5 billion.

Now, the compute that goes inside today would be about $5.5 billion. It's kind of all in all when we're adding this together. The compute being the chips and the networking and everything that kind of is what we'd refer to as the fit out of a data center. And so you've got about an $8 billion cost of 250 megawatts. So when you hear in the industry, you might say a gigawatt is $40 or $50 billion. That's what people are referring to, right? It's the breakdown of those costs.

The life of a chip has a certain amount of years to it. And in our industry, there's lots of, you know... argumentation about how long they last, and it's a demand and supply question. A data center has kind of traditionally, you know, a 15-year life, but it needs retrofitting as new generations of chips come online. And your power infrastructure can last a very long time, but requires maintenance.

In that $8 billion project of 250 megawatts, you're annually spending somewhere north of $300 to $350 million of OPEX. And so you can split that about 50-50 between the cost of financing. and the cost of energy and operations. Energy like being the most. And this is where there's already an interesting insight. All of a sudden you realize that the energy part today is not the biggest part of the stack.

We're going to talk $160 million a year in energy. And of course, I'm taking numbers that are West Texas numbers. So energy can be double that in other places in the country. But still compared to the total cost TCO of compute. It's relatively, you know, minor. Numbers are still very big. The reason I mentioned is that what is going to happen over time, right? Like what's always happened in capitalism is the margins compress everywhere.

Now, margins have already compressed for energy. We already try to make energy as cheaply as possible in the world. Data centers effectively have already been increasingly more commoditized over the last 20, 30 years. GPU compute today...

It's still very high margin business. And over time, we'll find a similar level of compression. And as the chips become more compressed, the other parts of the stack are more important as well. And when you end to end... own all of this and build all of this, everywhere where previously margins sat in between, you know, falls away.

If that's the person who you were paying for the cost of land, if you're talking about the cost of the energy, the cost of building the data center, all the layers in the stack. This adds up to a very large amount because the foundation model itself... is, you know, it has capex for building it. It's your R&D cost and you, you know, you amortize it over time. But the physical infrastructure is really frankly where the majority of your cost sits.

And so I know I went super detailed straight away, but I think it's useful for people to kind of get this sense of where it is. So when you take all those margins out, all of a sudden you can start seeing it. You can serve your tokens 20, 30, 40% cheaper than someone else. And this is going to matter. Because we're shifting from human labor that leads to economic output to intelligence on compute combined with human labor. And we've barely started this in the world.

And so we're going to be in a world where this is a commodity that will look far larger than we've seen cloud compute be. Frankly, it's often the largest bills of companies, what they're paying. In that world, controlling that cost is a big impact. But cost is really only part of the answer here. You have to be able to scale it. You have to be able to say, if I next quarter need more compute, can I bring it online?

Because there's no shortage of chips. There was a clip yesterday from Sacha on a podcast that I saw. where he said, look, it's energy and data centers that are my bottleneck. It's not chips. And it's true. NVIDIA does an incredible job at supplying the market with chips and TSMC and everything in the stack and scale up to huge levels.

But physical space, bringing power online and building data centers, well, that's when we kind of in tech will always start realizing the real world hits us. And you mentioned cost of energy and cost of financing. Just quickly, this is finance how?

project finance kind of setup? Exactly. So if you think about a traditional data center business, and this is not unlike it, you have an amount of equity that goes into that business, you have a loan to cost ratio, and the loan to cost ratio really depends on the lease. the tenant that you have on that data center. He's got a 15-year lease contract traditionally.

And then it's traditional project financing that is highly effective. The world has been doing this for a long time. It is relatively low-rate financing. And this is not new. This is not new to tech. It's been around for a long time.

CoreWeave partnership: anchor tenant + flexible scaling

Part of the idea is that both side will be the anchor tenant, but you'll be renting the facility out to others as well. What is going to be the revenue stream? We are not yet at a place where we can be signing 15-year leases on. massive multi-billion dollar build-outs. But we are ramping up rapidly where we see a path to being able to scale up compute at those levels.

And so we found a really great partnership here with CoreWeave that was announced. CoreWeave, frankly, is second to none in terms of operating NVIDIA's compute. I first have GB200s online at Skill and now coming with the GB300s and have been a great partner for us in the space. And so we found a really great hybrid solution with them. So they are the anchor tenant on our data center.

jointly we can scale up our compute inside that data center and any capacity that we choose not to take or not able to take in our space, they can bring to other customers. And this was really an important thing because we're not a hyperscaler who can make multi-billion dollar bets for two or three years out. So we needed to find our way of having great partnerships that were kind of win-win situations.

where we could scale up our compute, but didn't need to make a lead time decision years ahead of it. And this has been a fantastic setup because it kind of brings the best of both worlds to the site. As you explained all of this, it's fascinating in terms of ambition. It's also fascinating in terms of what that means for ultimately a young software AI company to become a major physical infrastructure company.

Hiring the right tail: building a physical infra org

So the CoreWeave partnership helps, but presumably you have to hire all sorts of people. That's a whole different skill set. How did you think about that and also the financial aspect of it? We followed an algorithm. I would probably say most of my career by now, at least I remember going back as far as nine years, is you have to, by all accounts, avoid the Dunning-Kruger effect as a founder.

Because when you get into something new, like it has been on the physical infrastructure side, you start with learning. You start with reading. You start with meeting the experts, learning and learning. And as you learn more about a topic, you fall on this point where you start thinking you really know it.

but you don't really know it because you haven't hit the real world or you haven't done it before. And what I found to be one of the best advice I got a very long time ago, whenever you find yourself in that situation, Well, this is when you want to start finding the experts to join you. But how do you find the experts? Because you yourself aren't. And here's when you just start interviewing 100 plus people for every role and you build a distribution.

You start understanding who sits on the right tail end outlier of that distribution. And it turns out, you know, even just with... I wouldn't say our knowledge at this point was shallow because it's been years understanding the space and getting close to it, but definitely not the level of it like an expert. We got to the point where we started being able to identify who were the outliers.

And those are the people you hire and you bring on, right? And you empower, you give the right autonomy to. And that in combination with the values that we care about. So at Poolside since day zero, we've always cared about low ego, kind hearted. extremely hard working like in our space you work way too many hours a lot of intellectual curiosity and horsepower and deeply care about the work that you deliver

And we recently added a six because we found that's really been true is like people are very explicit and direct in their communications. It can be very clear. And so we started with that and you start from the top. We found an incredible hire in Lance, who's our VP of data centers. And from there, we've been building out the team. What was particularly interesting to learn about the infrastructure space over time is that...

It's an incredibly small world. Everybody knows each other. It makes tech feel like a big world. The construction of data centers at scale has really only been done by a quite small number of players. And so everybody knows each other across the entire end-to-end supply chain. So you also have an ability to very quickly understand who's considered a good actor and who's considered a bad actor, who has built up a reputation. Small industries are very reputation-driven.

And so it all starts with people, empowering them, learning from them continuously, and knowing where you trust and knowing where you go deep and you ask questions and then try to challenge people. That has actually not just led us to building an infrastructure company with an incredible team, but also to find the people in the industry who were able to start rethinking some of the ways data centers have been built.

So we haven't just gone and said, oh, we're doing the exact traditional thing everyone else is doing. We've actually taken a quite new approach to data center construction as well. That sounds great. Unpack that. So I think the industry has seen the... XAI data center being built in record speed, which I think has reset the bar for everyone. How do you guys go about building that data center? So if you think about data centers, there's kind of been a traditional way of building them.

I wouldn't say new way because it's been around for over a decade, but on one end you have kind of stick-built buildings. Everything is done on site, right? So all the materials are brought there, thousands of tradesmen who are assembling the data center. And on the other hand of kind of like Vertiv here in the West or players like Day One and their parent company out of China have gone full modular. So think of like your data center units are rolling out of a factory.

What was interesting in that both of those approaches have their limitations, right? The moment you're bringing thousands of people to site and you're assembling everything, and especially in what is effectively the Permian Basin, the desert in West Texas comes with its challenges.

both on mobilizing workforces, but also the weather, the dust, the logistics, the supply chain. But then the factory side has not yet scaled up at a level in the West. I would say this is different in China, where you can actually bring out. gigawatts like over the course of you know like couple of years uh and so we we kind of decided to meet in the middle and took an approach where we said we're building the actual building stick build

So that's being built on site. Think of it as an extremely long corridor with leaves attached to it. But let's take a modular approach, but one that fits on the back of a flatbed truck. So how do we kind of do two and a half megawatt compute worthy skids?

Think of this for electrical, for mechanical and cooling, and for compute. Those are kind of the three elements that sit inside a data center. And do it with manufacturing partners here in the United States. And what you start finding here is, and once you start breaking the problem down,

you realize that the electrical modular components can be done by lots of great manufacturers. This is not unique. You actually have great players, including very large ones, that can help you there. When you get to the cooling and mechanical part, It's a smaller number of companies, but again, have been doing this for a long time, highly reputable, and you can work with. And then when it gets to the compute side, there's recently been an unlock.

Because on the compute mixed with cooling side, what you're finding is that now we have direct liquid to chip cooling. The designs are changing. There's a lot more that you can do in a far more contained manner. And so we decided we said we'd rather have a little bit more redundancy on every individual kind of room that we build so that we can bring them incrementally online.

So where most of the data center is being built for AI factories today, you'll see them bringing them online at kind of 40 megawatt chunks. We said, what if you design it in a way where you could bring 2.5 megawatts of compute online? you know as you're doing construction think of it again on this hallway as you're bringing on like units of two kind of a thousand gpus or depending on the gpu type can be less in the future you're bringing this online

And the combination of those things that we refer to as incrementally delivered hybrid modular data centers is kind of bringing the best to both worlds. It usually de-risks your ability to deliver. It de-risks your ability to bring compute online. It also means that if all of a sudden next month you need more compute, you can make decisions on two and a half megawatts. Like I just want a little bit more. On the other hand, you're taking advantage of the fact that

For the kind of bigger construction, the building and stuff, you take the traditional approach. And on the manufacturing side, you can partner with multiple manufacturers and you can do so actually very geographically advantageous as well. Most of the things that we are bringing to our site. are actually manufactured in Texas or very nearby. And so now you're actually reducing as well your supply chain risk and your time to site risk.

The biggest advantage though of all of this is not just speed, but it's also your workforce that you can mobilize. Because we're in a quite remote area and that remote area offers all of these advantages, right? Like pretty much infinite land so we can, you know. build single level horizontally, an incredible amount of source of energy, but also being remote, meaning that mobilizing workforces there is more challenging.

So all of a sudden, you know, when we're building 250 megawatts, we're doing it with a less than 450 person onsite workforce with very skilled tradesmen and construction workers that we're bringing over there and we're housing over there. versus a world where you might need traditionally 2,000 plus people to build the exact same thing. So it's all about de-risking and speed, and you kind of see a common theme. We think about lead time.

We do the same, by the way, on our model building site. We think, you know, how quickly can you go from decision to having something online? And how can you make sure the capital that you have to deploy does not have to sit for 12 to 18 months? That's when you as a smaller company can all of a sudden scale up. Because the big numbers really have more to do with the fact that these decisions need to be made years ahead of time. We can make them months ahead of time. What's the...

target delivery date for like the first part to come online so at the beginning of Q4 end of Q3 so next year we've got the first compute coming online and then we're finalizing in Q1 2027 the first 250 megawatts, but already next summer, the next 250 megawatts starts construction. It's a staggered build. And this means that we will always have a Ryzen on, you know, essentially three times a year, an additional 250 megawatts coming online.

Last question on this and then we'll transition over to the more familiar software and AI side of the conversation. The environmental impact, that's obviously a question that the whole... data center industry grapples with? How do you guys think about it? Let's first be honest about this. We're using natural gas for the generation of power. In the world that we are right now, it's a skill that data centers are being built out.

The sun doesn't shine all day. The wind doesn't blow all the time. And so renewables only, in combination with the battery capacity that you would need to power a data center if you want renewable only, it's just not economically viable yet. But the combination of natural gas with renewables, with batteries, we've got a big best being built out on the site, so big battery system, allows for kind of an optimal point in the middle.

Now, when you talk about natural gas generation, you want to make sure that you have your emissions in check. So what you're adding is SCRs. What's an SCR? An SCR is essentially catalytic reduction. So what you're doing is you are...

filtering out some of the more harmful particles that come out of natural gas generation. And you do this all within federal standards. You file for air permits that you have that are approved at the federal level. And so this is a critical thing to do as a business. And so you're combining those things. So that's on the energy side. The second side of environmental concerns around data centers are usually water, right?

depending on where you are, scarce resource. In other places, it's an abundant resource. It can impact like a local community. And so we're very privileged because of where we sit and the sheer scale of land that our partners have. Water is both a resource that is there at the levels that's required.

required for us without actually taking away from the local community. And this is not the case everywhere else in the United States or in the world. So picking this site was not just about how large it can scale, because it can scale into an incredibly large size. but also can you do so in a way that's responsible? And community, while it goes beyond the environmental topic, is incredibly important. You're building something out, even though we are building out in a relatively remote area.

At the end of the day, this would not be possible without the support that we get from the local community. So we're in a place called Pecos County, which is just some of the best people that I've ever met. Honestly, if you want to go and...

just walk into a bar somewhere and spend time with some of the greatest people in the United States, go to Pecos County. Because we've been amazed with how welcoming the community has been. And so this goes hand in hand with how do we invest in job programs to bring people there.

I don't mean to start sounding like a politician, Matt, because I think this is almost when I talk about community, it feels like you are. But these are projects that are not one-off. When we talk about this big multi-gigawatt project, we've got about eight years worth of construction that we are planning out there.

and potentially as nuclear becomes a viable option in the coming years and we can build out more power potentially in nuclear over there, this can be construction that happens out there for as long as we can imagine because the space is...

close to as infinite as you're going to get. And so bringing kind of everyone along has been kind of a critical point for us. We just try to be good actors and try to be transparent about the positives and also the negative externalities and how to mitigate them. Fascinating. and very different from the world of RL. But let's precisely switch to...

Now, since we chatted a couple of years ago, you mentioned that the world has changed and everybody was very pre-training focused for the last few years and sort of feels like in 2025,

RL today → agentic RL and long-horizon tasks

of pre-training and RIL now has become sort of like the state of the art in what a lot of people do. Walk us through your approach. So when we did this podcast two years ago, we spoke about our approach of reinforcement learning from code execution feedback. I think at the time I called out, we had tens of thousands of these environments where the models would go in, they would be given synthetic tasks, explore solutions,

and then be rewarded on the solutions that we're creating. And maybe you remind people what that is. So you have a whole layer where you have those execution environments that you built. Exactly, yeah. So what we have done is... At the time, it was about 10,000 or, I believe, 20,000 code bases, like real-world repositories that have complex software in them, where we define tasks, and we would look for often single-shot solutions.

So here's a task. Model can write some code. It gets tested. It might have additional signal from reward models and others in there. And it will be able to provide feedback. Was the code correct? Was it wrong? Was it rightly styled? Et cetera. Over time, that has grown a lot across two axes. One is right now we have over a million of those environments and tens of millions of revisions, putting us in like the 30 million range. So just two orders of magnitude larger over the last two years.

And that's because as you increase the diversity of the type of problems that the models need to learn in, you can increase the capabilities of the models. The second axis is that... The models got so capable in their reasoning and longer time horizon capabilities, multi-step like complex planning and execution, that RL moved from kind of single-shot reinforcement learning to agentic RL.

So meaning that now we give a much higher level task, we send in an agent, that agent goes and tries to solve for the problem. by using its tools. So editing code, executing commands, updating packages, doing everything that a software developer would do. And when you see it today in coding agents, it looks even quite natural to how we do things. Increasingly more like how we do it.

and then it gets rewarded along the way as well. So what we're seeing is that we're entering a world where the reasoning capabilities of models and their longer-term horizon planning and execution is now getting so capable that the tasks we provide them are no longer simple tasks like the questions at the end of a textbook, but now it's like the projects that you're made to do in school where you have to do a lot more steps.

And so we've continued to scale on those axes. Today, I believe everyone is scaling amongst those axes. And it's true. Where two years ago, this was a contrarian opinion that no one was doing, today it is absolutely consensus. Where we have, again... found ourselves in a point where we now have again a contrarian opinion on the future uh and it's really not by by trying to be contrarian it's just that we i think have had a pretty

I've had a couple of, you know, a long time thinking about this, right? I started thinking about this first in 2016 when I was building Source and did the first RL with LSTMs, with language models. And now the last two and a half years with these larger LLMs. And our view now has become that the world is going in a direction again that we actually might not fully agree with on RL.

So what you're hearing a lot about at the moment is the scaling up of environments with human expert rubrics. So it's not just going into coding where we've got verified rewards. Now we're entering into areas if that's... you know, how to operate on a spreadsheet or if that's how to do a complicated task as a chemist or, you know, marketing professional where you use kind of a rubric created by an expert.

to make it part of what a model can use to judge the answers. And it's highly effective. And by the way, we do it as well. It is a highly effective way to get models, singular capabilities on singular tasks to push it up. And I think what we'll see in the next couple of years is a lot of model companies ourselves as well. We go into these enterprises, we find high value use cases, we use our agents to get to 60 or 70%.

you know, of quality, and then use additional reinforcement learning with, you know, domain experts' knowledge to get it to 80 or 90 or potentially even 100%. But the narrative that that's going to scale us to AGI, yeah, we don't agree with. We think this is the right thing to do in the awkward teenage years ahead of AGI. It's because it allows you to create really economically valuable outcomes for your customers.

At the end of the day, we're businesses, right? We need to be able to fund our economic engine towards our mission to Mars, towards reaching human level intelligence and beyond. But the singularly collecting of skills, this way by creating environments and experts we don't think will scale to artificial general intelligence our view is that reinforcement learning will have a moment similar like we've seen in language modeling so what makes language modeling so beautiful

It's the fact that by predicting the next token on the web, you have this general self-supervised objective, right? You just can throw the entire internet at it. of course, continue to filter it into more quality and improve it with synthetic, all the things we spoke about last time. But it forces the model to learn. But the reason that never got us to AGI is because... Well, the internet never actually included the data set of the thoughts and actions that created it.

the final piece of code it's the final article you write but not the thoughts that you had and actions that led to it the trial and error exactly and so our view is that there is a generalized objective for reinforcement learning That doesn't require, you know, human judges or environments or reward models or anything external, but can generalize over language. The same way that we have seen, you know, next token prediction generalize.

a way of thinking about it is if you have traditionally we've taken the web and we've used synthetic techniques to reformulate it to improve it we've moved forward like we've since taken the web and generated forward what if you could generate backwards. So what I mean by that is what if you could reverse engineer the web or decompress the web into the thoughts and actions that created it? Is there such a general objective for RL that can be found?

This is where we've done a lot of our research in the last year and a half. And we have now started seeing a lot of promise towards that being possible. Not yet too ready yet to talk more openly about it because we're still at a stage of the research where we've got a lot of things to prove out. Does it have a name as an approach? Yeah, so we call it reinforcement learning to learn. RLTEL.

at least the first time I'm probably publicly saying this and so just to play it back and make sure that it's understandable by everyone so there's one approach which is reinforcement learning where you create environments and

RL2L revealed: reverse-engineering the web's thoughts & actions

and reinforcement learning does trial and error, gets rewarded, not rewarded, and the path to scaling reinforcement learning is to just expand reinforcement learning to all sorts of different tasks. So that's option A, what you're saying is that you're working on an option B, or maybe that's option A, and the first one was option B, but you're working on a different approach where you're going back to internet data, reverse engineer the thoughts that went in reaching that conclusion.

It's a perfect way of framing it. And we don't think they're mutually exclusive. We think that you want to have a model learn how to think and reason as early as possible in its training. And then you want to actually have it learn in the environments to sharpen its skill sets. And it's not unlike us. We'll have learned a lot coming out of university, but then we're put in the job and we're actually doing it and we're learning from experiences.

So the learning from experiences, the reinforcement learning from environments and from experience is effectively like a renewable energy. The density of information in those tokens is not as high. as like in a physics book is, right, where a huge amount of density, like within a small amount of tokens, and experience it's less, but it's highly valuable. So our view is that, you know, we pre-training and predicting the next token on the web.

is an incredible bootstrap of understanding language and helping us get to a level of intelligence. Reinforcement learning to learn, RL to L, internally we also refer to as the Bondi techniques, kind of our code name for it. we think will push models to a level of reasoning and thought that will happen far earlier in their training than it does today.

And then you have reinforcement learning from code execution feedback and from other verified environments that help learn really what is to sharpen those skill sets in simulated environments. And then the fourth stage of training over time increasingly will become

learning, continuous learning from real world experiences of these agents. And so those are kind of the four stages that we think training will go to. If you will talk about that fourth stage a little bit like continuous learning is something that people may have heard about that keeps coming.

Continuous learning and the "hot stove" limitation

back as one of the next avenues for the whole AI systems to progress. What is it and how does that work currently and how does that work in the context of Bullseye? Today, there's one thing about foundation models that we have yet to really optimize almost at all. like overall of this time, which is the ability to learn from a single, you know, sample of data. Foundation models today can do it and we don't yet have a path in them successfully doing so in a way that like really improves it.

Internally, we call it the hot stove problem. If you're a kid and you touch a hot stove once, you're never touching it again. Single sample and you're good. Foundation models, because of the underlying technology gradient descent, It's such a data-hungry algorithm. It requires so many samples to be able to navigate that higher dimensionality space to a place where it does something more optimal. And so we've got this big hole towards AGI.

of how can we get a model to learn continuously from a smaller number of experiences like you and I can. Now, there's a question if that's required to reach AGI. I would actually tend to say that if we take the definition of AGI to be that, you know, foundation models are as capable as you and I to do the vast majority of economically valuable tasks, maybe first behind a laptop and over time embodied in robotics.

I would tend to say that it might not be necessary. But it, of course, is an incredible thing to add to intelligence because it will massively make it more compute efficient and will also make it more valuable over time. And what we do have...

already a path towards is when we've got a large number of users or the real world giving feedback on what an agent is doing, what a model is doing. Models and agents to me are effectively the same thing. We can incorporate that feedback in improving the models. And this is really the learning from real-world experiences. This is when you've got maybe hundreds of thousands or millions of agent trajectories, tasks that were done, where some feedback was provided.

That can be by the person who instigated it or can also be by some form of environment or system. And so it's kind of taking it from in our clusters as foundation model companies to where it actually touches the users. And that feedback will come back. And so is that necessary to reach AGI? Not so sure yet. It's a very honest answer.

But will it be valuable in the journey towards it to make our models more capable and more directed to what they want them to be? Like, absolutely. In all of this, you know, I have a bit of an issue. I started to use the term more because it's just more commonly used, you know, AGI.

But it treats intelligence, we often treat it as this singular spot that once we're there, it encompasses everything. But intelligence is so multidimensional. You've got people who are incredible creative writers who can create incredible works of art at Dostoevsky. On the other hand, you've got incredible mathematicians who can prove theorems. On the other hand, you have incredible people who are amazing at designing a factory. And it is not yet obvious that that is all a singular thing, right?

And we see this very clearly with modalities, right? Someone who is not able to see can still be an incredible software developer, but it's unlikely to be able to be an incredible race car driver, right? Or unlikely, it's called impossible to be an incredible race car driver. And so intelligence is so multidimensional that we can get to a point where it's going to be as capable of doing all the knowledge work that we do behind the laptop.

and we're still not going to agree that it's AGI, and we still won't have solved every technical challenge. And that's also okay. I think we've got a long road ahead of us to truly get to the incredible, versatile features that we have as humans. But will we get to a point in the next couple of years where economically valuable knowledge work can be done by AI? I think absolutely. You just mentioned something interesting about the model and the agent being more or less the same thing.

Agents debate: thin wrappers, differentiation, and model collapse

So to weave that question into the current debate topics, Coparty, a few days ago, said the agents were 10 years away. So what is your take on... where we are with agents as it relates to AGI and is there an opportunity to build agents without being a foundation model company? I haven't yet listened to the Kapathi interview and I really want to because I don't think you noticed but I...

got into this space because of Karpathy. So in 2015, Andre Karpathy wrote an article called The Unreasonable Effectiveness of Recurrent Neural Networks, a blog post about language models. And that singular blog post, let me start my company sourced. And got me down this obsession for now a decade of like what, you know, language models and can do. And so I have an immense amount of respect for his opinion.

And so while I haven't listened yet to the interview, I can't comment specifically on what he said, but what I can say is this, right? What is today the definition that we use of an agent? It's a model running in a loop inside an environment with access to a set of tools, right? And it's doing longer trajectory. work and how are we training agents right agent capabilities as foundation model companies is that we take that agent

The binary, the piece of code that runs this in a loop and has access to. It could be a container or it could be access to a bunch of tools. And we train it together with the model. And so this is why you see the most capable coding agents. Really coming out of people who are training with the models. And this, by the way, will likely hold true for lots of domains.

But when you already have a model that has all of the intelligence and all of the capabilities to do an agentic task, right? So it doesn't require more intelligence. It doesn't need to be pushed further in like reinforcement learning.

Well, then the question, what's the differentiation in the agent? Well, the differentiation in the agent is whoever's building it needs to have access to some form of proprietary data, some form of proprietary environment, something that allows that loop that the model is running in. to be more advantageous than another competitor. And there's lots of opportunity there. But what I am careful about, and we see this in coding, we see this ourselves, is if you decide to start a company,

to build a coding agent where you are not able to improve the model. The agent is not able to train together with the model. And foundation model companies like ourselves are deeply focused on doing so. you don't have that edge. And so the cat and mouse game that you're playing becomes a lot more difficult. It's not impossible. I've seen incredible agents being built for specific things because no one can singularly focus on everything, not any foundation model company and including ourselves.

But we have a phrase at poolside, which is over time, everything collapses into the models. And I think increasingly that's what we're seeing. Frameworks or agents or products that were two years ago, you know, around.

today have either gone or sometimes their UIs have collapsed. So you used to see these products with lots of bells and whistles and lots of things behind the scenes and you would see lots of UI options. And today it's just like... a screen that says agent and you talk to it and it goes off and does the work and so i think you have to ask yourself the question like where do you sit like what's your unfair advantage that you have and if a foundation model company decides to focus on it

will that advantage still sustain or will it fall away because they're able to train the models to be further and more capable in combination with their agent and with yours? Yeah, it's funny how we went from talking about thin wrappers two years ago, three years ago, to thick wrappers last year, and it feels like in 2025 going into 2026, we're starting to...

Think about agents that are really thin wrappers. Again, it's a constant... I think we're all ignoring something, right? Maybe it's the wrong way of putting it. I think we have a hard time holding the point of view that... models will reach the same level of intelligence and capabilities that the world's most capable people in every field have. And when you take a step back, and don't take the next 12-month view, but just take the next...

I think it's in the next 36 months for knowledge work. Maybe it's the next, someone else might think it's the next five years. I think few people today would argue that it's not going to happen in the next decade. And when you're building a business, you're not building it for the next five years. You're building it for the next 20 or 30.

You're building a company that can sustain and be at the level of success, at least particularly when it's venture-backed businesses. In that world, what does your business look like if AI is as capable, is the most capable person in the field? that means certain things still exist. Like my Uber Eats app still exists because I want to look at my food and et cetera. But a whole bunch of vertical software or vertical agents,

are probably gone. And finding out where you sit in that world, I think is critical. I think we have a tendency, all of us and myself included, to get so caught up in what's right in front of us. that we don't take a step back and come back to the big things. And I think the big things really here are is that we are going to reach those levels of capabilities and it rewrites the world really like that pre and post electricity moment.

Far more than we, I think, have even factored in today. I think not even in the financial markets or not even in the way that we operate our businesses. It's truly living on an exponential to a point that I don't think we have ever seen before.

Everything we've created till now was because of intelligence. But now intelligence itself is something we will be able to create and scale. And I think it becomes a very different world afterwards. So what do you tell people that argue that AI progress is actually plateauing? Frankly, the same thing.

"Is AI plateauing?"-chip cycles, scale limits, and new axes

I've had this question for two and a half years now since starting poolside. This is not the first podcast where, you know, there was a, are we hitting a wall? We are, I think at a point where we are. continuing to see with every new generation of chips an ability to make the model larger. And it's important that this links to every new generation of chips because it is not a world where you can throw infinite dollars at scaling.

That's a false narrative. And you can think of it very easily yourself because if I take the size of a model, which is still effectively the biggest determinant of how much compute it takes to train it, Because of the limitations on the networking and because of the limitations on the flops on those chips, it is not that I can linearly add more GPUs and train increasingly a more larger model. If I do so, the time it takes becomes extra.

exponentially longer. So if tomorrow someone said I wanted to train a 300 trillion parameter model completely out of the realm of anything anyone's doing today, it just wouldn't be possible. No matter how much money you have, no matter how many chips that you put up, it would become exponentially more expensive and longer to train that. And this is because of the current hardware limitations.

But every two years, and now the chip cycle is even getting less, we have an incredible new chip and an incredible new networking stack that is improving that all of a sudden makes the next generation of size model possible. I don't think this will scale infinitely. I look at intelligence as compression. At some point, you can compress further, or at least not to a point where it's worth it that it's diminishing returns. We still have an ability to do that.

On one hand, we have a free lunch. As new generations of chips come out, we can build and train larger models, and it continues to show that it improves the model capabilities. On the other hand, and I think far more interesting as we're now getting into the world of scaling reinforcement learning, we're able to train models for longer duration with more data. Because on one hand with the size axis and the other thing, we have the duration axis, how much data we're showing the model.

And every single year, we've also become multiples more efficient with the data we had to make models become more capable in a lesser compute budget. It's kind of mind-blowing to see where even if I think about our pre-training at the beginning of the year or we have a new run that's going on at the moment versus now, you can really see incredible improvements because you're refining the data better. You're creating it cleaner. You're creating a better curriculum.

I know there's a long-winded answer to your question, but I think it's important to understand that it is not an infinite dollar. You can just keep throwing intelligence. It's kind of bounded by physics of every chip generation. As reinforcement learning is becoming increasingly more scaling access, we are able to improve models within those generations far more. But we have not yet found a generalized, infinitely scalable, you know.

dollars we can throw at reinforcement learning either. And so both on increasing model size and on increasing data from RL, there's still limitations. And if those limitations didn't exist, Poolside wouldn't have been able to catch up, right? I think this is a really important part because we're now at a point where we're starting, like, we should definitely spend some time with the models after this. They've gotten really darn good.

We're now scaling up compute to make it to the frontier. That wouldn't have been possible if it was just a world of dollars. But it is definitely a place where, yeah, I think right now, I don't see any limitations. I see more than limitations. I see opportunities in some of the research we're doing in reinforcement learning to learn and others that might...

completely open up new scaling access that can go even further. And then every generation of chip that comes out will see a stepwise function that goes up in our industry. You're very focused on software development. How do you think about... do a bit of lesson is that something that you guys worry about if you go back to our first podcast or we put a post on our website on day zero of the company we always set the path for agr runs through software it is not software

And we laid out this three-step master plan, which was step one, assist people in coding. It's kind of very early days. Step two, allow people, anyone to build software. I think the world is clearly there right now. And then step three, generalize to all domains. And we're now really in that step three moment.

And so already our models today have become generally capable across the board. But because we had kept our blinders on for those first couple of years on really software development capabilities as a proxy for intelligence, it allowed us to go really far.

Why software was the proxy; expanding to enterprise knowledge work

It allowed us to push the things that mattered. In our view, what was really missing was pushing reasoning. Like improving knowledge, you can improve knowledge outside the model. You can give it access to the right sources of information. But improving this kind of complex reasoning is what was missing. So we're kind of converged on the same point.

So today I use our models to, the other day I had to write a sci-fi book while I was reading it, which was a lot of fun, right? And my brother uses it in his growth marketing job and prefers it over using other models. We're already in that domain now, but software development is still, with our deep focus on enterprise and government, often the most highest value or highest driver of cost and also impact of knowledge work in a lot of businesses.

It's not the only one, but it's a big one. So it's been an amazing place to enter into organizations with. And now we're starting to increasingly go beyond that. So we had a big announcement this week with a company called Red Panda, who we're big fans of. And we just integrated their 300 plus enterprise data connectors.

inside it. And the demo that we were showing at the New York Stock Exchange was one of an AML process in finance that our agents are end-to-end together with these data connectors like doing. And so we're starting to open up much more broader outside of software development and come back to what we've always said, like we want to be able to power all of knowledge work in the world. For the last part of this conversation, let's...

Model status: Malibu → Laguna (small/medium/large)

go back down in the weeds and talk about some of the stuff that you just mentioned I think I and people may be curious about the current reality of poolside both from a model standpoint So starting with a model, so you have three products on your website. You have Malibu, you have Point, and then you have Assistant.

What's the current status of those products and what do they do with products slash models? If you go back two and a half years ago, one of the first things was Point, which was the code completion models. Today, there are table stakes. We have them, but it's not where the intelligence sits, right? So our Malibu was our first big family of models. And so within the Malibu models, they were really oriented towards originally to be very capable coding assistants.

Now they've become incredibly capable coding agents. And now they've also become incredible knowledge work agents. And so within that, those models are right now for their size and weight class, best in class. They have become incredibly capable of coding, but they're not yet at what I would define as the frontier.

So Frontier today is OpenAI, Anthropic, Google, and XAI. And that's why we have all of this compute coming online to scale up those model sizes. That's our upcoming family of models, Laguna, where we'll have a small, a medium, and a large.

So Small is finishing training in a couple of weeks. Medium actually starts training this week. And Large starts training as our 41,000 GB300s come online. And you'll have benchmarks for those. Do you have benchmark? We do have benchmarks, yeah. So if you think about... our benchmarks today and we primarily search on privately with the organizations we'll get the commercial part that we work with

But if you think about the Malibu agent as a coding agent, for instance, right now, it sits at a level of like Sweebench verified, for instance, where Gemini 2.5 Pro was when it came out. And so it's a much smaller model, but it's really pushed those capabilities because of our work in reinforcement learning.

And so we should do a demo after. I think we're actually doing a demo in New York publicly because we've only done it privately with enterprises in a couple of weeks at the AI Engineer Summit. And so if anybody's interested, they can come and see it there. And I think that will be recorded as well.

Poolside's Commercial Reality today: defense; Fortune 500; FDRE

That's an interesting thought from a go-to-market standpoint because you guys have been both public but kind of stealth in a way for the last couple of years and we're in a world of just like massive noise and like everybody's like... struggling for attention. How do you think about that tension, working with enterprises on the one hand, but on the other hand, all developers want those products to be in their hands? So our view has been quite simple, actually, is that...

We will make models publicly available when we are clearly best on a very valuable axis. And I think that's important. It's not just about having a hype on Twitter for a couple of weeks and having something out. You need to bring something in that can scale and that is valuable to the world. So before, and this gets to your commercial part of the question, we weren't at the frontier.

And now we are getting increasingly closer and closer, and that's also opened up our market. So before we were at the frontier, we picked a market where no one else could go, which was the defense industrial base and government. because one of the things that we're willing to do kind of from the enterprise DNA we have is take our entire model weights and the full stack all the way with the agents, anywhere the customer needs it to be.

So we today operate on workstations that go into SCIFs. We operate on servers that are on-prem or even as far as air-gapped. We operate in commercial cloud like AWS and private VPC environments. We also opt in more commercial, just cloud regular environments. And then on GovCloud, Top Secret Cloud, kind of the places where the weights have to travel to the customer. And we did this because it wasn't just about the model.

is because we knew that as the world was moving towards agents doing increasingly more work in the enterprise, it was going to become incredibly important to build out everything around the model. Today we can rattle off a very long list of enterprise features that we've built over time so that these agents and models can actually operate in complex regulated environments.

And so that goes everything from a data access layer towards the managing of assigning of agents across role-based access control, deep integrations with all of the kind of active directory-like systems that people have. all of the monitoring, audit logging, like that associates with observability, you know, go on and on. And we battle tested that in the defense industry because they're extremely large organizations. They're highly complex, they're highly regulated.

And they often need segregated deployments for different missions that they're on. And so we've been scaling that up in that industry. Now as the models have gotten to a point where we say, okay, now we feel that we can compete. We're going to wider enterprise. And so you'll see us increasingly more showing up in kind of the large enterprise names amongst financial services, industrials, technology companies.

and we treat our business as kind of two business models on one hand we want to make our models publicly available and allow anyone to use them but we want to do so when we say hey we have something here that sets us apart And that's the scaling up of computers necessary for that. The Laguna family will be public. And then we have the part of business where we go very value add. And here we started with product.

So we started with a coding assistant that sits in VS Code and IntelliJ, now also sits outside of it. Then we started with kind of the web interfaces that you expect to be able to chat and use the models, you know, connected with data sources. But we found something over and over again, which was that a lot of these enterprises and organizations, they have incredibly high valuable problems that can be solved with AI today.

Hell, in many cases with AI capabilities of two years ago or a year ago. But they're not successfully doing so. And a lot of that has to do with the gap between what the intent of the project is. versus being able to bring it all together. The right data sources, the right context engineering, often additional reinforcement learning that's needed. And here we started building up a very strong forward deployed motion.

At Poolside, we have former Palantirians who came over and really kind of instilled some of that DNA. The DNA of what it means to find a high-value problem and come in with high agency and help a customer solve that. You even have like FD. What we found is that there is a gap between the skill set.

of doing traditional forward deployed engineering, right, where you're focused on a high value problem and using kind of piping data sources together and building, you know, interfaces on top, which is, by the way, incredible thing. I have a lot of respect for everyone who has done this. The research engineering side is interesting because what you're looking for is the people in that

who also have the experience and the natural tendency to work very well with models and with agents and who work on additional reinforcement learning if it's necessary. And so we're taking these highly capable research engineers and we're putting them inside our customers. And that's for us something you're going to see us scale up increasingly more. It's why we've...

We're going to be opening an office in the UK actually next week. We're going to be opening an office here in New York and really kind of scaling up that motion of forward deployed research engineering. On that topic of geography, you guys started mostly in Europe, I believe, but it feels like over the last...

Global team, avoiding the echo chamber

couple of years, you've re-centered the company in large part towards the US. Is that a fair sentiment? And if so, what drove it? Yeah, so we've always been an American company. We've been incorporated. headquartered out of the U.S. since day zero. And I think at any given moment, the balance of people would have always sat somewhere between 40-60 or 60-40 one way or another, like throughout the life of the company. So building it as a global company was critical for us.

But the decision we made early on, and I think we spoke about it on the podcast two years ago, is to hire researchers outside of the Bay Area and particularly originally very centered on Europe. And still today, if we look at where researchers for pool sites sit, they sit predominantly 95% outside of Silicon Valley. And this offered an incredible opportunity for us.

It offered the opportunity to find highly capable, highly motivated people who were not in the echo chamber that is the Bay Area. It's one of the most valuable echo chambers in the world. I personally love it. I spent a fair amount of time there.

You've seen and probably realized that there's a bunch of things we've done that were quite contrarian to the belief of that echo chamber. And I think that was partially unlocked by the fact that we sat outside of it and continue to do so. So we hire... all across the united states we hire all across europe we're hiring increasingly more in asia as well as we're scaling up and so but we have and continue to to try to avoid

as much of an echo chamber as possible. Because I think the world, we're very early in all of this still, and the path towards AGI is one that deserves to be built by... opinions and people that aren't all the same. And we have just found that there is no shortage of incredible talent in the world. It just might not be as obvious on a CV or a LinkedIn as it is in the Bay Area. So zooming out the next...

Next 12-18 months: frontier models + infra scale

12 to 18 months, what happens at poolside? What can we expect? Models reach the frontier. That's right now. I think we see a straight line towards this. You'll see the scaling up of physical infrastructure to both empower training even more larger and capable models, but also serving them to the world. You'll see us have forward deployed research engineers and increasingly larger number of enterprises.

globally you'll see us just continue to work towards the mission right i mean for us we never want to lose sight of the fact that we are building this company and and and the economic engine associated with it because we think that the world lives after this is one where a lot of abundance can be created. Abundance through scientific progress that's going to happen, but also abundance through the fact that costing of goods and services ultimately are dependent on

the cost of what it takes to create them. And as we shift intelligence to compute, we can find ourselves in a world where we can drive that cost down. And if you look back at the last hundred years, there's no moment that I think any of us rather live than today. And so that's the core mission. It always has been. And you'll see more of that.

Looking forward to probably being here in 12 months to tell you about what happened. Looking forward to it. Great to catch up. Thank you so much. Congrats on everything that you guys have accomplished in the last couple of years. Excited to see the data center.

Closing

like the new models and thank you for spending time with us appreciate it thank you matt Hi, it's Matt Turk again. Thanks for listening to this episode of The Mad Podcast. If you enjoyed it, we'd be very grateful if you would consider subscribing if you haven't already or leaving a positive review or comment. on whichever platform you're watching this or listening to this episode from. This really helps us build a podcast and get great guests. Thanks and see you at the next episode.

This transcript was generated by Metacast using AI and may contain inaccuracies. Learn more about transcripts.
For the best experience, listen in Metacast app for iOS or Android