¶ Intro / Opening
Support for this show comes from Attio. Attio is an AI-native CRM built for the next era of companies. Its powerful data structure adapts to your business model, syncs in all your contacts in minutes, and enriches your business with actionable data. Attio also allows you to create email sequences, real-time reports, and powerful automations, all to help you build what matters: your company. Join industry leaders like Flatfile, Replicate, Modal, and more. You can go to attio.com slash decoder and you'll get 15% off your first year. That's A-T-T-I-O dot com slash decoder.
AI is moving fast. So fast, it's hard to keep up. In fact, in ServiceNow's latest AI Maturity Index, scores dipped 20% from last year. But that's okay, because AI isn't a sprint. It's a marathon. You may be behind today, but tomorrow you could be a pace setter.
Dive into ServiceNow's AI Maturity Index and see how you can innovate as fast as your ambitions. Visit servicenow.com slash AI maturity. Support for this show is brought to you by CVS Caremark. You know the saying, less is more? Well, with CVS Caremark, it changes to more for less. With more care, more guidance, and more expertise, CVS Caremark helps your plan members spend less on their prescription drugs.
CVS Caremark leverages their scale to negotiate lower net costs for medications every day. And that's exactly what your members can count on from CVS Caremark. More ways to maximize their benefits. Go to cmk.co slash stories to learn how we help you provide the affordability, support, and access your members need.
¶ The Browser as an AI Agent Platform
Welcome to Decoder. I'm Alex Heath, deputy editor at The Verge and author of the Command Line newsletter. I'm hosting our Thursday episodes while Nilay is out on parental leave. Today, we're talking about how AI is changing the way we use the web. If you're like me, you're probably already using apps like ChatGPT to search for things, but lately I've become very interested in the future of the web browser itself.
That brings me to my guest today. Perplexity CEO Aravind Srinivas is betting that the browser is where more useful AI will get built. He just released Comet, an AI web browser for Mac and Windows that's still in an invite-only beta. I've been using it, and it's pretty interesting. Aravind's not alone here. OpenAI is also working on its own web browser, and then there are other AI-native web browsers out there, like Dia from The Browser Company.
Then there's Google, which may be forced to spin off Chrome if the U.S. Department of Justice prevails in its big antitrust case. If that happens, it could provide an opening for startups like Perplexity to win market share and fundamentally change how more people interact with the web.
In this conversation, Aravind and I also discuss the future of Perplexity, the AI talent wars, and why he thinks people will eventually pay thousands of dollars for a single AI prompt. I hope you enjoy this conversation as much as I did. Here we go.
Aravind, before we get into Comet and how it works, I actually want to go back to our last conversation in April for my newsletter, Command Line. We were talking about why you were doing this, and you told me at the time that the reason we're doing the browser is, quote, it might be the best way to build agents. That idea has stuck with me since then, and I think it's been validated by others and some other recent launches. But before we get into things, can you just expand on that idea of why you think the browser is actually the route to an AI agent?
Sure. What is an AI agent? Let's start from there. A rough description of what people want out of an AI agent is something that can actually go and do stuff for you. It's very vague, obviously, just like how an AI chatbot is vague by definition. People just want it to, like, respond to anything. The same thing is true for agents. It should be able to carry out any workflow end-to-end, from instruction to actual completion of the task. Okay, then you boil that down to: what does it actually need to do it?
It needs context, right? It needs to pull in context from your third-party apps. It needs to go and take actions on those third-party apps on your behalf. So you need logged in versions of your third-party apps. You need to access your data from those third-party apps. Do it in a way where it doesn't actually constantly ask you to auth again and again. It doesn't actually need your permission to do a lot of the things.
At the same time, you can take over and complete things when it's not able to, because no AI agent is foolproof, especially at a time when reasoning models are still far from perfection. So you want this one interface that the agent and the human can both operate in, in the same manner, where logins are actually seamless, client-side data is easy to use, and controlling it is pretty natural. Nothing is going to be truly damaging if something doesn't work. You can still take over from the agent and complete it when you feel like it's not able to do it.
What is that environment in which this can be done in the most straightforward way, without creating virtual servers with all your logins and having users worry about privacy and stuff like that? It's the browser. Everything can live on the client side. Everything can stay secure. It only accesses information that it needs to complete the task, in the literal same way you access those websites yourself. That way you get to understand what the agent is doing. It's not like a black box. You get full transparency and visibility.
And you can just stop the agent when you feel like it's going off the rails and just complete the task yourself. And you can also have the agent ask for your permission to do anything. So with that level of control, transparency, and trust, in an environment we've been used to for multiple decades, which is the browser, such a familiar front end for introducing a new concept of AI going and doing things for you, it makes perfect sense for us to reimagine the browser.
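The control loop described here (the agent asks permission, and the human takes over when a step is declined or fails) can be sketched roughly as follows. This is only an illustration under stated assumptions: `run_agent`, `approve`, and `execute` are hypothetical names, and nothing here reflects how Comet is actually implemented.

```python
def run_agent(steps, approve, execute):
    """Run steps in order, handing control back to the human when needed.

    approve(step) -> bool : hypothetical callback asking the user for permission
    execute(step) -> bool : hypothetical callback that performs the step in the
                            browser and reports whether it succeeded

    Returns (completed, remaining) so the human can finish the remaining steps.
    """
    completed = []
    for i, step in enumerate(steps):
        if not approve(step):
            # The user declined this step: stop and hand over from here.
            return completed, steps[i:]
        if not execute(step):
            # The agent could not do the step: stop and hand over from here.
            return completed, steps[i:]
        completed.append(step)
    return completed, []
```

In this sketch the agent never continues past a step the user declines or a step it fails, which is the "stop it and complete the task yourself" behavior described above.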
¶ Building Perplexity Comet on Chromium
How did you go about building Comet? When I first opened it, it felt familiar. It felt like Chrome. In my understanding, it's built on Chromium, the open source kind of substrate of Chrome that Google maintains, and that allows you to have a lot of easy data importing. I was struck when I first opened it that it only took one click to basically bring all my context from Chrome over to Comet, even my extensions. So why decide to go that route, build it on Chromium, versus doing something fully from scratch?
First of all, Chromium is a great contribution to the world, right? Most of the things they did on reimagining tabs as processes, the way they've gone about security, encryption, and just the performance, the core backend performance of Chromium as an engine, the rendering engine that they have, is all really good. There's no need to reinvent that. At the same time, it's an open source project, so it's easy to hire developers for Perplexity. They can come work on the Comet browser, especially if it's something that has open standards. And we want to continue contributing to Chromium ourselves. So we don't want to just consume Chromium and build a product out of it, but we actually want to give back to the ecosystem. So that's natural.
The second thing is, it's the dominant browser right now, Chrome. And if you actually include Edge, which is also a Chromium fork, DuckDuckGo, Brave, they're all Chromium forks. Only Safari is based on WebKit. So it's actually the dominant browser. No need to reinvent the wheel here. In terms of UI, we felt like it would be better to retain the most familiar UI people are already used to, which honestly is the Chrome UI.
...slightly different UI, and some people like it, some people do not, and it's still a much smaller share of the market. And imports need to work. Otherwise, you're going to be like, oh, this is not working. Oh, that thing has all my personal context. I'm missing out on it. I don't want to go through the friction of logging into all the apps again. I think that was very important for us for the onboarding step, which is not only onboarding you as a human, but also onboarding the AI. Because the moment you're already logged into all the third-party apps that you are logged into on Chrome, with the exact same security standards, the agent gets access to that on your client and can immediately show you the magic of the product.
And the agent is seeing it, but you, Perplexity, are not. You're not using all of the Chrome data I instantly bring over to train on me or anything like that.
No, the agent only sees it when you ask a relevant prompt. For example: based on what I've ordered on Amazon in the last month, recommend me some new supplements, or go and order the magnesium supplement that I've already ordered frequently on Amazon. The agent only sees that for that one singular prompt and doesn't actually store your entire Amazon history on our servers. And you can always ensure that your prompts get deleted from our servers. So even the prompts, we can choose not to look at, even for, let's say, fine-tuning purposes. Let's say we want to make our agents good in aggregate: oh, users have done Amazon shopping queries, let's go and make it better on that. We don't even need to look at that if you choose to not retain your prompt.
So that's the level of privacy and security we want to offer. At the same time, the frontier intelligence is all on the server side. This is one of the main reasons why Apple is struggling to ship all of Apple Intelligence, be it on iOS or macOS or whatever, right? Because I think there's generally an expectation that everything needs to live on the client side. That's not necessary to be private. You can still be pretty secure and private with frontier intelligence on the server. So that's the architecture we brought in on Comet.
¶ Comet's Real-World Use Cases and Browsing Evolution
We're talking now a couple of weeks or so after Comet came out, and it's still invite-only. Or I think it's also restricted to your premium tier, your $200-a-month tier. But you've been tweeting a lot of examples of how people have been using it. They've been using it to make Facebook ads, do FedEx customer support chat, run their smart home accessories, make Facebook Marketplace listings, schedule calendar meetings. There's been a lot of stuff that you've shown.
Unsubscribing from spam emails, which is a favorite use case of a lot of people. So maybe that's the one. But I was going to say, what has been the main use case you've seen so far that people are finding with Comet?
Actually, while these are the more glamorous use cases, I would say the boring, dominant one is always invoking the sidecar and having it do stuff for you on the webpage you're on. Not necessarily just simple summarization, but more complex questions. Let's say I'm watching Alex Heath's podcast with Zuckerberg or something, and I want to know specifically what he said about a topic, and I want to take that and send it as a message to my teammates on Slack.
I think that's the thing. You can just invoke the assistant on the site and do it instantly. It's connected to your Gmail, your calendar. It's also able to pull the transcript from the YouTube video. It has fine-grained access, and it's immediately able to retrieve the relevant snippet.
I can even ask it to play it from that exact timestamp instead of going through the entire transcript and doing, you know, the Ctrl-F thing, whatever I want. That is the level of advantage you have. It almost feels like you should never watch a YouTube video standalone anymore unless you have a lot of time on your hands. And it's fantastic, right?
And people use it for LinkedIn. Honestly, searching over LinkedIn is very hard. They don't have a working search engine, basically. So the agent figures out all these shortcuts, like how we figure out using these filters, people search or connection search, and it's able to give you a recruiting power that was never possible before. I would say it's better than using LinkedIn Premium.
Yeah, and I'm glad you brought up the sidebar, because for people who haven't tried it or seen it, the main way Comet diverges from Chrome is that you've got this AI assistant orchestration layer that sits on the side of a web page, which you can use to interact with the web page and also just go off and do things. That interface suggests that you see the web as being less about actually browsing. I mean, you just said no one really has time to watch a YouTube video. Is the browsing part of the browser becoming less meaningful in the world of AI?
I think people are still going to watch YouTube videos for fun or exploration, but not... When I'm actually landing on a video, I mean, you do a lot of intellectual stuff, so it's not always fun to watch the entire thing, but I like watching specific things in a video.
And also, by the way, when I'm in the middle of work, I can't be watching the Verge podcast, right? But I still don't want to wait until the weekend to fully watch it. I want to instantly know what Zuckerberg might have said in your video about their cluster or something. And then, on the weekend, I can go back and watch the entire thing. I'm going to have a lot more time on my hands. So it's not actually going to stop the regular browsing. I actually think people are going to scroll through social platforms or watch Netflix or YouTube even more, I would say, because they have more time on their hands. The AI is going to do a lot of their work. It's just that they would choose to spend it on entertainment more than intellectual browsing. Or if people derive entertainment from intellectual stuff, like intellectual entertainment, I think that's fine too. Reading books, all these things are fine, like reading blog posts that you otherwise wouldn't get time to read when you're in the middle of work.
I think these are the kinds of ways in which we want the browser to evolve, where people launch a bunch of Comet Assistant jobs, like tasks that would take a few minutes to complete in the background, and they're chilling and scrolling through X or, you know, whatever social platform they like.
We need to take a quick break. We'll be right back.
Support for this show comes from Attio. Attio is an AI-native customer relationship management system built specifically for the next era of companies. It's extremely powerful, adapts to your unique data structures, and scales with any business model. They say setting up Attio takes less than a minute, and within seconds of syncing your emails and calendar, you'll see all your relationships in a fully-fledged platform, all enriched with actionable data.
Attio can enrich your business with real-time, customizable reports featuring valuable data points. The best part is you can build AI-powered automations and use its research agent to tackle some of your most complex processes so you can focus on what matters most, building your company. Join leaders like Flatfile, Replicate, Modal, and more. You can go to attio.com slash decoder and you'll get 15% off your first year. That's A-T-T-I-O dot com slash decoder.
Support for this show comes from Framer. If you have a business, then you have a website. But if you have a bad website, a lot of people will think the same about your business. But you can make sure your website shows how professional you are with the help of Framer. Whether you're overwhelmed by traditional site builders or frustrated with cookie cutters, Framer is the design-first, no-code website builder that lets anyone ship a production-ready site in minutes.
It's free to start. You can browse 700 plus pixel perfect templates or design from a totally blank canvas. Real-time collaboration means your writer, designer, and marketer can all tweak the same page at once. No version control nightmares. Plus, their built-in AI handles the heavy lifting by generating starter layouts, auto-translating your entire site to any language,
and can even suggest on-brand color palettes. If you're ready to build a site that looks hand-coded without hiring a developer, start free today at Framer.com. Go to Framer.com to start building a site for free. Framer.com.
Support for this show comes from OpenPhone. Modern businesses need a modern phone system. OpenPhone is a business phone system that streamlines and scales your customer communications. It works through an app on your phone or computer, so no more carrying two phones or using a landline. With OpenPhone, your team can share one number and collaborate on customer calls and texts. That way, any teammate can pick up right where the last person left off. And their AI agent can be set up in minutes to handle calls after hours, answer questions, and capture leads, so you never miss a customer. See for yourself why over 60,000 businesses trust OpenPhone. OpenPhone is offering our listeners 20% off their first six months at openphone.com slash decoder. That's O-P-E-N-P-H-O-N-E dot com slash decoder. And if you have existing numbers with another service, OpenPhone will port them over at no extra charge. OpenPhone. No missed calls. No missed customers.
We're back with Perplexity CEO Aravind Srinivas.
¶ Overcoming the Learning Curve
Your tagline for Comet is enabling people to, quote, browse at the speed of thought. I find that there's actually a very steep learning curve to understanding what it can do.
By the way, Alex, I want to make one point. There was some article, either from Wired or somewhere else, that Google was trying to use Gemini to predict maximal engagement time on a YouTube video and show the ad around that timestamp. Perplexity, on the Comet browser, is using AI to save your time, to get you the exact timestamp you want on a fine-grained basis and not waste your time. So often people ask, why would Google not do this and that? The incentives are completely different here.
Yeah. And I want to get into that, and I have a lot of business model questions about Comet, because it is also very compute intensive for you and expensive to run, which you've talked about. But to my point about the learning curve and making it approachable, how do you do that? Because when I first opened it, it's kind of like, I don't know what I can do with this thing. I mean, I go to your X account and I see all the things you're sharing, but I do think there's going to be a learning curve that the people building these products don't necessarily appreciate.
No, no, I appreciate that. And it's been the thing for me, myself, as a user: even though it's fun to build all these agent use cases, it takes a while to stop doing things the usual way and start using the AIs more, which includes even basic things like what reply you type on an email thread. Even though Google has these automatic suggested replies, I don't actually usually like them. And it doesn't often pull context from outside Gmail to help me do that. Or Slack, checking on unread Slack messages: I usually just go open Slack as a tab and try to scroll through those, you know, 50, 100 channels I'm on, clicking each of those channels, reading all the messages that are unread. It takes time to actually train myself to use Comet. So what we plan to do is actually publish a lot of the early use cases as educational material and have it be widely accessible.
I think it's going to go through the same trajectory that chatbots had. In the beginning, when ChatGPT was launched, I'm sure not a lot of people knew how to use it. What are all the ways in which you could take advantage of it? In fact, I still don't think it's really a widespread thing. There are some people who really know how to use these AI tools very well. Most people have at least used it once or twice a week, and they don't actually use it in their day-to-day workflows. The browser is going to go through a similar trajectory. On the other hand, the one use case that's been very natural, very intuitive, that you don't even have to teach people how to use, is the sidecar. It's just picked up so much that I feel like it'll be so intuitive. It'll almost be like, without the sidecar, why am I using the browser anymore? That's how it's going to feel.
It does quickly make the traditional chatbot interface, Perplexity or ChatGPT, feel a little arcane when you have the sidebar with the web page.
Exactly. A lot of people are using ChatGPT for, like, you're on an email and you want to know how to respond.
So you copy-paste a bunch of context. You go there, you ask it to do something, and then you copy-paste it back. You edit it finally in your Gmail box, or you do it in your Google Sheets or Google Docs. All this is going to, like... Comet is going to feel much more intuitive. You have it right there on the side and you can do your edits. Or you're using it to draft a tweet, or, you know, Elon Musk posts something and you want to post a funny response to that. You can literally ask Comet, hey, draft me a funny reply tweet to that, and it'll automatically have it ready for you. You literally just have to click the post button.
All that stuff is definitely going to reduce the number of times you open another tab and keep asking the AI. And firing up jobs right from your current website to go pull up relevant context for you, and having it just come back and push-notify you when it's ready, that feels like another level of delegation.
Where is Comet struggling based on the early data you've seen?
¶ Comet's Capabilities and Limitations
It's definitely not perfect yet for long-horizon tasks, like something that might take 15 minutes or so. I'll give you some examples. I want a list of engineers who have studied at Stanford and also worked at Anthropic. They don't have to be currently working at Anthropic, but they must have worked at Anthropic at least once. I want you to give me an exhaustive list of people like that, port it over to Google Sheets with their LinkedIn URLs, and I want you to go to ZoomInfo and try to get me their emails so that I can reach out to them. I also want you to bulk draft a personalized cold email to each of them to reach out for a coffee chat. I don't think it can do this today. It can do parts of it. So you still have to be the orchestrator stitching them together.
I'm pretty sure six months to a year from now, it can do the entire thing. I'm betting on progress in reasoning models to get us there, just like how in 2022 we bet on models like GPT-4 and Claude 3.5 Sonnet arriving to make the hallucination problem in Perplexity basically non-existent when you have a good index and a good model. I'm betting on the fact that in the right environment of a browser, with access to all these tabs and tools, a sufficiently good reasoning model, slightly better, maybe GPT-5, maybe Claude 4.5, I don't know, could get us right over the edge where all these things are suddenly possible. And then a recruiter's work worth one week is just one prompt, right? Sourcing and reach-outs.
And then you've got to do state tracking. It's not just about doing this one task, but you want it to keep following up, keep track of their responses. If some people respond, go and update the Google Sheets, mark the statuses like responded or in progress, follow up with those candidates, sync with my Google Calendar, resolve conflicts, schedule a chat, and then push me a brief ahead of the meeting. Some of these things should be proactive. It doesn't even have to be a prompt.
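As a rough illustration of the state tracking described here, a minimal Python sketch might look like the following. Only the "responded" and "in progress" statuses come from the conversation; the `Candidate` class, the `CONTACTED` status, and the helper functions are hypothetical and are not anything Perplexity has described.

```python
from dataclasses import dataclass
from enum import Enum


class Status(Enum):
    CONTACTED = "contacted"      # hypothetical initial state after the cold email
    RESPONDED = "responded"      # status named in the conversation
    IN_PROGRESS = "in progress"  # status named in the conversation


@dataclass
class Candidate:
    name: str
    status: Status = Status.CONTACTED


def record_reply(candidates, name):
    """Mark the named candidate as responded; return True if they were found."""
    for candidate in candidates:
        if candidate.name == name:
            candidate.status = Status.RESPONDED
            return True
    return False


def needs_follow_up(candidates):
    """Return the names of candidates who have not replied yet."""
    return [c.name for c in candidates if c.status is Status.CONTACTED]
```

A real version would persist this state (for example, in the Google Sheet mentioned above) so that follow-ups survive across sessions; this sketch keeps everything in memory.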
That's the extent of our ambition: to make the browser into something that feels more like an OS, where these are processes that are running all the time. It's not going to be easy to do all this today. But in general, we have been successful at identifying the sweet spots, things that are currently on the edge of working. We nail those use cases, get the early adopters to love the product, and then ride the wave of progress in reasoning models. That's been the strategy.
And I'm not sure if it's just the reasoning models, or it's just that the product's early, or I haven't figured out how to use it correctly.
It's not like I'm saying everything will work out of the box with a new model. You really have to know how to harness the capabilities and have the right evals and version control the prompts and do any post-training of auxiliary models, which is basically our expertise. We are very good at these things.
I would say that based on, and I'll caveat that I haven't spent weeks with it yet, but based on my early experience with it, I would describe it as a little brittle or unpredictable in terms of the success rate. I asked it to book, or take me to the booking page for, a very specific flight that I wanted. And it did it. It took me to the page and it filled in some stuff, whereas the normal Perplexity or ChatGPT interface would just take me to the webpage. It actually took me a little bit further. It didn't book it, but it took me further. But then I asked it to create a list of everyone who follows me on X that works at Meta, and it gave me one person. And I know for a fact there are many more than that. Or, for example, I said, find my last interview with the CEO of Perplexity, and it said it couldn't, but then it showed a source link to the interview. So the answer said it couldn't, but the source didn't. I see some brittleness in the product, and I know it's early, but I'm just wondering: is all of that just bugs, or is it anything inherent in the models or the way you've architected it?
I can take a look at it. You can share the link with me. But I would say...
The majority of the use cases that we ourselves advertise are things that are expected to work. Now, will it always, 100% of the time, work in a deterministic way? No. Are we going to get there in a matter of months? I think so. You kind of have to time yourself so that you're not waiting for the moment where everything works reliably. You want to be a little early. You want to be a little edgy.
I think there are some people who just love the feeling of being part of the ride. And, you know, maybe the majority of users are going to wait until everything works stably. So that's why we think the sidecar is already a value add for those kinds of people, where, okay, they don't have to use the agents that much. They can use the sidecar. They can use Gmail. They can use calendar connectors. They can use all these LinkedIn search features, YouTube, or just basic stuff like searching over your own history. These are things that already work well. And this is already a massive value add over Chrome.
And once several minutes worth long horizon tasks start working reliably, that's going to make it feel more than just a browser. That's when you make it feel like an OS, like you want everything in that one container.
¶ Browser vs. Model Context Protocol
And you'll feel like the rest of the computer doesn't even matter.
We started this conversation talking about how you think the browser gives you this context to be able to create an actually useful agent. And there's this other technical path that the industry is looking at and getting excited about, which is MCP, Model Context Protocol. At a high level, it's just this orchestration layer that lets an LLM talk to Airtable, Google Docs, whatever, and do things on your behalf in the same way that Comet is doing that in the sidebar. You're going at this problem through the browser and through the logged-in state of the browser that you talked about, and that shortcut, while a lot of people, Anthropic and others, OpenAI, are looking at MCP as maybe the way that agents actually get built at scale.
I'm curious what you think of those two paths, and are you just very bearish on MCP, or do you think MCP is for other kinds of companies? I'm not extremely bearish on MCP. I just want it to mature more. And I don't want to wait. I want to ship agents right now.
I feel like AI as a community, as an industry, has just been talking about agents for the last two years, and no one has actually shipped anything that worked. Yeah. And I got tired of that. We felt like the browser is a great way to do that today. MCP is definitely going to be a contributing factor in getting agents into the field in the next five years. There are still a lot of security issues they need to figure out there. Having your authentication tokens communicated from your client to an MCP server, or from a remote MCP server to another client, all these things are pretty risky today, way more risky than just having your persistent logins on your client, in the browser. The same issues exist with Operator, which tried to create server-side versions of all your apps.
This is OpenAI's computer use tool, Operator.
Exactly.
I think there's going to be some good MCP connectors that we'll definitely integrate with, like Linear or Notion. I guess GitHub has an MCP connector. So whenever it makes sense to use those over an agent that just opens these tabs and scrolls through them and clicks on things, we're going to use that.
It's always going to be bottlenecked by how well these servers are maintained and how you orchestrate these agents to use the protocol in the right way. It doesn't solve the search problem on those servers, by the way. You still have to go and figure out what data to retrieve. You defined it as the orchestration layer. It's not the orchestration layer. It's just a protocol for communicating between servers and the client, or between one server and another server. But it's still not solving the problem of the reasoning: knowing what information to extract, knowing what actions to take, chaining together different steps, trying things when things don't work.
Whereas the browser is basically something that's been designed for humans to actually operate in. And extracting the DOM and knowing what actions to take seems to be something that these models, the reasoning models, are pretty good at. So we're going to do a hybrid approach and see what works best. In the end, it has to be fast, it has to be reliable, and it has to be cheap.
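The hybrid approach, judged on speed, reliability, and cost, could be sketched as a simple routing rule like the one below. This is an assumption-laden illustration: the `Route` fields, the `min_success` threshold, and the tie-breaking order are all hypothetical, and the conversation does not say how Perplexity actually chooses between an MCP connector and the browsing agent.

```python
from dataclasses import dataclass


@dataclass
class Route:
    name: str            # hypothetical label, e.g. "mcp_connector" or "browser_agent"
    latency_s: float     # average seconds per task (assumed to be measured elsewhere)
    success_rate: float  # fraction of tasks completed correctly, from 0 to 1
    cost_usd: float      # average cost per task


def pick_route(routes, min_success=0.9):
    """Pick a route that is reliable first, then cheap, then fast.

    If no route meets the reliability threshold, fall back to the most
    reliable one available.
    """
    reliable = [r for r in routes if r.success_rate >= min_success]
    if not reliable:
        return max(routes, key=lambda r: r.success_rate)
    return min(reliable, key=lambda r: (r.cost_usd, r.latency_s))
```

Ranking reliability ahead of cost and speed is a design choice made for this sketch only; a different weighting would be equally consistent with "fast, reliable, and cheap."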
¶ The Web's Future in an AI Era
So if MCP lets us do that better than the browsing agent, then we'll do that, right? There's no dogmatic vision here.
At The Verge, we care a lot about the way our website looks and feels, the art of it, the visual experience. And with all this agent talk and it collapsing into browsers, I'm curious what you think happens to the web and to websites that devote a lot to making their sites actually interesting to browse. Does the web just become a series of databases that agents are crawling through MCP or whatever, and this entire economy of the web goes away?
No. I actually think if you have a brand, people are going to be interested in knowing what that brand thinks.
And it might go to you, the individual, or it might go to Verge, or it might go to both. Doesn't matter, right? So even within Verge, I might not be interested in articles written by some other people. I might be interested in specific people who have like, you know, like their content or something. So I think the brand will play an even bigger role in a world where like both AIs and humans are surfing the web.
And so I don't think it's going to go away. Maybe the traffic for you might not even come organically. It might come through social media. Let's say you publish a new article. Some people might come click on it through Instagram or X or LinkedIn. Doesn't matter, right? And whether it would be possible for a new platform to build traffic from scratch by just doing the good old SEO tricks, I'm actually bearish on that.
It's going to be difficult to create your own presence by deploying the old playbook. You've got to build your brand in a different manner in this time period. And the existing ones who are lucky enough to already have a big brand presence have to maintain the brand with a different playbook too, not just doing SEO or traditional search engine growth tactics. We need to take another quick break. We'll be right back. AI is moving fast. So fast, it's hard to keep up.
In fact, in ServiceNow's latest AI Maturity Index, scores dipped 20% from last year. But that's okay, because AI isn't a sprint, it's a marathon. You may be behind today, but tomorrow you could be a pace setter. Dive into ServiceNow's AI Maturity Index and see how you can innovate as fast as your ambitions. Visit servicenow.com slash AI maturity.
Support for this program comes from FM. Established nearly two centuries ago, FM is a leading mutual insurance company whose capital, scientific research capability and engineering expertise are solely dedicated to property risk management and the resilience of its policyholder owners.
These owners, who share the belief that the majority of property loss is preventable, work with FM to better understand the hazards that can impact their business continuity and to make cost-effective risk management decisions, combining property loss prevention with insurance protection. At FM, we see what others don't, so we can help protect your business in ways others can't.
Learn more at fm.com and browse Sight Unseen, our new microsite with opinion, research and podcasts about hidden risks facing your business. This episode is brought to you by On Investing, an original podcast from Charles Schwab. I'm Kathy Jones, Schwab's Chief Fixed Income Strategist. And I'm Liz Ann Sonders, Schwab's Chief Investment Strategist.
Between us, we have decades of experience studying the indicators that drive the economy and how they can have a direct impact on your investments. We know that investors have a lot of questions about the markets and the economy, and we're here to help. Join us each week as we explore questions like: How do you evaluate corporate bonds? And what sectors of the stock market are outperforming?
So Kathy will analyze what's happening in the bond market and at the Fed. And I'll give you our latest analysis of the equities market and the US economy. And we often interview prominent guests from across the world of investing and business. So download the latest episode and subscribe. We're back.
¶ Monetizing Perplexity's AI Browser
On Comet as a business, it's very compute intensive. And it's still invite only. I imagine you wish you could just throw the gates open and let anyone use it, but it would melt your servers or your AWS bills. Right. So how do you scale this thing? Not only do you scale it from the product sense, with it becoming a thing that
normal people can easily use and understand, that learning curve we talked about, but also just the business of it. You're not profitable. You're venture backed. You have to make money one day. You have to be profitable. How do you scale something like this that is actually even more compute intensive than a chatbot? I think if the reliability of these agents gets good enough,
you could imagine people paying usage-based pricing. You might not be part of the max subscription tier of like $200 a month or anything, but there's one task you really desperately want to get done and you don't want to spend like three hours doing that. As long as the agent actually completes it and you're satisfied with the success rate, you'll be okay with trusting the agent and paying an advance fee of like $20.
And for the recruiting task I described, "give me all the Stanford alumni who worked at Anthropic," I think that is a very interesting way of thinking about it. That is otherwise going to cost you a lot more time, or you have to hire a sourcing consultant, or you have to hire a full-time sourcer whose only job is that.
If you value your time, you're going to pay for it. Let me give you another example. Say you want to put an ad on Meta, right? Instagram. And you kind of want to look at ads done by similar brands, pull that, study that, or look at the AdWords pricing of, you know, a hundred different keywords and figure out how to price your thing competitively. These are tasks where it
could definitely save you hours and hours and maybe even give you an arbitrage over what you could do yourself, because AI is able to do a lot more. At scale, if it helps you to make a few million bucks, does it not make sense to spend like $2,000 for that prompt? It does, right? I think we're going to be able to monetize in many more interesting ways than chatbots for the browser. It's still early.
But the signs of life are already there in terms of what kind of use cases people have. And if you map-reduce your cognitive labor in bulk to an AI that goes and does it reliably, it almost becomes like your personal AWS cluster with tasks described in natural language. And I think we have to execute on it.
But if we do execute on it and if the reasoning models continue to work well, you could imagine something that feels more like Claude Code for life. And Claude Code is a product that people are paying $1,000 a month for, because even though it's expensive, it helps you maybe get a promotion faster because you're getting more work done and your salary goes up and it feels like the ROI is there.
¶ Strategic Shift: Beyond Chatbots
Are you betting so much on the browser for the next chapter of Perplexity because the traditional chatbot race has just been completely won by ChatGPT? Like, is Perplexity as it exists today going away and the future of it is just Comet? I wouldn't say that I'm betting on it because the chatbot race is over. Let me decouple the two things. The chatbot race does seem like it's over in the sense that it's very unlikely that people think of another product for day-to-day chat.
From the beginning, we never competed in that market. We were always competing on search. We were trying to reimagine search in the conversational style. Yes, every chatbot has search integrations. Some people like that. Some people still like the more search-like interface that we have.
So we never wanted to go after that market, and we're not competing there either. Google is trying to catch up, and Grok's trying to catch up, Meta's trying to catch up. But I feel like all that is wasted labor, in my opinion, at this point. The way I would phrase it is the browser is bigger than chat. It's a more sticky product and it's the only way to build agents. It's the only way to build end-to-end workflows.
It's the only way to build true personalization, memory and context. And so it's a bigger prize, in my opinion, than trying to nail the chat game, especially in a market that's so fragmented. It's a much harder problem to crack, too, both in terms of intelligence, how you package it, how you context engineer it, how you deal with all the shortcomings at the current moment, as well as the end-user-facing UX, which could be the front end, the back end, the security, privacy,
and all the other bugs that you get to deal with when working with a much more multifaceted product like the browser. Do you think that's why OpenAI is going to be releasing a browser? Because they agree with that? I don't know if they are. I read the same leaks that you have, and it was very interesting. It came two hours after we launched. But I would say you also made another point about Perplexity
being ignored and Comet being the next thing. I don't see it that way because you cannot build a browser without a search. A lot of people praise the Comet browser because it doesn't feel like another browser. And you know why? One of the main reasons is, of course, we have the sidecar and we have the agent and all that, but the default search is Perplexity.
And we made it in a way where even if you're having an intent to navigate it, it'll understand that. It'll give you like four or five links if it feels like it's a navigational query. It'll give you images pretty quickly. It'll give you like a very short answer also. So you can combine informational queries or navigational queries, agent queries in like one single search box.
That is only doable if you actually are working on the search problem, which we've been working on for the last two and a half years, right? So I don't see it as two separate things. Basically, you cannot build a product like Chrome without building Google. Similarly, you cannot build a product like Comet without building Perplexity. So is there a Comet standalone mobile app and a standalone Perplexity app? Yeah, there will be standalone apps for both.
Some people are going to use the standalone Comet app, just like how they use Chrome or Safari, and it's okay. They probably won't do that because it's going to have an AI that you can talk to on every web page, including in voice mode, actually. You still want to just navigate and get to a website quickly. I just want to go and browse Verge without actually having any question in my mind. That's fine.
And I could go to Perplexity and have all the other things the app has, like Discover feeds and Spaces and just quick, fast answers without the web interface. That's fine, too. And we're going to support a packaged version of the Comet browser within the Perplexity app, just like how the Google app still supports navigation like Chrome. So, by the way, both the Google app and the Chrome app are WebKit apps on iOS. Similarly, both the Google app and the Chrome app are Chromium apps on Android. We'll have to follow the same trajectory.
¶ Competition and Distribution Challenges
Speaking of competition, I'm curious what you think of Dia, what The Browser Company has done. They released it around the same time as you.
They're moving in this direction as well. Obviously they're a smaller startup, but they got a lot of buzz with Arc, their original browser, and now seem to be betting on the same idea that you have with Comet. I'm curious if you've gotten to try it or how you think it will stack up against Comet. I haven't tried it myself. I've seen what other people have said.
I think they have some interesting ideas on the visuals on the front end. If I were them, I would have just tried it in the same browser they had instead of going and trying to build distribution on a new one. But yeah, it's interesting. We're definitely going to study every product out there. Our focus, though, goes more on Chrome. It is the big brother. And the way I think about it is even if I get
1% of Chrome users to set their default as Comet, that's a massive, massive win for us. And a massive loss for them too, by the way, because any ad revenue lost is massive at that scale. Is word of mouth the main way you're going to grow Comet, or are you looking for distribution partnerships beyond that? In the beginning, we're going to do more word of mouth growth. It's very powerful. It's worked out well for us in the past with Perplexity itself.
And we're going to try to follow the same trajectory here. And luckily, we have an install base of Perplexity already of, you know, 30 to 40 million people. So even if we get a good chunk of those people to try out Comet and convert some of those people who tried it into setting it as default, it'll already be a massive victory without relying on any distribution partnerships. And then we're obviously going to try seeing how to convert that progress into the kind of partnerships
Google has with a bunch of people. I just want to caveat by saying it's going to be extremely hard. We've spoken about this in the past, where Google makes sure every Android phone has Google Chrome as the default browser and you cannot change that. You lose a lot of money if you change that.
Microsoft makes sure every Windows laptop is coming with Edge as the default browser. Again, you cannot change that. You will lose a lot of money if you change that. Now, the next step is, okay, let them be the default browser. At least, can you have your app
as part of the Android or Windows build? You still cannot change that easily. Especially on Windows, it's basically pretty impossible to convince large OEMs to change that. So they have all these agreements that are locked in for several years, and you work with companies that plan for
the device that they're shipping two years in advance. That's their moat. In some sense, it's not even the product. It's not even exactly in the distribution world. It's more in these legalities of how they crafted these agreements, which is why I'm happy the DOJ is at least looking into Google. We've made a list of recommendations on that and I hope something happens there.
¶ Apple, Meta, and AI Talent Wars
Yeah, they may force a spinoff of Chrome, which would be really interesting and reset things. You know, there's a lot of people that think Apple should buy you. And Eddy Cue, one of their top execs, actually had some pretty nice things to say about you on the stand when he was there during the Google trial and said that you guys had talked about working together. Obviously, you can't talk about something that hasn't been announced yet, especially with Apple.
What do you make of that and Apple? I mean, I'm firstly honored by Eddy mentioning us in the trial as a product that he likes and that he's heard from his circles that people like. I would love to work with Apple on integrations with, you know, Safari or Siri or Apple Intelligence. It's the one product that almost everybody loves using, or it's a status symbol, like everybody wants to graduate to using an Apple device. So I'm pretty sure that
we share a lot of design aesthetics in terms of how we do things and how they do things. At the same time, my goal is to make Perplexity as big as possible. It's definitely possible that this browser is so platform agnostic that it can benefit Android and iOS ecosystems, Windows and Mac ecosystems. And we can be pretty big on our own, just like Google was.
Of course, Google owns Android, but you could imagine they would have been pretty successful if they just had the best search engine and the best browser, and they didn't actually own the platform either. I and others also reported that Mark Zuckerberg approached you about potentially joining Meta and working on his reboot of their AI efforts. What was Zuck's pitch? I'm curious. Tell me. Zuck is awesome. He's doing a lot of awesome things.
I think Meta has such a sticky product. It's fantastic. And we look at that as an example that it's possible to build a large business without having any platform yourself. Were you shocked by the numbers that Zuck is paying for top AI researchers, these nine-figure compensation offers? I think a lot of them are actually tied to Meta's stock needing to increase
for those numbers to be paid. So it's actually pretty contingent on the business and not just, you know, guaranteed payouts. But still huge numbers. Yeah, huge. And definitely, I was surprised by the magnitude of the numbers. Seems like it's needed at this point for them. But at the same time, Elon and xAI have shown you don't need to spend that much to
train models competitive with OpenAI and Anthropic. So I don't know if money alone solves every problem here. You do need to have a team that works well together, has proper mission alignment, milestones. In some sense, failure is not an option for them. The amount of investment is so big. And I feel like the way Zuck probably thinks is, I'm going to get all the people, I'm going to get all the compute, and I'm going to get all the,
like, you know, milestones set up for you guys, but now it's all on you to execute. And if you fail, it's going to look pretty bad on me. So you better not fail. That's probably the deal. What are the second-order effects on the AI talent market, you think, after Zuck's hiring spree? I mean, it's definitely going to feel like a transfer market now, right? Like the NBA or something.
There's going to be a few individual stars who have so much leverage. One thing I've noticed is Anthropic researchers are not the ones getting poached. Mostly. He has poached some, but not as many. It does feel like that's something labs need to work on, which is truly aligning people on one mission, so that money alone is not the motivator for them.
And as a company, your company is doing well. Like, you know, the stock is going up and you feel dopamine from working there every day. And you're encountering new kinds of challenges. You feel a lot of growth. You're learning new things. And you're getting richer too along the way. Why would you want to go just because you have some guaranteed payments?
¶ Perplexity's IPO Vision
Do you think strongly about getting Perplexity to profitability to be able to control your own destiny, so to speak? Definitely, it's inevitable. We want to do it before the IPO. And we think we can IPO in 2028 or 2029. I would like to IPO, by the way, just to be clear. I don't want to stay private forever,
as some companies have chosen to do. Even though staying private gives you advantages in M&As and decision-making power, I do think the publicity and the marketing you get from an IPO, and the fact that people can finally invest in a search alternative to Google, is
a pretty massive opportunity for us. But I don't think it makes sense to IPO before hitting like a billion dollars in revenue and some profitability along the way. So that's definitely something we want to get to in the next three or four years. But I don't want to stunt our own growth and not be aggressive in trying new things today. Makes sense. Yeah. You launched Perplexity. It's crazy that it's already been
just over three years now. And it was right around when ChatGPT first launched. And it's wild to think about everything we've talked about and that all this has happened in barely three years. So maybe this is an impossible question, but I want to leave you with this question. If you look out three years from now, you just talked about the IPO, which is interesting, but
what does Perplexity look like three years from now? I hope it becomes the one tool you think of when you want to actually get anything done. And it has a lot of deep connection to you because it synchronizes with all your context and proactively thinks on your behalf and truly makes your life a lot easier. All right. We'll leave it there. Aravind, thanks. Thank you.
Thanks to Aravind for taking the time to speak with me and thank you for tuning in. If you'd like to let us know what you thought about this episode or what else you'd like us to cover, drop us a line. You can email us at decoder at theverge.com. We're also on TikTok and Instagram via at decoder pod.
If you like Decoder, please share it with your friends and subscribe wherever you get your podcasts. And if you haven't already, don't forget to subscribe to The Verge, which gets you access to my newsletter Command Line and a bunch of other great stuff. Decoder is a production of The Verge and is part of the Vox Media Podcast Network. Our producers are Kate Cox and Nick Statt. Our editor is Ursa Wright. The Decoder music is by Breakmaster Cylinder. See you next time.
Support for this show comes from Seven Rooms. For the restaurant operators out there who want to create more regulars on the regular, check out Seven Rooms. Seven Rooms is an all-in-one CRM, marketing, and operations platform that helps you make more money and more magic for your guests.
It gives your staff the tools to deliver service that keeps your guests coming back for more. From direct reservations to smart table management to targeted text and email marketing, Seven Rooms helps you grow your brand and your covers and not your workload. Learn more at sevenrooms.com. Seven Rooms. Make magic. Make money. Support for this show comes from Icy Hot.
You get things done and the last thing you need is to put everything on hold for your aches to go away. You can make your comeback now with Icy Hot Pro Massage Balm. It contains two maximum strength ingredients that target multiple pain receptors, so you can experience instant relief post workout. Ice works fast and heat makes it last. You no longer need to let pain stand between you and your goals. Just massage the Icy Hot Pro Massage Balm directly onto your troublesome aches and boom.
You're so back. Buy Icy Hot Pro Massage Balm now. Delta Air Lines has partnered with the WNBA to provide charter flights for all the players. Now the players can enjoy simple card games, learn about everyone's family, and who does and doesn't watch Love Island USA. Yes, you will be judged if you don't. Because at Delta, they believe there's always more potential to own and help you achieve your own individual version of success. And by investing in the WNBA,
Delta has directly brought people to their potential by elevating the player experience through the current charter program. So if you want to get to your destination feeling good about yourself, check out Delta. Keep climbing with Delta. Book your next adventure at Delta.com.
