Maximize the impact of your launches. Be a little louder. You know your customers well, you know where they hang out. Take the incredible work that your engineering product research team is doing and amplify it to the world. Hi, everyone. You're listening to Scaling Dev Tools. I'm joined today by Luke from 0 ElevenLabs, and we dig into the origin story of ElevenLabs. And then Peter's like, "I think I can be the first person to make human-like speech
with AI." So he quit Google, built his own GPU set. Matty was still working at Palantir and basically, like, sharing his salary with Peter. They started with 11 -voices. They started with 11 customers. -How they created such a cool voice model. Peter managed to cross this threshold, like, make the best models. Like, imagine -there are not many GPs. -Launches and why they matter so much. 0 And so, the core assets, normally the first one we actually start with is the
tweet thread. So it's like, what's the hook? And there's a top tip as you start -with the word introducing. -Enjoy the episode. Hi, I'm Luke Harries. It's great to be here with you, Jack. -[laughs] Nice. -So yeah [laughs], great to be here. I'm -excited to chat. -Yeah. So yeah, for people that don't know, ElevenLabs is, like, the, like, best way to generate voice.
Yes, yes. So the way I describe ElevenLabs is we have the best AI audio models, and so we do this for text-to-speech, we were the first company to make human-like speech with AI. Uh, we do speech-to-text, so we've got the highest benchmarking, uh, transcription models. And then we also create AI agents with voices, so conversational AI. And, and we basically take these, like, core foundation models, um, which really is, like, you know, proper research, there's transformers
applied to audio. And then we build different products on top. So we have a consumer app, which is called our Reader app. We have a creator product, which is for making voiceovers for videos or audiobooks. And we have, um, developer products for our, like, API platform. And then these conversational AI voice agent builders. So yeah, quite a few different products going on. Yeah, super cool. I- I have not used it for the developer side, but... Which I'll -ask about in a second. -Yes.
But I'll just say, like, the few ways that I've used it. Um, so I'm using Scary Dev Tools for, like, intros when I wanna say, like, Reddit comments and stuff. -Yes. -Like, it was, like, in the intro saying what people had said on Reddit in different voices. Like, creating a... I used it for my nan's 90th birthday video, where I did, like, uh, her life using, and I, you know... Jean, Jean was born in 1938 or something. -[laughs] -And [laughs] in, on this street...
You won't be showing this to your grandma, this podcast [laughs]. Yeah [laughs], I won't show, I won't show her. Um, and then I also used it when I was, like, uh, trying to... I cre- this was ages ago, but, like, I created, like, a kind of audio book about FFmpeg to, like, learn more about how, like, the -filters work and, like- -Very cool ... using ChatGPT and generating. And it's, E- ElevenLabs is super cool. So, um... But maybe you could tell us about the developer products as well, like, how
-this... How people can use it as devs. -Yeah, 100%. So there's two main ways people use it. So yeah, lots of people who use it are the creators and they use our UI products. But everything right from the start, we've also exposed as APIs. And so the simplest API we have and the most used one is text-to-speech, which is you just send in text and you generate really high quality audio. And we have two
different models you can use for this. We have our, uh, multilingual model, which is like our big model, takes a little longer, but really does, like, the emotion right. And it's fantastic for async use cases. So it's used by companies like Captions, HeyGen, Synthesia, um, and that's actually- -Mm -... where the business got started, on, like, the API side. Um, and then over time, we've, like, refined these models,
made them more powerful and smaller. And so we have, um, the other class of models, which is, like, the low latency models. And so this is Flash and Turbo. And those models are really fantastic for stuff like the, the real-time use cases, so if you're creating a sales agent or you're creating a customer support agent or you're creating a tutor. And what we found... So that's kind of the text-to-speech APIs. And then the other main one is the speech-to-text APIs. And
so you can take this podcast, you can feed it in. Uh, it's also ridiculously cheap. Like, feeding in this entire podcast would only be a couple of cents. Um, and you can feed in the enti- yeah, like, uh, co- or pennies for the Brit. -[laughs] Thank you. -Uh, re- -Wow [laughs]. -Rewire My Brain for $3.00. -[laughs] -And- -That does look good. -But yeah, you, you can feed in the whole
podcast and it would give you, like, the most accurate transcript. You can also have audio events, so you could ha- like, have it output the breathing and the other bits. Um, and you can also get, like, character level timestamps, so if you're -making, like, captions type product. -Yeah. Um, and what we found, though, was lots of people were taking these really cool
APIs, so text-to-speech, speech-to-text. They were plugging those together to create full AI agents and, like, putting an LLM in the middle. -Yeah. -And, and so by about the 20th company that we had helped build these, uh, agents with text-to-speech, L- uh, speech-to-text, LLM, text-to-speech and turn-taking, we were like, okay, maybe we should, like, help people build that as well. And so that's why our conversational AI pro- product is where you can build these end-to-end voice agents.
-Oh, that's cool. That's really cool. -Yeah. Um, didn't realize you had that. I need to try that. -Yeah. -Um... Yeah, it- it's good fun. And the, the way that we've set it up is you can either, uh, bring your own brain... So lots of companies, you know, particularly at larger scale, you've already built an LLM-based or text-based product where you have, like, the LLM you like, you have the RAG, you have the e-bals, you have that
all set up. And you can basically wrap it with, like...... voice in, voice out, and -turn taking. -Mm. And so that's one use case. Or the other way is, like, if you're a small startup, we provide, like, the hosted RAG, the hosted LLM. -Mm. -Like, you can select from any of the top LLM providers, um, like, at cost for the LLMs. And what's really cool about that is, like, you can get to production so quick. So, you can just type in your
prompt, select your voice, and then you have the full agent. And you can either deploy it... Connect a phone number, or like put Snip on your website. Like on my website, you can actually go and chat to me about any of my articles, and that's -just- -Oh, really? -Yeah, I built in like five minutes, so... -Okay, so it is like audio chat with Luke? Yes. Yes. Yeah, yeah, with my own voice though. And it knows what blog? You can ask anything about, like, stuff that you've
-written? -Yes. Yeah, yeah, exactly. So it's got all -my blog posts loaded in, yeah. -Okay. Very, very cool. [laughs] Scaling Dev Tools is sponsored by WorkOS. If things start going well, some of your customers are gonna start asking for enterprise features. Things like audit trails, SSO, Skim provisioning, role-based access control. These things are hard to build, and you could get stuck spending all your time doing that instead of
actually making a great dev tool. That's why WorkOS exists, they help you with all of those enterprise features. And they're trusted by OpenAI, Vercel, and Perplexity. And if you use them for user management, you get your first million, yes million, monthly active users for free. I honestly don't know any dev tools that have a million monthly active users, apart from GitHub maybe. [laughs] So that'll get you pretty far. Here's what Kyle from Depot has to say about WorkOS.
We use WorkOS to effectively add all of the SSO and Skim to Depot. It's single-handedly, like, one of the best developer experiences I've ever seen, for what is, like, a super painful problem, if you were to go and try to roll it out yourself. So, for us, we can effectively offer SSO and Skim, and it's like two clicks of a button, and we don't ever have to think about it. It's, like, one of the
best features, uh, that we can add to Depot. It's super affordable, which effectively allows us to, like, break the SSO tax, uh, joke, and essentially say, like, you can have SSO and Skim as, like, an add-on onto your monthly plan. Like, it's no problem. Uh, so it really allows smaller startups to essentially offer, like, that enterprise feature, uh, without a huge engineering investment behind it. Like, it's literally we can just use a tool behind the scenes, and our life is
-exponentially easier. -I think the biggest question I had about ElevenLabs is, like, sort of like, how? [laughs] How? Because, um, it felt like it came out, and it was, like, so much better than everything else. I was like, "Who are, who are these guys?" [laughs] Like, "How did they do this?" Yeah. So n- ElevenLabs was founded, um, by two guys, Matty and Peter, and they're both Polish, they grew up in Poland, they're childhood best friends,
and they're both super smart. So, uh, Peter, he went to, uh, Oxford, and then Cambridge, and then Google. And Matty, he went to, like, Imperial, ACity at Math, and then Blackrock, and then Palantir. And they're both, like, super smart, childhood best friends. And they would get together, and they would, like, use their vacation time to go together on weekends on holiday and just, like, hack on -projects. -[laughs] Really?
Yeah, yeah. They would, like, just hack together, um, which I think is a sick way -to- -Yeah, absolutely, yeah ... use your vacation time. Like, more people should do that. They, growing up in Poland in the early 2000s, and even now, when you listen to movies or TV shows, like, Poland's a pretty small country relative to the world. Not that many people outside of Poland speak Polish. And so when you translate films and movies into Polish, and largely they don't translate it, so you just get the
subtitles. Or if they do translate it, they just have one speaker, one voice actor who does the entire thing. So if you imagine watching Friends with Rachel and Ross, and they're both played by one middle-aged Polish man- -[laughs] -... the love scene's not gonna be any
good. And so what, uh, what Matty and Peter were riffing on is like, "How can we make it so we can take, like, any content and turn it into any language?" And so when you, like, decompose that problem, it's like, okay, what do you need to do? Well, first you need to, like, understand what they're saying, what the emotion is. Then you need to, like, translate it into the other language, and then you need to turn that text into speech. And, and so they're like, "Okay, decompose the
problem. Let's first try and tackle the text into speech." And Peter, like, ex-Google, he basically spent all his time throughout, um, throughout his life just, like, working on his own research projects. And this problem in making human-like speech, it's not a new problem, like, we've been doing it literally since -the 1700s- -Yeah ... with like wind-powered machines going through various pipes. You then had the early digital speech synthesis machines with, like, Stephen Hawking. And Stephen
Hawking's voice is great. It's very distinctively him, it's fantastic for accessibility. But you wouldn't want to get, like, read your kids a bedtime story -with Stephen Haw- -Yeah, yeah ... Hawking's voice, unless you're saying it's Stephen Haw- It's, like, you know, a wholly different experience to calm with Matthew McConaughey with those deep tones. Yeah, yeah. So he was like, "Can I turn text into speech?" Like, tackled this problem, left
Google, built his own GPU rack, and, like, managed to do it. And then ElevenLabs -launched- -Really? -Uh, yeah. -So he just did it, like, he built his own -GPU rack -He, yeah. And there's a, there's a cool bit whereby Peter tells Matty, like, they had hacked together various, like, prototypes that point, or, like, different bits around audio. And then Peter's like-I think I can be the first person to, like, make human-like speech with AI. And
so he quit Google, he built his own GPU set. Matty was still working at Palantir and basically, like, sharing his salary with Peter. -Oh, that's cool. -And, and then Peter, like, built it, and, like, as they're nearing the move, it's like, Matty then quit Palantir as well. Like, they properly co-founded, incorporated the company. Um, and yeah. And then they- they started with 11 voices. They exposed it through API and
UI. It's just, like, literally a small text box on the site where you could type in text. Um, and they started with 11 customers. Their 11 B2 customers. -No way. -Yeah. Uh, you'll see the word 11 a lot. And they really realized they had something when someone took their entire book, and paragraph by paragraph entered this into this small text box on the site to generate the book and then stitched it together.
-Oh my gosh. It must've taken days. -Yeah, days of, like, copying a pic... We have- we have much better systems for that now, but yeah. [laughs] Yeah. Wow. That- that salary sharing. Do you know Veed? -Yes. Yeah, yeah. -I think that's also how they started. -Really. Yeah, yeah. -Because I think, uh, I think S- I can't -remember if Saba or Tim was working. -Yeah. And then the other one was, like, doing ha- taking half the salary. -Yeah, yeah. Yeah. -And that, yeah. It's actually a really
-cool way to do it. -That's a really cool way to do it. I think -it's- it is a very cool- cool way. -The thing I was curious was, like, the data as well. Like, does it like... How did they train, like... -Yeah. -Just like one... 'Cause one guy with GPUs in his room. Also, like, it seems like the da- the voice data would be really -important. -Yeah. The data... So, in general, we don't -talk about our, like, training data. -[laughs] Yeah, secret. -That's like... Voice is so competitive. -Yeah.
And so, like, there's just certain things where, like, you know, I can talk a lot -about other- -[laughs] -... but that we're just like, you know- -Okay, okay, okay. -Competitors would love to know. -Yeah. And so, yeah. The... But what the- the interesting thing with training these audio models is, unlike other types like intelligence, where you do need a ton of GPUs. Like, Peter managed to get- like, cross this threshold, like, make the best models, like, on actually really not very many GPUs.
-Mm. -And now we have scaled it, and, like, the more you scale it, you do now see increasing results. But that was, like, quite an interesting quirk of the training. -Yeah. Yeah. -Which, like, you may not have otherwise been able to, like... One guy pretty much in his bedroom. It's a good constraint as well to start with. -Yeah. -But if you have that, prove that it could -be good, and then... -Yeah, 100%. -Yeah, blow some money [laughs]. -Yeah, yeah, yeah.
Yeah [laughs]. Um, okay, cool. And then you yourself have been- -Mm -... I was saying to you before that, like, next time you join an early stage startup, I just need to apply to whatever company it is. Because, I mean, you've worked with Posthog. -Yeah. -Now Eleven Labs. Like- -Yeah, yeah. -I mean, people know Posthog, I think, as -well. Like, amazing company. -Yeah.
-Um, how did you join Eleven Labs? -Yeah. So, um, Matty and I, when we were 19, one of my best friends, he's Polish, and- and we were studying at Cambridge together. I was actually a medical student, but spent all my time coding, hacking on projects, trying to do startups. And my friend Philip, me and him would hack together on various things. And he was like, "Luke, there's HackCambridge," which is, like, the biggest hackathon in the university. Um,
"I've got two friends, let's just go do it one weekend." I was like, "Yeah, sure. Sounds like a lot of fun." Turns out this hackathon's probably the most important weekend of my life. So it was me, Philip, and Matty. Matty who then went on to found
Eleven Labs. Um, and we get together and we actually built something which is, like, uh, it's completely different to Eleven Labs but similar space, which is, it was help blind people use the internet by running the images on the website through a vision model, generating captions, and then reading that out loud. Oh, really? Yeah. So it's, like, slightly funky audio related bit. Um, but it worked really well because the issue is, like, you know, loads of pages on the internet don't have
-captions. -Not accessible, yeah. So they can't tell. You just hear image. Like, what's the image? -Yeah. -Um, so we did this hackathon, and we actually ended up winning the Microsoft prize. And so we each got an Xbox. We're like, "Oh, this is sick." Like, we all immediately sold it on eBay for- -[laughs] -... for 350 quid. Um, and then, uh, we,
like, just stayed in touch. And what's been amazing with that group, and that is why, like, if you can just do, like, the only tip I think I would give young people is, um, like, do hackathons with your smartest friends, or do projects with your startest- smartest friends. Don't call it a startup. -Yeah. -Don't try and raise money. At least for... Just get some projects under your belt and work with people who you think are
amazing. Um, and Philip, who organized this hackathon, he went on to found WordWhere, which raised the largest seed round ever from YC. Um, they're building, like, what's the new version of Zapier but in an LLM-first World, where you can basically, like, type in natural text where I want to build no-code tool. And, um, and then Matty went to found on Eleven Labs. -[laughs] -Um, and so me and Matty, we basically catch up every six months and riff on different ideas. Um, and the story of how
I joined Eleven Labs is that, um, I basically missed out on investing in it. I was like, "Oh no, I mucked up." So, the- the story in the investing is we caught up and he goes, um, "Yeah, Luke, I've, uh... I'm just- just leaving Google to found a company." Um, and we were swimming in the Hampstead Heath ponds, which, for the Americans, a pond's a small dirty puddle of water. -[laughs] -And yes, we- we do actually go swimming on
them in London. There's not- there's not the lovely Californian ocean like they're -used to. -Yeah [laughs]. So, we- we're going swimming in these ponds. -It's true. -He taps the thing and he's like, "Ah, it's 11 degrees, my company's called Eleven Labs." And I'm like, "That's a pretty
weird... Eleven Labs? Like, what's the- what's the product do?" He goes, "Well, step number one, we're gonna build the best..."... AI audio models in the world, so we're gonna build the best text-to-speech models, like human-like speech. I'm like, "Cool, but like what's that actually? You s- you're building some AI model." Um, and then he goes, "Well, number two, we're gonna sell it to everyone. We're gonna build a reader app where people can read articles out loud,
uh, on their commutes. We're going to build a creator platform where people can make audiobooks and voiceovers for their videos. We're gonna create a developer platform where we give API accessor- access. And we're going to build a way for people to make end-to-end voice agents." And I was like, "What do you mean you're gonna sell it to everyone?" He goes, "Yeah, from like companies like Netflix to dub their videos to, um, you know, the best startup founders." I'm like, "That's
a terrible go-to market. What do you mean you're selling it to every..." Like, no, surely you need like one persona you go deep on. He's like, "Yeah, I think we're like... Well, we'll just build the models first and then we'll see how it goes." And it turns out they built the models and they launched and it went incredibly. And he didn't explicitly ask to invest, but I'd done like little bit by then. -You could have pushed. [laughs] -Could, should have pushed.
-[laughs] -Should have pushed. That was a mistake. Um, but then we basically stayed, you know, we still were chatting every so often. And I would constantly get VC friends pinging me, being like, "I heard ElevenLabs has just got this many users or created this much audio." I was like, "Oh -my God." -Oh my God. The FOMO must have been insane.
-[laughs] -The FOMO was building. And then Matty at some point rings me and goes, "Oh, Luke, we're looking for someone to do like all -the growthy things at ElevenLabs." -Yeah. Like help us get more users, do SEO, do paid ads. And I was like, "Well, I've never formally done growth stuff." Like previously I was leading product. I founded my own company where you're doing like all the- -Yeah, yeah -... COE type things, basically, you know,
building the first version of product. And I very much saw myself as like more of an -engineer. -Mm-hmm. -But I was like, "Sure, I'll join." -Yeah. Give it a go. And, and it's been a lot of... I've actually really enjoyed like the... I've reframed in my head what marketing is, uh, which I think has helped. So yeah, I've really enjoyed. Really enjoyed it.
-How have you framed it? -Um, I think basically it's just like how do you tell the world about the incredible work the research and the engineering team is doing? And what's, what I've realized I really like with ElevenLabs and it's like, it's not necessarily me building the product. I'm like, look at the incredible models that Peter has built, or the incredible, um, APIs that Tim and Flavio, or the incredible conversational AI product that Josef has
built. And it's like, how do we use our creativity to just like tell the story in -the best possible way? -Mm-hmm. Like, how do we make this incredible launch video, which really encapsulates what this conversational AI product is? Like takes it down to the core abstract in a way which will resonate. How do we create like pages for SEO which genuinely -add value? -Yeah. So when you type in text to speech Hindi, we have a speech box where you can like
type in any Hindi and it'll generate it. Like that's genuinely useful. And so I think the reframes from like these small growth acts of like your own stuff, which feels a bit egotistical, to like know how do we like tell the story of what the team's doing in an amazing way? And then like also do that in a way where we -deliver more value. -Yeah, and I guess that is a lot easier to do when it is such a kind of growing thing. -Yeah. Yeah. [laughs] -Yeah.
I mean, I, I, I always joke it's like, like the growth team, we, we try our best. We have some incredible people on the team. But like it's, it's a different game when you're playing with the quality of these models and products. And the, the analogy I use is it's like we're in a car race. Like me and you, we're both racing along, we're doing engineering, we're doing product, we're speaking to users,
we're creating SEO pages, we're scaling marketing. Like we're in a normal car race, and then Peter and the research team just come in and give you a nitro boost. -[laughs] -I'm like, like genuinely, they drop the -best, uh, speech text model in the world. -Yeah. It's like suddenly the sales team has a much easier job. Or like our marketing
-video is gonna be way better. -Yeah, that's actually... 'Cause I think for that one especially, 'cause I'm obviously interested in that- -Yes -...'cause the podcast and stuff is like, it's just... If, if, if, if you can say, "Okay, that is actually the best in the -world," it's like- -Yes. -Okay, well- -Yeah. I think kind of that's, that kind of trumps everything where you're like... And the nice thing about the speech to text model, which we haven't cracked in
text to speech, speech to text is so objective. There's a ground truth. -Mm-hmm. -We know exactly what we said. -Yeah. -We've not, um... Yeah, like you know exactly what you said. You can compare a benchmark, you can score it, and ElevenLabs beats the new version of Whisper, it beats Gemini, it beats
blah-dee-blah, and it does it in all these different languages. Whereas the tricky thing with text to speech is, and why I respect sound designers so much, is like you need to have your ear attuned to like helping work out, like work out which is the better voice. And what we find is when customers A-B test our voices versus other voices in their application, like retention goes up, engagement goes up, but you don't necessarily have that same like core benchmark.
-Yeah. -Um- So it's kind of like down, downstream from- -Yes. -What- Yeah, yeah. You see it on your business metrics. If you have a great ear, you can hear it. Um, and then there's like a few other axes then where like we tried to make it super clear like we have the widest selection of fantastic voices. -Yeah. -So you can use like, um, a British voice or you could use Californian voice, or you could use older man, younger man- -Yeah -...conversational, audiobook style.
There's all these different axes, uh, of voices, which we've tried to really make it so you can experience the different power and variety of these models. Yeah. What we should do actually, and I need to... And I'll put this in the, in -the notes so you know that I- -Yes ...actually remembered to do it.... if you don't see that, then don't trust it. But -like- -[laughs] ... we should use the raw, like, we should transcribe this with- -Yeah. -... um-
-Yeah, 100%. -And not touch it at all, so that people -can- -Yeah, yeah, raw transcripts. Yeah. -So that people can assess, um- -Yeah ... as well. Um, that would be super cool. -Yeah, yeah, 100%. -Um, yeah. And also, like, I saw that the -Lex Fridman thing. -Mm. -Was that, was that big? 'Cause- -Yeah. So Lex, we've been really fortunate to work with Lex on a couple of projects. Um, so Lex Fridman has this fantastic podcast. He interviewed Volodymyr Zelenskyy.
-Yeah, listen to that one. -Uh, which is like, you know, the president of Ukraine, absolutely massive interview. And during their conversation, they're speaking a combination of English, uh, Ukrainian, and Russian, all within the same podcast. And the number of people, there's not... You know, it's very important that people who speak all those languages, as one person, understand that.
But it's also b- important, like, everyone across the world, who maybe only speaks English or only speaks Russian or only speaks Ukrainian, can also understand the full podcast. And so what we did was we worked with his team to translate that such that everyone, eh, could do it. And when we say translate or dub these, what we do is we create their voice clone from their voice, and then generate, like -regenerate basically the impa- -Mm.
... the entire thing, uh, using either the English or the, or the Russian or the -Ukrainian. -That's really cool. -Yeah. -And did it just kinda happen organically -or was it something that you were like? -I think he basically wanted... So he's, I think, Russian-Ukraine. I'm not exactly sure where he's from. -Yeah. -But he had planned this podcast, and was like... And we've worked together on different bits and things before to dub
his general podcast. And he was like, "Okay, we would love your help to, like, -do this." -Yeah. And, um, dubbing isn't yet at the point where it's, like, fully end-to-end fantastic, um, without human intervention. -Mm-hmm. -So the way we do, we have couple of different versions of dubbing. You can either use, like, end-to-end dubbing, which is, like, fine for consumer use cases. Like, we've not really crossed the threshold yet of it being fantastic. There's Dubbing Studio, where you can go
in and, like, manually edit the translations. And actually, lots of the issue is on the translation side, um, rather than, like, the voice generation. -But you can go in- -Mm-hmm ... and manually edit the translations and regenerate. Or we have this fully managed
service called 11 Productions, where we work with you to do the translation. And that's what we arranged with Lex Fridman's team is like, okay, as soon as he'd finished the podcast, he would send us all the things, and 24 hours later, we'd turn around the full stuff. And what we didn't realize, we thought it was going to be mainly Ukrainian, or mainly... I think we thought it was, actually, we thought it
was, yeah, mainly gonna be Ukrainian. And then we get the audio files come in, and we're like, "Oh, wow, they're selling, they're saying English, Ukrainian, and -Russian, all within the same sentence." -Yeah. And so we need people who are, like, expert level translators. Because the pressure if you mistranslate global leaders- -Yeah. [laughs] -... is huge, yeah. -[laughs] Yeah. -Yeah. -It's like- -So we had that one, and then we had another fantastic one with Modi, who's the President of India.
-Yeah, yeah, yeah. -Which was, yeah, a real privilege for our -team to do. -Okay. And that one was probably a bit, a -little bit easier. -Um, from the technical side, yes, still -very high stakes. -Yeah, yeah. Um, but yeah, we were very pleased with how that one came out. Okay. That's really cool. But I, I feel like Lex, Lex is kind of, like, feeding into, like, the developer, so the developed products.
-Yes. -I felt like... [laughs] I mean, I saw a survey recently, and I think Lex Fridman was, like, top five podcasts for developers. And then the reach beyond that is obviously massive. -Yes. -Um, it must have driven a lot of growth -on, like, the developer product. -So we do a few different things for the
developer products. So we do partnerships with fantastic people like Lex. We do our AI Engineer Pack; I'm not sure if you've seen that, where we partnered with -companies like WorkOS and Resy- -Oh, cool. -Basically all the top- -Yeah -... developer tools in the world. -Yeah. Shout-out WorkOS- -If you want free credits- -... sponsor in the podcast as well. -Yep. Shout-out to WorkOS. -[laughs] Um, if you want free credits to like any of the top dev tools in the world,
MongoDB, Mistral, BlackFrost, li- like, go to aiengineerpack.com. You sign up with GitHub. And as long as you have, like, legitimate GitHub, we will give you, I think it's about $50 in credits for all these different tools. So if you're hacking around, that's really fantastic. That's driven a lot of growth. Um, and then we have, like, a developer experience team, who I personally think they have
the best job in the world. And if I, uh, was to switch jobs, I'd want to join that team, which is they just build cool products and cool demos with their -product. -Really? So yeah. I did one recently, which was Siri. Is Siri, like, iPhone dictation's actually pretty bad, um, and, like, you're constantly having to go back and edit. So I built, like, an Apple shortcut, which, um, uses our speech-to-text to
transcribe what you say. So you just double tap- -Huh -... on the back of your phone or hold down the action button. Uses speech text. You can go on a monologue for 10 minutes, and then it feeds it to OpenAI's LLM or you can do Gemini or any others, and it cleans it up. So say I go, "Hey, James, it is great to meet you. Ha. Actually, hey, -Jack, it's great to meet you." -Yeah. [laughs] It will just automatically adjust it to, "Hey, Jack." -Yeah. -And then that wor- so that's like one demo
-we did. -Sick. Uh, we did a really cool 11 Labs MTP server demo. -Oh, nice. -Um, where you can even use... So conversational AI, on the fly, you can just say, "Order me a pizza from this number," and it will on the spot make you an AI agent and then phone that number.... the agent will then have the prompt, "Your job is to order a pizza with these toppings," da la. It will do it, and cheekily, in the video, we even had it -like negotiate the price of the pizza- -[laughs]
... just with ourselves. But like... And then, and then like... It genuinely can work for like automating any phone call. So there's like the MCP server and there's lots, and we have this like fun process for like launching products and features where we run it through. So yeah, launches are really big for us. We try and maximize the impact of every, like, new model. Okay, cool. And then you've just taken over Louis, uh, from Louis, right? On the,
-um, AI tinkerers? -Oh yeah, yeah, yeah, yeah. So Louis, um, yeah, AI Tinkerers, for people who don't know, I honestly think it's the highest quality engineering community I've ever been to. Like, the people are fantastic- -Yeah -... they're super friendly, and there's a -real like home brew computer club vibe. -Yeah. And so I'd been attending the London one since like the very first lot. I demoed some automated AI email roundup that I built, and then I think I've done a couple
of demos since then. Um, and it's just an incredible community, and so when Louis told me... Louis's the founder of Bloop, which does, um, Cobalt to Java conversion using LLMs, really cool product. And, and when Louis was like, "I'm heading to New York. We've got these big banking customers to close." -You're like, "I gotta swoop in." -No, I was like, "What's happening to the -community?" -[laughs] -I was like, "You started this thing." -[laughs]
I mean, Louis's done a fantastic... Louis's also been the biggest champion- -Yeah -... for AI in London that I know. And he's also hilarious. Also hilarious as well. And I was like, "You're leaving? What's going on?" -Hilarious, hilarious guy as well. -I know. No, he's also... Yeah, I mean he's really funny. So when he was leaving, I was like, "What are you gonna do?" -Yeah. -And then he's like, "Well, would you be up
for doing it when I'm away?" I was like, "Listen, it's your baby. If you ever come -back, it's yours again." -[laughs] But like, I would be honored to, to help keep it going. So yeah, we're, we're... I've got a few plans where I want to do... Louis did fantastic job, like meetups every few months, super high quality. I'd probably love to make it a monthly thing. -I think now just the pace of like- -Mm-hmm ... AI, like the new model droppings. We now have the OpenAI Image Gen model. We
now have... There's just so many cool things, which I think we can, um, get some incredible speakers to come through and, uh, yeah, see if we can scale it up a bit -further. -Yeah. The thing I like, I think most about, uh, AI Tinkerers, that I think Louis did really well, was like he... And I've spoken to him about this on the show, but he... I think he has this cool like, -you have to present work that you- -Yes -... did yourself. -Yes.
And then he also is like so good at interrupting people. [laughs] Yeah. Yeah, yeah. The... Yeah, so the rule is you need to present your own work. And that basically means no dev rels, no salespeople, no marketers. It's like the -vibe is home brew computer club. -Yeah. Like, we do allow sponsors to keep the events going, but like if you want your
sponsor to speak, it has to be the engineer who actually built the thing. And then yeah, the bit which I don't... will be very hard for me to fill- -Yeah, you've got to do it -... is that he just does hilarious -heckles. -Yeah. [laughs] And his main heckle, if you've ever been, is, "Make the font size bigger!" -[laughs] -Because like it will be, yeah, like- -Yeah -... you know, pretty much- Everyone always does that. Everyone does the small- -Everyone always does it. And the- -Yeah.
I had this like running joke, mainly with myself, which is like, "Which will come first, AGI or X?" And, uh, one I did recently was like, "AGI or reliable autofill?" Like, how has no one solved that? Another one is like, "AGI or like -screen mirroring?" Because everyone- -Yeah ... comes up. They plug in their screen. They then see a second monitor- -[laughs] -... on the presenter. -Yeah. -And they're like, "Okay, I now need to -assist in preferences." Anyway. -Yeah. -Yeah. -It's, it's crazy.
-Yeah. -Um, and- -But some problems are just hard. -Yeah. Like, how has no one solved that as a OS level thing? But anyway. -Yeah. -[laughs] Yeah. It's, it's wild. I guess they need, uh... I feel like Raycast are like trying -to solve like just a ton of things. -Yeah. I don't know if it's like... If it's just like anything annoying, I feel like maybe
-Raycast could do it. -Raycast, Raycast is fantastic. The, um, the killer feature I think they've done recently is the focus mode- -Oh, that's so good -... where it turns off Slack- -Yeah -... sets a timer, doesn't let you open it -again, just really slick interface. -And you set a goal as well. I quite like
-that. It's quite cool. -Yeah, set a goal. The only thing I've messaged Thomas being like, "Please can you ship this?" is like automatically change your Slack notification status to like- -Ah -... focus mode back in, however-
-Yeah -... long the duration is. Because the tricky thing, like it's... I feel fine doing it, which is like turn off Slack, um, but I can imagine if you're like a new employee or something- -Yeah -... you feel like you need to keep that -green- -Need to get the fast replies and stuff ... circle on. And I'm like, "No, like do deep work. That's where everything
meaningful actually happens. Turn off Slack." But it would maybe be nice just to signal of like, "Hey, I'm still like working, but, you know, I've turned off my -Slack." -Yeah, yeah, yeah, yeah. True. That would be cool. I think they, they, they're gonna get some Elevenlabs in there, uh, in -Raycast. -I, I personally think... So this hacky -prototype I built on Shortcuts- -Yeah
... which actually, if people are interested, it works really well. And all you have to do is click the iCloud Shortcut link, and then you get it in your -iPhone, as well as on your Mac. -Mm-hmm. And you can bind it to a shortcut. So it works really well. The only tricky bit is you then need to like add in your Elevenlabs API key and your like- -Mm-hmm -... OpenAPI or Gemini, OpenAI or Gemini key. Um, but I was saying to Thomas like, "Surely you guys should do like a Super
-Whisper killer." -Yeah, yeah. Because they already have the fantastic interface. It's all set up. They're doing -all these AI features, but- -Yeah -... we will see. -Yeah, I think he's showing me some cool stuff they're doing where like you could just say like, "Change the desktop -background." -Oh, really? -And it would like do it- -Oh, cool -... and stuff. -Yeah, yeah. -It's like that, yeah, -I have no insight into their roadmap.
-They could do with some cool stuff. -I also think certain companies, the -roadmaps are very clear. -Yeah. And certain ones are really hard. Like, Posthogbot was very cool, was like-... we have this very clear ICP, or product engineers. And they're like, almost is pretty clear of like, what are all the features that product engineers need to build product? Raycast, I don't know how they prioritize- -It's like... [laughs] Yeah -... like what they build, but yeah, kudos
-to them. -Just, just make computers and phones -better, it's like, they're there. -Yeah, they're not playing easy mode. -Yeah, yeah, yeah. [laughs] -Yeah. Um, okay, so on, like you talked a bit about the growth stuff, but like w- with ElevenLabs, it seems like there'd be so many people using it. You get this like crazy growth. What, what really actually like matters? Like what, what can you do -that matters? -So the biggest thing we try and do is
maximize our launches. And so the research and the engineering team just creates incredible models, incredible products. And the best thing we can do is just make -sure it gets out in front of everyone. -Mm-hmm. And so we have a process by which we run through every launch. And so when we get told there's a new model, like speech to text, we go, "Okay, is this a tier one launch?" Is this like top, big, we want everyone to know about it, it should be
surround sound the entire week at that launch? Or is this tier two, which is like it's a really cool feature, we should be loud about it, but it's not like let's use -all our resources possible- -Yeah ... to let everyone... And then tier three is basically like it's on the change log -and the email list. -Yeah, yeah. So the idea is, um, you know, when something comes, and wh- actually the growth team is constantly looking for like what's the engineering team shipped,
-which should be launches. -Mm-hmm. And then we run it through our checklist. And so I have actually a blog post which goes into all of this with like literal templates that we use for our launches. -All right. -And, and for a tier one launch, the first thing we do is like when the product's ready, it's like what's the core messaging we want to get across? And there's normally one primary message, which we want everyone to think about. And for us with speech to text is like the, uh, the
most accurate speech to text model in the world. So like that's your... If we don't launch, right, that's the only thing, like if it's one thing, that's the only thing they remember. There's normally then a couple of secondary, like value props and messages we want to get across. So it would be like it's feature complete, so it has diarization. It has character level timestamps, word level timestamps. So that's like a, a second bit. Another one would be, um, it supports many different
-languages. -Mm-hmm. And like more languages than ever before. So 99 plus languages. One like side point is, a really cool bit is, before you just didn't have accurate transcription models for Serbian, for Malay, for Gujarati, which is like millions of people who just don't have transcription as a thing. Like iPhone dictation does not work for them. -And like now we do. So that was- -Super cool. So we're like, but what's the primary message we get across? Then we turn that
into our core assets. And so the core assets, normally first one we actually -start with is the tweet thread. -Really? -So it's like what's the hook? So- -Mm-hmm ... and there's a top tip, is you start with the word introducing. -Introducing. -Or launching. -Yeah. -Like make it super clear you're launching -something cool. -Yeah, yeah. So go like, "Introducing the world's most accurate speech to text model." Boom, one
line, line break, nice and clear. Copywriting matters, nice and clear. Then you go into like bullet point list or paragraph with some of these secondary bits, or reinforcing the first bit, or you know, on benchmarks it beats x, beats Whisper and it beats... And so that, and then we attach a video. And so every tier one and tier launch, tier two launch video we do, we make a video. And videos are hugely underrated overall. Social algorithms massively focus on videos. So
you need like a video. And there's a couple of different types of videos. One is motion design, which is kind of like design, Figma style design or animated, like design which is animated. So that's one. Uh, the second one is like the -founder led videos- -Mm-hmm ... where like you as the founder are there introducing it. And maybe it's okay given I've spoken very positively about Raycast. Raycast, their product's amazing.
Actually, if you look at their most recent launch video, founder led videos are really risky, I think, for launches because often you speak for about like a couple minutes before you even introduce the feature. -Mm-hmm. -And with the TikTok level of attention, so like make sure you get your like, your, your core message is the most accurate speech to text model. Say that in pretty much the first sentence. Because if you look at then the retention of the video, people drop off. -Yeah. -So like-
-Need to get it in the first. -You need to front load it. Uh, so that founder style videos I think are actually a bit risky unless you're excellent at editing. Um, and then the third one is more like screen share. -Mm-hmm. -And particularly for developer audience who love to nerd out about like product and tech details, that, that can work -really well. So- -And that's like further down the chain, like people that are really interested go and watch the screenshot? Or-
Yes, yeah. Um, but actually for tier two videos we often lead with a screen share. -Mm-hmm. -Because it's like, you know, one feature we've just shipped is agent to agent transfers. Get, so you can create different agents. You can even create human agents, or hum- -Ah -... humans you then transfer the call to. -Mm-hmm. -So say you phone up, it's like- -Yeah -... "Hey, I'm Luke from Eleven Labs," or
like, "I want a refund." It's like maybe the AI isn't allowed to deal with that, so -they- -Yeah ... forward it to a he- So for that, we can actually like within an hour, within half an hour, film a quick screen share of a compelling demo, like showing the technical details. And you as an engineer who's maybe already considering this -product or these types of products- -Yeah ... that's actually exactly what you want to see. -Yeah. -You don't need flashy motion design.
-Yeah. -Um, so yeah, we do the X or Twitter thread. We do the video. Um, and that's normally the com- blog posts for new models with like ideally technical details, benchmarks, it's really great.... and then you explode those into what we call, like distribution. So then repost to every channel, repost to X, repost to Threads, repost to Bluesky, uh, launch it
on Product Hunt. And actually, if you've got the core of these assets, it's actually really quick to do that, and then get your team to boost it, be loud, be proud about the incredible product and- and model you've just launched. Yeah. That- that's actually really cool. So what- what do people search to find the -blog post? It's just like... -Um, so my website's harrys.co, and it's -called Launch Your Product. -Okay, cool. -Yeah. -People should go check that out.
-Yeah. [laughs] -Sounds great. Um, okay. Yeah, and I guess this is just way easier, again, saying this, it's like if you have done something -so cool- -Yes, yeah ... um, would you use the same method if you're like, you know, more of a kind of like traditional dev tool that's like, you know, uh- Yeah. I think a great example would be what Resend does. -Mm. -Resend's building a like reliable email API and like corresponding features around that. -Yeah. -If you don't market it well, it's pretty
-boring. -[laughs] -Like it's not a new problem. -Yeah. You know, you've had SendGrid, you've had... Like there's probably like 50 other -email. Like Amazon does email sending. -Sure. Like it's not particularly new, it's not particularly sexy, but like every time they launch something, they still put in the time to make like the feature great, -to create like these fantastic- -Yeah ... React interfaces. Like they do make the products great. But then also they're
like proud of like, "Hey, we've just shipped this feature." And the really cool thing about launches as a channel as opposed to an SEO or paid marketing or something else is like every time you do a launch, you're telling your new customers, "Hey, we're shipping quickly. We've just built this incredible feature -for you." -Yeah. Like, "Come use this with the rest of our product." But also you're telling all your existing features, "Hey, this is like a new feature we've just shipped which you
should probably use." So it helps with acquisition, helps with activation. Maybe someone signed up, they didn't activate, they got stuck. -Yeah. -They're like, "I really needed agent-to-agent transfers. Now I've churned." It's like, "Oh no, actually they come back, they activate." Um, and then they retain because like suddenly they're going deeper, they're using all these cool features. So yeah, I think pretty much
every product should be thinking, like create a checklist for your launch. Yours, every company will look different depending on who their audience is, where they hang out. At- at Posthog we really cared about like hacker news. Could we... Like the sign was, does your blog, if you had like a hacker news post like really -rank, that was like a wow, we've- we've- -Done a good job.
We've done a good job. Um, so everyone will look different, but like create the checklist, iterate, make your launches impactful, and like you as growthy teams or developer relations or developer experience teams, like you owe it to your engineers and product teams and research teams to take the incredible work they do -and get it out for everyone else. -Yeah. Okay, so we're coming towards the -end. -Yeah.
I just wanted like one last [laughs] question to ask you is like you [laughs] have started, you worked at like two of, I mean, the best dev tools around, fastest growing, especially when you think like, uh, UK somewhat affiliated at least, I don't know if Posthog would say they're br- or Eleven Labs would say British company, but like what [laughs], how- how do you pick [laughs] where to work?
Both of them are pretty esoteric. You know, I've only got two data points. The first one with Posthog was I asked Dalton, so I founded a company which went through YC, and when I was working out my next move, I was like, "Ideally I do want to start another company again soon," and I was like, "Dalton's fantastic, he's our group partner. He's given me more good advice than anyone else in the world." And I was like, "Dalton," like, "I'm working out what I should do next." Like should I
launch another company straight away? Should I like go join a super early stage one? Should I join a, like scale up?" And he goes, "Well, Luke, you've never really seen what like true PMF feels like, what scale really feels like. And so I think you should like at least spend some time in a super fast-growing company." And so that's why I like joined Posthog and helped them tackle some of their product
stuff. Um, but I think that like looking back, probably the core signal is, I think there's different stages, but the most important thing is like are the founders truly excellent? Um, do they have... A really great proxy is like are there really great investors you really trust? Like is a16z involved with the company? That's normally a really great sign. Um, or Sequoia or- or the top firms. And then the product, like is it a product you really deeply resonate with-
-Mm -... and really want to use? Like Posthog I'd used as a product and I knew it was great and like- -Yeah -... I'd deployed my own version of Posthog on a Kubernetes cluster, um, and like used it to ship my own products. Eleven Labs had actually built like a AI priest, which would send you to... So I'd- -Really? -As soon as they launched, I was like -playing around with their API. -Oh my gosh.
I was like this is incredible. And how it would work is just like you could send it questions via text, it would send back voice notes. -That's really cool. -Like really play with the APIs, like is
this something great? Is it like a 10X better product? And so far, I do think I've been very lucky, but I think if you choose founders which are really excellent, which want to build something massive, not just some quick sell, like founders who are really excellent with excellent investors, and like the product -is really differentiated and fantastic- -Mm ... and you join at like a seed or series A, like there should be a lot of like -scale to go and- -Yeah
... hopefully you're only just getting started at this point. And at least Eleven Labs now, like we genuinely are still... We're only two years old. -Yeah. -We're- we're just getting started, we're hiring a lot, we're 200 people now, we're growing very quickly, so I mean, to- to give a cheeky answer, I would suggest they join Eleven Labs. -[laughs] Yeah. -And like one of the really cool things about joining at like this stage with Eleven Labs is like there's still so much -upside. -Yeah.
Like we're only $3 billion company, we genuinely think we can be- beat like the -next Google, we can- -Yeah ... beat OpenAI, we can be in like...... like trillions of dollars is, like, genuinely what everyone is aiming for in terms of, like, size, size and the impact being, like, can we get the best enterprises in the world? Can we get all of them? Can we get the best creators, the best developers? We're going, like,
multi-product and we're only 200 people now. And I personally find the, like, scaling phase super fun because there's still lots of, like, zero to one cool -stuff, but there's also, like- -Yeah ... you know, you get lots of people, fantastic talent joining. So yeah, it's a -lot of fun. -Is it remote or you- do you have offices? So we are technically remote first, um, but we do have in-person offices in New
York; London; Warsaw; San Francisco; um, soon India; Japan; Korea. So, like- -Wow -... we're scaling up these hubs, and what we find... Yeah, love working with the best talent wherever. And if you prefer to work from home, work from home. If you prefer being in the office, come join us -in the office. -Cool. Okay. Amazing. Um, and then I'll just ask, do you have one rogue question? Do you have any final things that you would say to people listening running a dev tool?
Yeah. I think if you're running a dev tool, it would be, yeah, just maximize the impact of your launches. Like you're already building really cool stuff. You know your customers well. You know where they hang out. Just, like, be a little louder. Take the incredible work that your engineering product research team is -doing and amplify it to the world. -Okay. That was amazing. Um, Luke, that was -really fun. -Yeah. Really enjoyed it.
Like super galvanized. [laughs] I'm like, [laughs] let me- let me- let me get my CV -ready. [laughs] -When are you gonna launch- -[laughs] Yeah. -When are you gonna launch, uh, your- -Yeah -... your product? -I- -That's the- [laughs] That's- that's the question. Um, yeah. Uh- But yeah, either that or the CV, I would love to see either. [laughs] -[laughs] Yeah. Um, thanks so much, Luke. -Yeah. And, uh, yeah. Thanks, everyone, for listening.
-Fantastic. Thank you. Really fun. -Oh, we got the first handshake on this. -[laughs] -Oh, really? Huh? I think so. Yeah. We should redo it often. -Yeah. -Yeah. [laughs]