Two Voice Devs

Mark and Allen · podcasters.spotify.com
Mark and Allen talk about the latest news in the VoiceFirst world from a developer point of view.

Episodes

Episode 216 - DevAI: Threat or Enabler? Live Q&A from Voice & AI 2024

Join Allen Firstenberg and Noble Ackerson, hosts of the Two Voice Devs podcast, for a lively and insightful Q&A session recorded live at Voice & AI 2024! We dive into the burning questions surrounding AI's impact on software development, exploring the potential threats and exciting opportunities presented by tools like GitHub Copilot and Cursor. From the future of junior developers to the ethics of non-deterministic systems, we tackle it all with our signature blend of technical expertis...

Nov 21, 2024 · 35 min · Season 1 · Ep. 216

Episode 215 - Unlock Cross-Platform Machine Learning Model Deployment

Tired of wrestling with platform-specific machine learning model formats? Join Allen Firstenberg and Mark Tucker on Two Voice Devs as they explore ONNX (Open Neural Network Exchange), a game-changing open format built to streamline your ML model deployment workflow. Discover how ONNX empowers you to train models in your preferred framework (PyTorch, TensorFlow, scikit-learn, etc.) and seamlessly execute them across diverse platforms (Windows, Mac, Linux, iOS, Android, Web) using the efficient ON...

Nov 15, 2024 · 28 min · Season 1 · Ep. 215

Episode 214 - NotebookLM: The Future of Personalized AI Learning for Developers?

Dive into the world of AI-powered learning with Allen and Mark as they explore Google's innovative NotebookLM. This cutting-edge tool offers a fascinating glimpse into the potential of Google's Gemini AI model. NotebookLM allows you to centralize your notes, documents, and even audio/video transcripts, transforming them into an interactive knowledge base. Discover how its conversational interface lets you ask questions, generate summaries with citations, and even create podcasts from your source...

Nov 07, 2024 · 31 min · Season 1 · Ep. 214

Episode 213 - Scary Developer Stories: A Halloween Special

Boo! Join Two Voice Devs for a special Halloween episode filled with chilling tales from the software development crypt. Mark and Allen recount true stories of coding nightmares, from dropped databases to runaway pings, and offer words of wisdom for surviving your own development horrors. Listen with the lights on (if you dare) as they explore the spooky side of coding, complete with a chilling Halloween soundtrack. Don't forget to share your own scary developer stories in the comments! Timestam...

Oct 31, 2024 · 22 min · Season 1 · Ep. 213

Episode 212 - Data Labeling for Developers

Join Mark and Allen, your Two Voice Devs, as they delve into the crucial world of data labeling for machine learning model training. Whether you're a seasoned data scientist or a developer just starting to explore AI, understanding data labeling is essential for building effective models. In this episode, they explore various data labeling techniques, from manual labeling for simple voice apps to automated approaches using open-source libraries like Snorkel. Discover how labeled data powers ever...

Oct 25, 2024 · 36 min · Season 1 · Ep. 212

Episode 211 - Apple Intelligence and Siri's Future (and Beyond)

Join us for a fascinating conversation with John G, a seasoned voice developer, as he shares his insights into Apple's approach to AI and the future of Siri. John discusses his journey from helping content creators to the Alexa ecosystem and then into the Apple world, driven by the potential of App Intents and the evolving landscape of Apple Intelligence. We delve into the technical details, exploring how App Intents, the semantic index, and on-device LLMs are shaping the future of app developme...

Oct 18, 2024 · 1 hr · Season 1 · Ep. 211

Episode 210 - Simplifying Generative AI Development with Firebase GenKit & GitHub Models

Join Mark and Xavier on Two Voice Devs as they dive into the world of generative AI development with Firebase GenKit and GitHub Models. Xavier, a Google Developer Expert in AI, Microsoft MVP in AI, and GitHub Star, shares his insights on these emerging technologies and his open-source project that bridges the gap between them. Discover how Firebase GenKit offers a simpler, more modular approach to building GenAI applications compared to frameworks like LangChain. Learn about GitHub Models and ho...

Oct 11, 2024 · 22 min · Season 1 · Ep. 210

Episode 209 - AI-Powered Pronunciation: Conquering Tricky TTS

This episode of Two Voice Devs, recorded before the exciting announcement of OpenAI's GPT-4o Realtime and Audio previews, tackles a classic developer challenge: taming unruly text-to-speech (TTS) engines. Triggered by a listener question, Allen and Mark dive into the frustrating inconsistencies of TTS pronunciation, particularly when dealing with dynamically generated text from LLMs. They explore the limitations of SSML, experiment with phoneme alphabets like X-SAMPA, and even ponder the possibi...

Oct 04, 2024 · 19 min · Season 1 · Ep. 209

Episode 208 - O1: Reasoning Engine or Agent's Brain?

Join us as we dive deep into OpenAI's latest model, O1, with special guest host Michal Stanislavik, founder of utter.one and one of the voice community builders behind VoiceLunch. We explore the model's "reasoning" capabilities, its potential impact on conversational AI, and how developers can leverage its strengths. Michal shares his insights from hands-on experience, highlighting both the exciting possibilities and the current limitations of O1. Is it ready for prime-time in conversational appl...

Sep 26, 2024 · 47 min · Season 1 · Ep. 208

Episode 207 - Mentorship in Software Development

Join Mark and Allen on this episode of Two Voice Devs as they dive into the often overlooked but crucial topic of mentorship in software development. They explore what mentorship is (and isn't), the benefits for both mentor and mentee, and share personal anecdotes and practical advice. Whether you're a seasoned developer or just starting out, this episode offers valuable insights into fostering a culture of learning and growth within development teams. Timestamps: 0:00:00 - Introduction 0:00:45 ...

Sep 20, 2024 · 34 min · Season 1 · Ep. 207

Episode 206 - Building Powerful AI Agents with LangGraph

Dive into the world of agentic AI development with Allen and Mark as they explore LangGraph, a powerful state management system for building dynamic and complex AI agents with LangChain. Discover how LangGraph simplifies agent design, handles state transitions, integrates tools, and enables robust error handling – all while keeping the LLM at the heart of your application. Further Info: * https://github.com/langchain-ai/langgraphjs * https://github.com/langchain-ai/langgraphjs-studio-starter/ * ...

Sep 13, 2024 · 38 min · Season 1 · Ep. 206

Episode 205 - Gemini + LangGraph Agents + Google Sheets = Vodo Drive

Join us as we explore Vodo Drive, an innovative project that leverages Google's Gemini AI to revolutionize how we interact with spreadsheets. Creator Allen Firstenberg takes us behind the scenes, revealing the architecture, challenges, and breakthroughs of building an agentic system that understands and manipulates data like never before. Discover how Vodo Drive: * Empowers natural language interaction: Say goodbye to rigid formulas and hello to conversational commands. * Integrates image recogn...

Sep 05, 2024 · 51 min · Season 1 · Ep. 205

Episode 204 - Alexa Skill Sunset Strategies

In this episode of Two Voice Devs, Allen and Mark discuss the considerations and strategies for shutting down an Alexa skill. They explore various reasons why developers might choose to sunset their skills, including declining usage, deprecated features, and the evolving Alexa landscape. They also delve into the technical aspects of skill removal, highlighting the options of hiding or removing a skill and the implications of each choice, especially when dealing with in-skill purchases and subscr...

Aug 29, 2024 · 25 min · Season 1 · Ep. 204

Episode 203 - Imagen 3: Stunning Realism & Ethical Questions

Join Allen and Linda as they dive into Google's Imagen 3 and Imagen 3 Fast, a powerful new set of image generation models. We explore its capabilities, pricing, features, and limitations, including a deep dive into the API and how to use it with Python code. This episode features an in-depth look at Imagen 3's photorealism and comparison with its predecessor, Imagen 2. We examine the ethical implications of AI image generation, discussing copyright issues, plagiarism concerns, and the impact on ...

Aug 23, 2024 · 41 min · Season 1 · Ep. 203

Episode 202 - Hosting and Large Language Models

Join Allen Firstenberg and Mark Tucker on Two Voice Devs as they discuss the challenges and solutions of hosting large language models (LLMs). They explore various hosting environments, including Firebase, AWS Amplify, Vertex AI, and Docker/Kubernetes, comparing their strengths and weaknesses. Allen shares his experience with Firebase Cloud Functions and the seamless integration with Google Cloud services, while Mark tackles the complexities of Docker, Kubernetes, and enterprise-level deployment...

Aug 15, 2024 · 27 min · Season 1 · Ep. 202

Episode 201 - Introduction to KitOps for MLOps

Join Allen and Mark in this episode of Two Voice Devs as they dive into the world of MLOps and explore KitOps, an open-source tool for packaging and versioning machine learning models and related artifacts. Learn how KitOps leverages the Open Container Initiative (OCI) standard to simplify model sharing and deployment. More info: https://kitops.ml Key Topics and Timestamps: What is DevOps? (0:00:41) - Allen and Mark discuss the fundamentals of DevOps and its role in software development and oper...

Aug 02, 2024 · 34 min · Season 1 · Ep. 201

Episode 200 - Four Years and Looking Forward

Mark Tucker and Allen Firstenberg celebrate 200 episodes and four years of Two Voice Devs! In this special episode, they reflect on the journey so far, the evolution of the AI landscape, and what excites them most about the future of development. Join them as they discuss: 00:00 Four years ago... 00:10 The evolution of large language models (LLMs) and how the landscape has shifted over the past year. 03:10 The emergence of new players in the AI model space and how Google, Microsoft, and Amazon a...

Jul 26, 2024 · 26 min · Season 1 · Ep. 200

Episode 199 - Is the Future of AI Local?

Join Allen Firstenberg and Roger Kibbe as they delve into the exciting world of local, embedded LLMs. We navigate some technical gremlins along the way, but that doesn't stop us from exploring the reasons behind this shift, the potential benefits for consumers and vendors, and the challenges developers will face in this new landscape. We discuss the "killer features" needed to drive adoption, the role of fine-tuning and LoRA adapters, and the potential impact on autonomous agents and an appless ...

Jul 22, 2024 · 32 min · Season 1 · Ep. 199

Episode 198 - Wisdom from Unparsed: LLMs are Hammers, Not Silver Bullets

Join us on Two Voice Devs as we welcome back Roger Kibbe. Fresh off emceeing the developer track at the Unparsed Conference in London, Roger shares his insights on the biggest takeaways, trends, and challenges facing #GenAI, #VoiceFirst and #ConversationalAI developers today. Get ready for a dose of reality as Roger emphasizes the need to view LLMs as powerful tools – think hammers – rather than magical solutions. We dive deep into: Timestamps: * 0:00 - Intro * 1:56 - Exploring the Unparsed Conf...

Jul 12, 2024 · 42 min · Season 1 · Ep. 198

Episode 197 - Alexa Skill Development in the Age of LLMs

What should people developing with LLMs learn from a decade of experience building Alexa skills? How will Alexa skill developers leverage the latest #GenerativeAI and #ConversationalAI tools as they continue to build #VoiceFirst and multimodal skills? Join Allen and Mark on Two Voice Devs as they delve into the evolving landscape of Alexa skill development in the era of large language models (LLMs). Sparked by a thought-provoking discussion on the Alexa forums, they explore the potential benefits...

Jul 05, 2024 · 40 min · Season 1 · Ep. 197

Episode 196 - Is GPT 4o a Game Changer?

OpenAI's ChatGPT 4o and GPT 4o announcements have sent shockwaves through the developer community! In this episode of Two Voice Devs, Mark and Allen dive into the implications of these new models, comparing them to Google's Gemini. We discuss: [00:00:10] Initial takeaways from the OpenAI presentations. [00:02:29] The impressive voice capabilities of ChatGPT 4o. [00:04:49] Concerns about OpenAI's ambitions for conversational AI. [00:07:30] The difference between "doing" and "knowing" AI systems. ...

Jun 06, 2024 · 23 min · Season 1 · Ep. 196

Episode 195 - Android, Agents, and the Rabbit R1

Allen Firstenberg chats with fellow Google Developer Expert (GDE) Mike Wolfson about his career, the evolution of Android, and his new interest in generative AI. Mike shares his thoughts on the future of AI with agents, Large Action Models (LAMs), and the potential of the "Rabbit," a new AI-powered device. Does the Rabbit live up to its promise? If not - what could? Timestamps: 00:00:00 - Introduction 00:01:32 - Mike's career journey 00:04:15 - Transition from enterprise Java to Android developm...

May 30, 2024 · 35 min · Season 1 · Ep. 195

Episode 194 - Google AI/O 2024

Join Allen and Roya as they dissect the major AI announcements from Google I/O 2024. From Gemini updates and new models to responsible AI and groundbreaking projects like ASTRA, this episode dives into the future of AI development. Timestamps: [00:00:00] Introduction and Google I/O Overview [00:02:00] Gemini 1.5 Flash & Gemini 1.5 Pro: New Models and Features [00:04:30] AI Studio Access Expansion for Europe, UK & Switzerland [00:06:20] Choosing the Right AI Model for Your Project [00:06:...

May 17, 2024 · 23 min · Season 1 · Ep. 194

Episode 193 - Revolutionizing Intent Classification

Join Allen and Mark as they delve into Voiceflow's groundbreaking new feature: intent classification using a hybrid of LLMs and classic NLU models. Discover how this innovative approach leverages the strengths of both technologies to achieve greater accuracy and flexibility in understanding user intent. How they're doing it just may blow your mind! 🤯 Timestamps: 0:00:00 - Introduction 0:00:33 - Exploring the concept of intents and slots in conversational UI 0:05:11 - Understanding Natural Langu...

May 09, 2024 · 40 min · Season 1 · Ep. 193

Episode 192 - Google Cloud Next 2024 Recap

Join Allen Firstenberg and guest host Stefania Pecore on Two Voice Devs as they delve into the exciting announcements and highlights from Google Cloud Next 2024! This episode focuses on the latest advancements in AI and their impact on the healthcare industry, providing valuable insights for developers and tech enthusiasts. Learn more: * https://cloud.google.com/blog/topics/google-cloud-next/google-cloud-next-2024-wrap-up Timestamps: 00:00:00: Introduction 00:01:02: Stefania's background and jou...

Apr 26, 2024 · 41 min · Season 1 · Ep. 192

Episode 191 - Beyond the Hype: Exploring BERT

This episode of Two Voice Devs takes a closer look at BERT, a powerful language model with applications beyond the typical hype surrounding large language models (LLMs). We delve into the specifics of BERT, its strengths in understanding and classifying text, and how developers can utilize it for tasks like sentiment analysis, entity recognition, and more. Timestamps: 0:00:00: Introduction 0:01:04: What is BERT and how does it differ from LLMs? 0:02:16: Exploring Hugging Face and the BERT base u...

Apr 19, 2024 · 40 min · Season 1 · Ep. 191

Episode 190 - Google Gemma's Tortoise and Hare Adventure

Embark on a wild race with Gemma as we explore the exciting (and sometimes slow) world of running Google's open-source large language model! We'll test drive different methods, from the leisurely pace of Ollama on a local machine to the speedier Groq platform. Join us as we compare these approaches, analyzing performance, costs, and ease of use for developers working with LLMs. Will the tortoise or the hare win this race? Learn more: * Model card: https://console.cloud.google.com/vertex-ai/publi...

Apr 11, 2024 · 28 min · Season 1 · Ep. 190

Episode 189 - Farewell, ADR: The Impact on Alexa Developers

The Alexa Developer Rewards Program (ADR) is shutting down, leaving many developers wondering about the future of Alexa skills. Mark and Allen discuss the implications of this change, explore alternative monetization options, and share their thoughts on the future of skill development. Timestamps: 0:00 - Intro and announcement of the ADR program ending 1:45 - History of the ADR program and its impact on skill development 7:13 - Discussion of the Skill Developer Accelerator Program (SDAP) and Ski...

Apr 05, 2024 · 26 min · Season 1 · Ep. 189

Episode 188 - Building Responsible AI with Gemini

As large language models (LLMs) become increasingly powerful, ensuring their responsible use is crucial. In this episode of Two Voice Devs, Allen and Mark delve into Google's Gemini LLM, specifically its built-in safety features designed to prevent harmful outputs like harassment, hate speech, sexually explicit content, and dangerous information. Join them as they discuss: (00:01:55) The importance of safety features in LLMs and Google's approach to responsible AI. (00:03:08) A walkthrough of Ge...

Mar 29, 2024 · 30 min · Season 1 · Ep. 188

Episode 187 - LLMs in Developer Tools

In this episode of Two Voice Devs, Mark and Allen discuss how developers can leverage AI tools like ChatGPT to improve their workflow. Mark shares his experience using ChatGPT to generate an OpenAPI specification from TypeScript types, saving him significant time and effort. They discuss the benefits and limitations of using AI for code generation, emphasizing the importance of understanding the generated code and maintaining healthy skepticism. Timestamps: 00:00:00 Introduction 00:00:49 Using A...

Mar 21, 2024 · 26 min · Season 1 · Ep. 187