The Data Stack Show - podcast cover

The Data Stack Show

Rudderstackdatastackshow.com
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

Episodes

The PRQL: What are LLMs Actually Good At? Featuring Nicolay Gerold

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Sep 02, 20242 min

204: Will a Duck DB-Like Excel Emerge by 2075? And Is Data Every Company’s Most Valuable Asset? Featuring Benn Stancil of Mode

Highlights from this week’s conversation include: Benn's Background and Journey in Data (0:48) Reflection on Strategy and Vision (2:10) The Importance of Doing It Your Way (4:10) Early Experiences and Blogging (6:27) Self-Imposed Pressure in Startups (8:24) The Challenge of Decision-Making (12:11) Key Decisions in a Startup's Trajectory (15:48) Understanding Startup Anxiety (17:24) Importance of Focus in Data Startups (20:02) Product Market Fit Insights (24:38) Cultural Change and Product Fit (3...

Aug 28, 202454 min

The PRQL: Will Excel Exist in 50 Years? Featuring Benn Stancil of Mode

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Aug 26, 20242 min

203: From Data Dreams to Practical Marketing Outcomes with Spencer Burke of Braze

Highlights from this week’s conversation include: Spencer's Background at Braze (1:54) The Early Days of Braze (2:41) Finding Product-Market Fit (4:44) First Major Customer (6:33) Unique Aspects of Braze's Growth Team (8:07) Startup Culture Experience (10:40) Data and Marketing Perspectives (12:50) Common Marketing Data Challenges (15:50) Changing Dynamics in Marketing Tech (18:12) Evaluating Marketing Tools (19:38) Transformation of Marketing Tools (22:18) Marketers Becoming More Technical (24:...

Aug 21, 202447 min

The PRQL: Get Better at Data, Get Better at Marketing with Spencer Burke of Braze

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Aug 19, 20242 min

202: Predicting the Impact of Competitive Entrants With Synthetic Controls with Evan Wimpey of Elder Research

Highlights from this week’s conversation include: Evan's Background and Journey in Data (0:40) Discussion on Synthetic Controls (1:04) Evan's Educational Journey and Marine Corps Experience (2:54) Joining Elder Research (4:38) Synthetic Controls Explained (6:54) Measuring Impact with Synthetic Controls (9:05) Building the Control Group (12:54) Qualitative Context in Data Analysis (14:50) Final Steps with Synthetic Controls (16:29) Client Analytics Maturity (18:56) Outsourcing Decisions in Analyt...

Aug 14, 202450 min

The PRQL: Data Comedy and Synthetic Controls with Evan Wimpey of Elder Research

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Aug 12, 20242 min

201: AI Real-Talk: Uncovering the Good, Bad and Ugly Through Prototyping with Eric, John, and Matt

Highlights from this week’s conversation include: Current State of LLMs (1:12) Historical Analogy to the iPhone (3:32) Limitations of Early iPhones (5:02) Comparing LLMs to Historical Technologies (6:08) Skepticism About LLM Capabilities (9:11) Broad Nature of AI Innovations (10:12) User Input Challenges (14:32) Transcription and Unstructured Data (16:19) Single Player vs. Multiplayer Experiences with LLMs (18:50) Revenue Insights from ChatGPT (20:27) Contextual Use of LLMs in Development (23:43...

Aug 07, 20241 hr 3 min

The PRQL: AI Roundtable: Putting AI in Historical Context and Real-Life Learnings Through Prototyping, with Eric, John, and Matt

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Aug 05, 20242 min

200: Data Team Struggles: Telling Stakeholders the Truth vs. What They Want to Hear (How to Tell The Truth, Tactfully)

Highlights from this week’s conversation include: Lightning Round Discussion (1:21) Data Team's Truthfulness (2:21) Culture as a Blocker (9:10) Misconceptions about Data Jobs (10:32) Cultural and Technological Influences (11:51) Challenges in Data Science Projects (15:19) Embracing the Process (17:23) Barriers to Entry (19:36) Hiring Data Leaders (22:06) Challenges of Data Leadership (25:38) Evolving Hiring Criteria (27:30) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP ...

Jul 31, 202429 min

The PRQL: Jaded Takes on Data Linkedin

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Jul 29, 20242 min

199: How To Use Data Analytics and AI To Increase Profitability With Smarter Procurement, Featuring Cameron Jagoe of ProcureVue

Highlights from this week’s conversation include: Cameron's Background and Journey in Data (1:49) Running a Bakery (3:03) Applying Analytics to Bakery Operations (7:07) Reevaluating Business Operations (18:08) Optimizing for Profitability (19:09) Working at Newell Rubbermaid (20:11) Value Engineering Projects (22:11) Starting a Center of Excellence (24:53) Productizing the Approach (29:48) Tech Stack for Data Analysis (31:40) Data Cleaning and Classification (35:16) Market Build and Pricing Accu...

Jul 24, 202449 min

The PRQL: Better Analytics, Smarter Purchasing, and Improved Profitability with Cameron Jagoe of ProcureVue

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Jul 22, 20243 min

198: Building AI Search and Customer-Enabled Fine-Tuning with Jesse Clark of Marqo.ai

Highlights from this week’s conversation include: Jesse’s background and work in data (0:35) E-commerce Application for Search (1:23) Ph.D. in Physics Experience Then Working in Data (2:27) Early Machine Learning Journey (4:35) Machine Learning at Stitch Fix (7:28) Machine Learning at Amazon (10:39) Myths and Realities of AI (13:49) Bolt-On AI vs. Native AI (17:26) Overview of Marqo (19:46) Product launch and fine-tuning models (23:02) Importance of data quality (25:38) The power of machine lear...

Jul 17, 202452 min

The PRQL: Exploring the Evolution of AI and ML in E-commerce Search Optimization with Jesse Clark of Marqo.ai

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Jul 15, 20242 min

197: Deep Dive: How to Build AI Features and Why it is So Dang Hard with Barry McCardle of Hex

Highlights from this week’s conversation include: Overview of Hex and its Purpose (0:51) Discussion on AI and Data Collaboration (1:42) Product Updates in Hex (2:14) Challenges of Building AI Features (13:29) Magic Features and AI Context (15:22) Chatbots and UI (17:31) Benchmarking AI Models (19:06) AI as a Judge Pattern (23:32) Challenges in AI Development (25:31) AI in Production and Product Integration (28:43) Difficulties in AI Feature Prediction (33:38) Deterministic template selection and...

Jul 10, 20241 hr 4 min

The PRQL: Why is Building Great AI Features so Hard? Featuring Barry McCardel of Hex

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Jul 08, 20242 min

196: Why Big Query Was a Big Deal, Observability AI, and How AI is Like a Guy at the Bar, Featuring David Wynn of Edge Delta

Highlights from this week’s conversation include: David’s Background and Career (0:49) Econometrics Work at UPS (3:14) Challenges with Time Series Data and Tools (7:15) Working at Google Cloud (11:28) BigQuery's Significance (13:51) Comparison of Data Warehouse Products (17:23) Learning different cloud platforms (20:17) Coherence in GCP (23:04) Observability and data analysis (32:44) Support for Iceberg format in BigQuery (36:31) AI in Observability (40:25) AI's Role in Observability (43:39) AI ...

Jul 03, 202449 min

The PRQL: Google Cloud Deep Dive and Observability AI with David Wynn of Edge Delta

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Jul 01, 20243 min

195: Supply Chain Data Stacks and Snowflake Optimization Pro Tips with Jeff Skoldberg of Green Mountain Data Solutions

Highlights from this week’s conversation include: Jeff's Background and Transition to Independent Consulting (0:03) Working at Keurig and Business Model Changes (2:16) Tech Stack Evolution and SAP HANA Implementation (7:33) Adoption of Tableau and Data Pipelines (11:21) Supply Chain Analytics and Timeless Data Modeling (15:49) Impact of Cloud Computing on Cost Optimization (18:35) Challenges of Managing Variable Costs (20:59) Democratization of Data and Cost Impact (23:52) Quality of Fivetran Co...

Jun 26, 202449 min

The PRQL: Breaking down Keurig’s Supply Chain Data Stack with Jeff Skoldberg of Green Mountain Data Solutions

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Jun 24, 20242 min

194: Building Retail Churn Prediction on DuckDB with Clint Dunn of Wilde

Highlights from this week’s conversation include: Clint’s Background and Journey in Data (0:51) Starting a Data Career (2:01) Transition to Startup SaaS World (4:27) Clint’s Connection to a Federal Reserve Database (5:31) Challenges in Predictive Modeling (10:27) Data Input Challenges (15:50) Marketers' Workflow and Data Integration (18:29) Soft ROI vs. Hard ROI in Data Analysis (00:21:31) Balancing Internal Marketing and Data Team's Value (22:35) Simplifying Data Inputs for Predictive Models (2...

Jun 19, 202448 min

The PRQL: Hard Data ROI and Productizing Retail Churn Prediction with Clint Dunn of Wilde

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Jun 17, 20242 min

193: Introducing the Cynical Data Guy: Is Data-Driven a Myth?

Highlights from this week’s conversation include: Introducing a special edition of the show with the cynical data guy (0:19) Metadata and LLMs (2:32) Data-driven culture (8:44) No-code orchestration tools (17:09) No Code vs. Low Code (19:58) Enterprise Challenges with No Code Solutions (20:08) No Code Tools for Small Companies (21:40) Inappropriate Use of Tools (23:06) Final thoughts and takeaways (24:05) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Eac...

Jun 12, 202424 min

The PRQL: The Cynical Data Guy’s Origin Story

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Jun 10, 20243 min

192: Business Logic As Code: A New LLM-Powered Operating System for Business Automation with Binny Gill of Kognitos

Highlights from this week’s conversation include: The history of computer science and AI inflection point (1:23) Binny's early programming experiences and the constraints of technology (2:14) Getting interested in computer programming (5:02) The experiment that impacted the starting of Kognitos (8:23) Challenges in traditional computer science (16:04) Reimagining programming and debugging through natural language (19:08) The operating system for applications (20:19) Changing the paradigm of prog...

Jun 05, 202448 min

The PRQL: From Programming Tic Tac Toe to Building an Operating System for Natural Language Programs With Binny Gill of Kognitos

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Jun 03, 20243 min

191: From Amazon to Consulting: Time Series Forecasting and How to Communicate Data Analytics Insights with David McCandless of McCandless Consulting

Highlights from this week’s conversation include: David's Background and Journey in Data (0:30) Transition to Time Series Forecasting (2:03) Working on Time Series Forecasting at Amazon (2:55) Challenges and Experience in Time Series Forecasting (4:32) Transitioning to a New Role at Amazon (5:52) Tools and Methods for Time Series Forecasting (8:17) Forecasting Impact and Accuracy (15:30) Explaining Variance and Lessons Learned (18:58) Understanding Downstream Consumers and Empathy for Business L...

May 29, 202449 min

The PRQL: Practical Applications for Time Series Forecasting with David McCandless of McCandless Consulting

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

May 28, 20243 min

190: Aligning Data Teams and Data Tools With Business Needs Featuring Ben Rogojan, the Seattle Data Guy

Highlights from this week’s conversation include: Ben’s background and journey in data (0:18) Relating data to business outcomes (2:33) Facebook's approach to data-driven business outcomes (4:43) Subjectivity and data-driven business outcomes (8:43) Infrastructure and data collection at Facebook (12:04) The importance of first-party data and the death of third-party cookies (16:27) Facebook's Data and Attribution Challenges (20:08) Facebook's Infrastructure and Tooling (23:41) Differences in Dat...

May 22, 202452 min