What is it like to build a data team for a company in the data space? This talk is centered around how dbt Labs is building their data team. We will cover how our team is structured, how we operate and interact with the greater organization, and how we set expectations and responsibilities that are helping us become a self-service organization. Register to catch the rest of Coalesce, the Analytics Engineering Conference, at https://coalesce.getdbt.com. The Analytics Engineering Podcast is brough...
Dec 07, 2021•26 min•Season 1Ep. 13
As a product leader at companies like Heroku and Zendesk, DeVaris specialized in building infrastructure-grade products. Currently, as the CEO of Meroxa, he enables teams to build real-time data infrastructure with the same ease as we now take for granted in batch. In this romp of an episode, Tristan, Julia and DeVaris flow from his experience in tech mentorship, into the nuts and bolts of Change Data Capture (CDC), and how streaming data infrastructure can help data teams provide better end use...
Dec 02, 2021•50 min•Season 1Ep. 12
David is Sr. Director of Data at Lyst, and as leader of their analytics + data science teams he has followed the evolution of data roles closely over the past decade. David spends a lot of time thinking about career progression + data team structure, and in this conversation with Tristan + Julia they dive into the classic individual contributor vs manager conundrum, migrating between warehouses, and reactive vs proactive data workflows. For full show notes and to read 6+ years of back issues of ...
Nov 18, 2021•41 min•Season 1Ep. 11
Julien has a unique history of building open frameworks that make data platforms interoperable. He’s contributed in various ways to Apache Arrow, Apache Iceberg, Apache Parquet, and Marquez, and is currently leading OpenLineage, an open framework for data lineage collection and analysis. In this episode, Tristan & Julia dive into how open source projects grow to become standards, and why data lineage in particular is in need of an open standard. They also cover into some of the compelling use ca...
Nov 04, 2021•49 min•Season 1Ep. 10
Benn is Chief Analytics Officer and a Co-founder at Mode Analytics, but you may know him from his Substack newsletter ( benn.substack.com ), where each Friday he dives into a semi-controversial topic (recent examples: “Is BI Dead?” and “BI is Dead”). In this episode, Benn, Tristan & Julia finally hash out some of these debates IRL: what *is* the modern data stack, why is the metrics layer important, and what’s the point of all of this? For full show notes and to read 6+ years of back issues of t...
Oct 21, 2021•49 min•Season 1Ep. 9
Seth Rosen has broken data Twitter many times, and in his early-fatherhood sleep deprivation developed a wonderful Twitter persona as the battle-tested data analyst. IRL though Seth is a serious data practitioner, and as Founder at the data consultancy HashPath has helped dozens of companies get into the modern data stack + build public-facing data apps. Now, as the founder of TopCoat, he’s empowering analysts to build + publish those same public-facing data apps. In this episode, Tristan, Julia...
Oct 07, 2021•39 min•Season 1Ep. 8
Brittany Bennett is Data Director at Sunrise Movement, the youth climate movement that numbers tens of thousands of members throughout every US state. Given how quickly our industry moves, developing junior data talent is hard, but Brittany’s team at Sunrise makes it look easy. And that’s no accident—because Sunrise hires for mission alignment rather than technical background, they dedicate significant resources to training + mentorship. In this conversation, Tristan, Julia & Brittany dive deep ...
Sep 23, 2021•39 min•Season 1Ep. 7
Caitlin Colgrove is Co-founder & CTO at Hex, a data workspace that allows teams to collaborate in both SQL and Python to publish interactive data apps. In this conversation, Tristan, Julia and Caitlin dive into the possibilities that real-time collaborative notebooks unlock for data teams — what if our collaboration style looked more like Google Docs than a Git workflow? For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt....
Sep 09, 2021•40 min•Season 1Ep. 6
Erik Bernhardsson spent six years at Spotify, where he contributed to the first version of the music recommendation system. After a stint as CTO at Better.com, he’s now working on building new infrastructure tooling for data teams. In this wide-ranging conversation with Tristan & Julia, Erik dives into the nuts and bolts of Spotify’s recommendation algorithm, (paradoxically) why you should rarely need to use ML, and the fundamental infrastructure challenges that drag down the productivity of dat...
Aug 26, 2021•42 min•Season 1Ep. 5
In this episode, we're going to do something a little different, and turn the spotlight on co-host Julia Schottenstein. In this conversation with Tristan, you'll get to know Julia a bit—from her early childhood ambitions of becoming a "computer tycoon" (adorable!), to working in venture at NEA and now as a Product Manager at dbt Labs. They also dive into Julia's opinions on key trends shaping the future of the data industry (the phrase oligopoly makes an appearance). For full show notes and to r...
Aug 12, 2021•32 min•Season 1Ep. 4
Brian Amadio is a Data Platform Engineer at Stitch Fix, where experimentation underpins everything they do across merchandising, planning, forecasting, operations and more. In this conversation with Tristan, Julia, and Brian you’ll get into the weeds of executing multi-armed bandit experiments and learn how you can perform experiments even with limited data. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com . The Ana...
Jul 29, 2021•39 min•Season 1Ep. 3
Step with Venkat into a world where data is always fresh, queries run in 1ms, and analytics engineers build web-scale, real-time data apps. As Engineering Director at Facebook, Venkat helped build the RocksDB real-time database that powered growth to 5 billion queries per second(!)—and now with his colleagues at Rockset, he's bringing that real-time database infrastructure to the rest of us. In this conversation, Tristan, Julia and Venkat explore the fundamental technological advances that are e...
Jul 15, 2021•45 min•Season 1Ep. 2
Robert Chang is a product manager for the data platform at Airbnb, where he helped build and roll out Minerva, Airbnb's internal metrics store. They use Minerva to track over 12,000(!) metrics and 4,000(!) dimensions with consistency across the organization. In this conversation with Tristan and Julia, Robert dives into why they built it, what it took to get it done—and crucially, what you should do if your company doesn't have the resources to build your own internal metrics store. For full sho...
Jul 01, 2021•38 min•Season 1Ep. 1