The Data Stack Show

RudderStack · datastackshow.com
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

Episodes

The PRQL: What is Data Discovery?

In this bonus episode, Eric and Kostas preview their upcoming conversation with Shinji Kim of Select Star. Hosted by Simplecast, an AdsWizz company. See https://pcm.adswizz.com for information about our collection and use of personal data for advertising.

Oct 21, 2022 · 4 min

109: How Does Headless Business Intelligence Work? Featuring Artyom Keydunov and Pavel Tiunov of Cube Dev

Highlights from this week’s conversation include: The context of Headless BI (3:31) What Cube Dev does (9:24) How Headless BI works with other tools (13:03) An analysis of LookML (18:04) User interaction with Cube Dev (23:40) Who manages data artifacts (25:22) Taking care of the developer experience (30:37) Levels of performance (30:37) Artyom and Pavel’s background and career journey (35:47) Why you should use Cube Dev (43:38) Roles within a data organization (48:55) How Cube Dev impacts visual...

Oct 19, 2022 · 1 hr 1 min

The PRQL: What Comes to Mind When You Think of ‘Headless’?

In this bonus episode, Eric and Kostas preview their upcoming conversation with Artyom Keydunov & Pavel Tiunov of CubeJS.

Oct 14, 2022 · 5 min

108: You Can’t Separate Data Reliability From Workflow with Gleb Mezhanskiy of Datafold

Highlights from this week’s conversation include: Gleb’s background and career journey (2:51) The adoption problems (10:53) How Datafold solves these problems (18:08) The vision for Datafold (26:27) Incorporating Datafold as a data engineer (38:53) The importance of the data engineer (42:12) Something to keep in mind when designing data tools (46:46) Implementing new technology into your company (53:18) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each ...

Oct 12, 2022 · 1 hr

Shop Talk: Is It Possible for Excel To Die?

In this bonus episode, Eric and Kostas talk shop around the wide world of data.

Oct 10, 2022 · 18 min

The PRQL: Are Marketers the Worst Data Quality Offenders?

In this bonus episode, Eric and Kostas preview their upcoming conversation with Gleb Mezhanskiy of Datafold.

Oct 07, 2022 · 4 min

107: Building Modern Data Teams with dbt Labs, REI, and Robinhood

Highlights from this week’s conversation include: Introducing our guests (3:05) Defining “data team” (4:40) How data teams emerge and evolve (14:11) The need that forces the creation of a data team (21:12) The backbone of the data team (26:23) Building a career within a data team (36:39) Advice for new data team managers (47:35) Question and answer time (52:38) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts...

Oct 05, 2022 · 1 hr 3 min

The PRQL: What Can We Learn From the Patterns of Successful Data Teams?

In this bonus episode, Eric and Kostas preview their upcoming panel discussion around the topic of building data teams.

Sep 30, 2022 · 3 min

106: Optimizing Query Workloads (and Your Snowflake Bill) with Vinoo Ganesh of Bluesky Data

Highlights from this week’s conversation include: Vinoo’s background and career journey (2:43) How to benchmark cost (7:54) How Bluesky addresses rising Snowflake bills (14:01) “Workload” as defined by Bluesky (17:14) Space for BI optimization (22:55) How products manage bill growth (28:34) How to optimize your workloads (35:37) Bluesky’s partnerships (39:53) Getting real-time feedback on your work (44:50) Where to begin reevaluating your Snowflake game (50:47) The Data Stack Show is a weekly po...

Sep 28, 2022 · 56 min

Shop Talk With Eric and Kostas: Data Politicians

In this bonus episode, Eric and Kostas talk about data politicians in this new special show format.

Sep 26, 2022 · 22 min

The PRQL: Comparing Snowflake to a Car

In this bonus episode, Eric and Kostas preview their upcoming conversation with Vinoo Ganesh of Bluesky Data.

Sep 23, 2022 · 4 min

105: The Modern Data Stack Is Just Getting Started with Astasia Myers of Quiet Capital

Highlights from this week’s conversation include: Astasia’s background and career journey (3:03) How Astasia evaluates data companies (5:25) Defining “modern data stack” (8:39) The limit of the complexity of a solution (18:44) How risky early-stage acquisition really is (26:15) Flashing headlight advice for investing (30:17) Signs you should do a product integration (33:38) The next data infrastructure opportunities (36:19) The likelihood of two data worlds merging (43:55) How important open sou...

Sep 21, 2022 · 59 min

The PRQL: Kostas Becomes a Prophet

In this bonus episode, Eric and Kostas preview their upcoming conversation with Astasia Myers of Quiet Capital.

Sep 16, 2022 · 3 min

104: A Decade of Change in the Data Space with Benn Stancil of Mode

Highlights from this week’s conversation include: Benn’s background and career journey (2:28) The problem Benn sought to solve (4:48) Data engineering a decade ago (9:58) Technology inside vs. outside Silicon Valley (18:11) What’s next for data (24:42) Mode’s evolution and journey (29:31) Challenges of getting enough context to create (39:21) Current trends that won’t see long-term benefits (48:44) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week ...

Sep 14, 2022 · 57 min

The PRQL: What Does 10 Years in the Data Space Give You?

In this bonus episode, Eric and Kostas preview their upcoming conversation with Benn Stancil of Mode.

Sep 09, 2022 · 4 min

103: Everyone Is Invited to the Data Lakehouse with Kyle Weller of Onehouse.ai

Highlights from this week’s conversation include: Kyle’s background and career journey (2:38) Unique challenges in building data engineering products (9:33) The problem set Databricks resolves (13:46) About Onehouse (17:15) From Microsoft to Onehouse (20:59) Why there’s so much distance between data powers (24:45) Why the data lake is not enough (30:15) Who should have a lakehouse (39:03) Why we have all three data platforms (43:53) How to step into the data lakehouse world (49:48) The Data St...

Sep 07, 2022 · 56 min

The PRQL: Who Really Needs To Know How a DBMS Works?

In this bonus episode, Eric and Kostas preview their upcoming conversation with Kyle Weller of Onehouse.ai.

Sep 02, 2022 · 5 min

102: Building Pinot for Real-Time, Interactive User Analytics with Kishore Gopalakrishna of StarTree

Highlights from this week’s conversation include: Kishore’s background and career journey (2:30) Internal analytics versus user-facing analytics (3:49) New ways of thinking about analytics (8:06) What makes Pinot different (13:45) How Pinot transforms systems (21:53) Understanding the data landscape (32:40) The Pinot user experience (36:27) Something exciting about StarTree (40:05) When you should adopt this technology (43:15) The Data Stack Show is a weekly podcast powered by RudderStack, the C...

Aug 31, 2022 · 49 min

The PRQL: Data Warehouses on Steroids

In this bonus episode, Eric and Kostas preview their upcoming conversation with Kishore Gopalakrishna of StarTree.

Aug 26, 2022 · 3 min

101: The Future of Machine Learning with Willem Pienaar of Tecton and Tristan Zajonc of Continual

Highlights from this week’s conversation include: When is it right to use ML? (5:22) ML business models (10:21) Significant changes in delivering ML (19:07) Why ML is different (25:19) SQL becoming more important (34:39) Graduating from SQL-based to real-time (37:22) Space for a new role (45:11) State-of-the-art models (49:03) The most exciting thing in the ML space (53:59) Open source in ML (56:39) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week...

Aug 24, 2022 · 1 hr 4 min

The PRQL: Can Machine Learning Be Commoditized?

In this bonus episode, Eric and Kostas preview their upcoming live stream episode featuring Willem Pienaar of Tecton and Tristan Zajonc of Continual.

Aug 19, 2022 · 5 min

100: Data Quality Is Relative to Purpose with James Campbell of Superconductive

Highlights from this week’s conversation include: James’ role at Great Expectations (2:33) What Great Expectations does (5:49) How Great Expectations approaches data quality (7:01) Why a data engineer should use Great Expectations (16:41) Defining “data quality” (19:16) Translating expectations from one domain to the other (27:00) Community around Great Expectations (30:59) The user experience (33:41) Something exciting on the horizon (40:27) Interacting with marketers in a non-technical way (43...

Aug 17, 2022 · 54 min

The PRQL: What’s the Hardest Part About Data Quality?

Eric and Kostas preview their upcoming conversation with James Campbell at Superconductive.

Aug 12, 2022 · 4 min

99: State of the Data Lakehouse with Vinoth Chandar of Apache Hudi

Highlights from this week’s conversation include: Vinoth’s background and career journey (3:08) Defining “data lakehouse” (5:10) Databricks versus lakehouses (13:37) The services a lakehouse needs (17:37) How to communicate technical details (26:55) Onehouse’s product vision (31:41) Lakehouse performance versus BigQuery solutions (36:44) How to deliver customer experience equally (40:17) How to start building a lakehouse (44:00) Big tech’s effect on smaller lakehouses (55:33) Skipping the data ...

Aug 10, 2022 · 1 hr 13 min

98: Category Theory and the Mathematical Foundation of the Technologies We Use with Eric Daimler of Conexus

Highlights from this week’s conversation include: Eric’s background and career journey (3:30) Presenting to people without knowledge of AI (11:04) Why math was chosen over AI (19:03) From compilers to databases (25:42) The contribution of category theory (30:09) The Conexus customer experience (37:45) The primary user of Conexus (46:33) Interacting with 300,000 databases (51:07) When Conexus begins to add value (54:02) The best way to learn this mathematical approach (55:46) The Data Stack Sh...

Aug 03, 2022 · 1 hr 2 min

The PRQL: Farm to Table Abstract Mathematics

Eric and Kostas preview their upcoming conversation with Eric Daimler of Conexus AI.

Jul 29, 2022 · 4 min

97: How To Build an Organization-Empowering Data Team with Emilie Schario of Amplify Partners

Highlights from this week’s conversation include: Emilie’s background and career journey (3:00) Hypergrowth at GitLab (5:23) Being close to the money in data (9:50) Big things taken from GitLab to Netlify (13:00) Defining “data organization” (17:53) The first roles you should hire for (22:06) Defining “analytics engineer” (23:44) One role to bridge different needs (27:26) Why data analysts are needed (30:51) How to avoid a kitchen sink of data (40:20) Data engineer archetype (45:48) Data roles c...

Jul 27, 2022 · 54 min