The Data Stack Show - podcast cover

The Data Stack Show

Rudderstackdatastackshow.com
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

Episodes

115: What Is Production Grade Data? Featuring Ashwin Kamath of Spectre

Highlights from this week’s conversation include: Ashwin’s background in the data space (2:43) The unique nature of working with data in finance (7:32) Technological challenges of working in the finance data space (13:55) The third-party data factor and judging if it is reliable enough (17:07) What made Ashwin decide to go out and build his own company? (31:47) Defining data decay and data storing and why both are important (37:52) Advice on the importance of data quality (42:10) Final takeaways...

Nov 30, 202255 min

114: Solving Data Infrastructure Problems at Startups and Enterprises with Max Werner of Obsessive Analytics Consulting

Highlights from this week’s conversation include: Max’s career journey (2:54) Going from a small startup to a big enterprise (11:15) Dynamics of a switchboard operator (17:09) Common threads through different companies (20:53) When data is not the answer (26:57) The evolution of CDP (29:38) Data sources to include in a CDP (35:16) Working with event data (37:19) Max’s take on other tools (41:18) The cutting edge in data (43:09) Building your data company in an evolving environment (49:28) Find M...

Nov 23, 202259 min

The PRQL: The Data Switchboard

In this bonus episode, Eric and Kostas preview their upcoming conversation with Max Werner of Obsessive Analytics Consulting.

Nov 21, 20223 min

113: What Is Streaming Graph? Featuring Ryan Wright of thatDot

Highlights from this week’s conversation include: Ryan’s background and career journey (2:49) Quine and where it came from (4:36) Graph databases 101 (7:17) Use cases for graph databases (13:44) Purposes for graphs (22:27) How to use Quine (31:49) Quine’s performance and scale (43:06) Educating users about a new product (49:13) The team that would optimize Quine (52:23) When graph will gain popularity (56:15) Quine: https://quine.io/ The Data Stack Show is a weekly podcast powered by RudderStack...

Nov 16, 20221 hr 5 min

The PRQL: Graph as a Utility

In this bonus episode, Eric and Kostas preview their upcoming conversation with Ryan Wright of thatDot.

Nov 14, 20223 min

112: Python Native Stream Processing with Zander Matheson of bytewax

Highlights from this week’s conversation include: Zander’s background and career journey (2:32) Introducing bytewax (5:16) The difference between systems (10:57) Bytewax’s most common use cases (16:15) How bytewax integrates with other systems (20:25) The technology that makes up bytewax (24:31) Comparing bytewax to other systems (34:17) What’s next for bytewax (36:31) Try it out: bytewax.io The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll t...

Nov 09, 202250 min

111: What if Your Code Just Ran in the Cloud for You? Featuring Erik Bernhardsson of Modal Labs

Highlights from this week’s conversation include: Erik’s background and career journey (2:51) Managing scale in a rapidly changing environment (6:35) The people side of hypergrowth (12:36) Coding competitions (17:50) Introducing Modal Labs (19:02) How Erik got into building Modal (21:45) The employee experience at Modal (28:09) How a data engineering team would use Modal (31:21) What it takes to build a platform like Modal (36:27) What makes Modal different (42:49) Evolution coming for the data ...

Nov 02, 202258 min

110: How Can Data Discovery Help You Understand Your Data? Featuring Shinji Kim of Select Star

Highlights from this week’s conversation include: Shinji’s background and career journey (3:35) Defining “data discovery” (6:03) The best conditions to use Select Star (8:45) Where Select Star fits on the data spectrum (13:38) Why Select Star is needed (17:35) How Select Star uses metadata (21:02) Exposing data queries (27:04) Composing queries into metadata (33:27) Automating BI tools (37:28) Limits to data governance (41:39) Maintaining economies of scale (48:56) The Data Stack Show is a weekl...

Oct 26, 202258 min

109: How Does Headless Business Intelligence Work? Featuring Artyom Keydunov and Pavel Tiunov of Cube Dev

Highlights from this week’s conversation include: The context of Headless BI (3:31) What Cube Dev does (9:24) How Headless BI works with other tools (13:03) An analysis of LookML (18:04) User interaction with Cube Dev (23:40) Who manages data artifacts (25:22) Taking care of the developer experience (30:37) Levels of performance (30:37) Artyom and Pavel’s background and career journey (35:47) Why you should use Cube Dev (43:38) Roles within a data organization (48:55) How Cube Dev impacts visual...

Oct 19, 20221 hr 1 min

108: You Can’t Separate Data Reliability From Workflow with Gleb Mezhanskiy of Datafold

Highlights from this week’s conversation include: Gleb’s background and career journey (2:51) The adoption problems (10:53) How Datafold solves these problems (18:08) The vision for Datafold (26:27) Incorporating Datafold as a data engineer (38:53) The importance of the data engineer (42:12) Something to keep in mind when designing data tools (46:46) Implementing new technology into your company (53:18) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each ...

Oct 12, 20221 hr

107: Building Modern Data Teams with dbt Labs, REI, and Robinhood

Highlights from this week’s conversation include: Introducing our guests (3:05) Defining “data team” (4:40) How data teams emerge and evolve (14:11) The need that forces the creation of a data team (21:12) The backbone of the data team (26:23) Building a career within a data team (36:39) Advice for new data team managers (47:35) Question and answer time (52:38) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts...

Oct 05, 20221 hr 3 min

106: Optimizing Query Workloads (and Your Snowflake Bill) with Vinoo Ganesh of Bluesky Data

Highlights from this week’s conversation include: Vinoo’s background and career journey (2:43) How to benchmark cost (7:54) How Bluesky addresses rising Snowflake bills (14:01) “Workload” as defined by Bluesky (17:14) Space for BI optimization (22:55) How products manage bill growth (28:34) How to optimize your workloads (35:37) Bluesky’s partnerships (39:53) Getting real-time feedback on your work (44:50) Where to begin reevaluating your Snowflake game (50:47) The Data Stack Show is a weekly po...

Sep 28, 202256 min

105: The Modern Data Stack Is Just Getting Started with Astasia Myers of Quiet Capital

Highlights from this week’s conversation include: Astasia’s background and career journey (3:03) How Astasia evaluates data companies (5:25) Defining “modern data stack” (8:39) The limit of the complexity of a solution (18:44) How risky early-stage acquisition really is (26:15) Flashing headlight advice for investing (30:17) Signs you should do a product integration (33:38) The next data infrastructure opportunities (36:19) The likelihood of two data worlds merging (43:55) How important open sou...

Sep 21, 202259 min
For the best experience, listen in Metacast app for iOS or Android
Open in Metacast