
The Data Stack Show

RudderStack · datastackshow.com
Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

Episodes

131: How Data Teams Interact With Marketing Tools with Jason Davis of Simon Data

Highlights from this week’s conversation include: Defining CDPs (2:28) The data team's role in marketing (7:41) Leveraging commonalities across businesses (12:49) Building a CDP with customer data (18:05) Challenges in identity modeling (23:00) CDP lifecycle and one-to-one data (30:06) Segmentation and optimization (33:23) Real-time data in the cloud (40:37) The future of AI and machine learning (43:02) Final thoughts and takeaways (46:42) The Data Stack Show is a weekly podcast powered by Rudde...

Mar 22, 2023 · 48 min

130: From Business Intelligence to Product Analytics and Beyond with Vijay Ganesan of NetSpring.io

Highlights from this week’s conversation include: Vijay’s background in data (2:09) The journey of founding ThoughtSpot and its impact in the world of BI (2:49) The maturation of BI (6:34) What is NetSpring.io? (8:21) Bridging the gap of BI and product analytics (14:41) Why data warehouses and not time-series databases? (19:58) The difficulty of using SQL in product analytics (28:35) Challenges in pricing models for product analytics and tooling (35:41) Combining analytics and attribution (42:00...

Mar 15, 2023 · 58 min

129: Databases, Data Warehouses, and Timeseries Data with David Kohn of Timescale

Highlights from this week’s conversation include: David’s background and journey to Timescale (2:12) What are time series databases? (14:13) How Timescale would have impacted David’s trajectory early in his career (17:51) Innovation in PostgreSQL (21:02) Why does Timescale build their timeseries databases differently? (27:08) The challenges of building a new database on top of old software (32:22) Writing outside of SQL and Timescale’s secret sauce (37:47) The importance of the developer expe...

Mar 08, 2023 · 1 hr 9 min

The PRQL: Time-Series Data 101

In this bonus episode, Eric and Kostas preview their upcoming conversation with David Kohn of Timescale.

Mar 06, 2023 · 4 min

128: The Possibilities Are Endless for Synthetic Data with Alex Watson of Gretel.ai

Highlights from this week’s conversation include: Alex’s background working for NSA and starting a company (1:51) The Gretel.ai journey (9:30) Defining synthetic data (13:26) The evolution of AI in deep learning data and language learning (16:28) The properties of synthetic data (21:31) Boundaries between synthetic data and prediction models (25:52) The developer experience in Gretel.ai (36:44) Stewardship and expansion of deep learning models in the future (45:36) Final thoughts and takeaways (...

Mar 01, 2023 · 57 min

127: The Anatomy of a Data Lakehouse with Alex Merced of Dremio

Highlights from this week’s conversation include: Alex’s background in the data space (2:41) Blending comics and pop culture with finance training (5:20) What is a data lakehouse? (7:36) What is Dremio solving for users? (11:21) Essential components of a data lakehouse (16:35) Difference between on-prem and cloud experiences (33:53) What does it mean to be a developer advocate? (41:31) Final thoughts and takeaways (49:02) The Data Stack Show is a weekly podcast powered by RudderStack, the C...

Feb 22, 2023 · 53 min

126: Crossing the Product Analytics Chasm with Spenser Skates of Amplitude Analytics

Highlights from this week’s conversation include: Spenser’s journey to Co-Founding Amplitude (3:02) Looking back over the last decade of success at Amplitude (8:31) Going from Engineer to Sales (14:41) Comparing product analytics and general analytics (20:11) How cloud data warehousing has impacted analytics (31:38) Providing an out-of-the-box experience for consumers (41:12) Final thoughts and takeaways (54:27) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for develope...

Feb 15, 2023 · 58 min

125: Authorization Is A Data Problem with Jeff Chao of Abbey Labs

Highlights from this week’s conversation include: Jeff’s background at Netflix and Stripe leading him to Abbey Labs (2:22) What Abbey is solving in the space (5:16) Tackling permissions in an organization (7:30) Opportunities to improve the availability of data (10:14) The challenge of tackling a new problem area at a new company (14:59) What are the most common challenges in the identity and security space? (18:43) Importance of identity and the ability to track it in data (22:46) Connecting all ...

Feb 08, 2023 · 55 min

124: Pragmatism About Data Stacks with Pedram Navid of West Marin Data

Highlights from this week’s conversation include: Pedram’s journey into the world of data (4:05) What should the data stack at an early-stage startup look like? (9:53) New ideas surrounding access control for data (24:45) What can data teams learn about complexity from software engineering? (30:55) Scaling up instead of scaling out in processing data (37:40) Why DuckDB is making so much noise in the market (41:06) Final thoughts and takeaways (53:25) The Data Stack Show is a weekly podcast powere...

Feb 01, 2023 · 57 min

123: What Is a Universal Database? Featuring Stavros Papadopoulos of TileDB, Inc.

Highlights from this week’s conversation include: Stavros’ journey into data and founding TileDB (3:12) What problem was TileDB going to solve? (12:05) Defining database systems (21:35) What part of database architecture is TileDB? (31:58) Storage engine solutions (42:37) What does the API look like in using TileDB? (50:40) What makes genomics unique in working with data (55:28) Final thoughts and takeaways (1:06:46) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for dev...

Jan 25, 2023 · 1 hr 11 min

122: Why Accounting Needs Its Own Database with Joran Greef of Tiger Beetle

Highlights from this week’s conversation include: Joran’s background leading him from accounting to coding (3:10) What is Tiger Beetle? (5:53) Double-entry accounting and why it is important for a database (12:28) The need for low latency and high throughput (26:27) Why financial database software needs a laser focus (29:01) What are people using to implement a double-entry system? (36:09) Safety in financial software and addressing storage faults (40:26) Final thoughts and takeaways (55:52) The...

Jan 18, 2023 · 1 hr

121: Materialize Origins: Breaking Down Data Flow Layers with Arjun Narayan and Frank McSherry

Highlights from this week’s conversation include: Defining data flow (2:31) Are there limitations in timely data flow operation and/or building operators? (8:20) Areas of incremental computation that are having an impact today (17:10) Building a library vs building a product (24:06) Combining delight and empathy into a focus (27:52) Final thoughts and takeaways (32:42) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, ...

Jan 11, 2023 · 37 min

120: Materialize Origins: A Timely Dataflow Story with Arjun Narayan and Frank McSherry

Highlights from this week’s conversation include: What is Materialize? (2:43) Frank and Arjun’s journey in data and what led them to the idea of Materialize (6:22) The good and the bad of research in academia vs starting a company (25:20) The MVP for databases (33:49) Materialize’s end-to-end benefit for the user experience (43:03) Interchanging Materialize in warehouse and cloud data usage (48:25) The trade-offs within Materialize (1:00:02) Final takeaways and previewing part two of the convers...

Jan 04, 2023 · 1 hr 14 min

119: The Data Stack Show Wrapped: 2022

The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data. RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com ....

Dec 28, 2022 · 12 min

118: Bringing Powerful Business Intelligence to Mobile with Zack Hendlin of Zing Data

Highlights from this week’s conversation include: Zack’s extensive background in the world of data and the genesis of Zing (3:02) Working on relevance, feeds, and ads at Facebook & LinkedIn (9:20) Exploring BI and queries on mobile devices (16:48) Reliance on input quality in data (23:28) Delivering a mobile-first experience in BI (30:11) Limitations of visualization on mobile devices (34:00) How BI tools interact with one another in Zing (45:21) The future of user experience in consuming data (...

Dec 21, 2022 · 1 hr 3 min

117: DX for Data Tooling with Taylor Murphy of Meltano

Highlights from this week’s conversation include: Taylor’s journey into data (3:09) What’s been going on at Meltano recently? (7:28) Addressing basic problems in data even with advancements in technology (12:23) What makes Meltano unique in the space (16:53) Why the CLI experience is important (25:37) Quality vs quantity in supporting connectors (35:51) What does data ops look like for Meltano (46:44) Takeaways and closing thoughts (52:56) The Data Stack Show is a weekly podcast powered by Rudde...

Dec 14, 2022 · 56 min

116: Data Democratization & Self Service with Aron Clymer of Data Clymer

Highlights from this week’s conversation include: Aron’s background in the world of data (2:18) Recent clients and major projects (3:30) Helping to spearhead data-driven growth at Salesforce (6:50) Stories about Marc Benioff, co-founder of Salesforce (16:12) Biggest learnings as a consultant in the data strategy space (17:58) The need for data democratization (23:33) Advice for Aron’s younger self in consulting (28:45) Current trends in data democratization and self service (35:01) Aron’s favor...

Dec 07, 2022 · 54 min