The PRQL: Kaskada Serving as a Recommendation Engine with Davor Bonaci of DataStax
In this bonus episode, Eric and Kostas preview their upcoming conversation with Davor Bonaci of DataStax.
In this bonus episode, Eric and Kostas preview their upcoming conversation with Davor Bonaci of DataStax.
Highlights from this week’s conversation include: Aditya’s background and journey in the data space (2:47) What does Ponder do? (5:18) 101 on Pandas and why people utilize it (6:42) The challenge of translating Pandas to a big data platform (16:11) Data Warehouses and ML workflows (21:27) The differences in the “zoo” of data languages (26:56) Why do ML and data engineering have to be so different in languages? (34:39) Builders should be adapting to the users and not the other way around (39:32) ...
In this bonus episode, Eric and Kostas preview their upcoming conversation with Aditya Parameswaran of Ponder.
Highlights from this week’s conversation include: A.J.’s background and journey in data (2:23) Challenges with Hadoop ecosystem (8:50) Starting InfinyOn and the need for innovation (10:02) Challenges with Kafka and Microservices (14:01) Real-time data streaming for IoT devices (19:28) Paradigm shift to real-time data processing (22:17) Benefits of Rust (29:45) Web Assembly and Platform Features (36:29) Analytics and Event Correlation (40:16) Real-time data processing (47:03) ETL vs ELP (52:20) F...
In this bonus episode, Eric and Kostas preview their upcoming conversation with A.J. Hunyady, Founder and CEO of InfinyOn.
Highlights from this week’s conversation include: Josh’s background in data working at Google, Slack, and other companies (1:21) The need and process for high quality data (4:33) Digging into auction code (14:03) Joining Slack and working in the early days of the company (18:00) Not fighting the last war in data (25:42) Building a product, while using the product (30:35) Transitioning to the search team at Slack (36:50) Usage patterns of search (41:21) Josh’s work in helping build DuckDB (46:20)...
In this bonus episode, Eric previews his upcoming conversation with Josh Wills, an experienced data scientist who has worked with IBM, Google, Slack, DuckDB, and more.
Highlights from this week’s conversation include: Dhruba’s journey into the data space (2:02) The impact of Hadoop on the industry (3:37) Dhruba’s work in the early days of the Facebook team (7:54) Building and implementing RocksDB (14:33) Stories with Mark Zuckerberg at Facebook (24:25) The next evolution in storage hardware (26:14) How Rockset is different from other real-time platforms (33:13) Going from a key value store to an index (37:15) Where does Rockset go from here? (44:59) The succes...
In this bonus episode, Eric and Kostas preview their upcoming conversation with Dhruba Borthakur of Rockset.
Highlights from this week’s conversation include: The origin story of Data Council (0:39) Developments for the future of Data Council (2:42) The emphasis of AI and ChatGPT at this year’s conference (3:54) The support of the data community (5:31) Biggest changes and innovations in the industry (7:10) What’s next for the Data Council? (10:46) Getting connected with Data Council (13:07) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to d...
Highlights from this week’s conversation include: Gunner’s background in data (0:32) Setting the vision in early days of Red Hat and spearheading Debezium (6:20) Replication of data in Debezium (9:47) The patterns and processes of Debezium (16:21) Debezium working with Kafka (19:03) Building a diverse system while incorporating common interfaces (24:09) The importance of documentation in open-sourced projects (27:59) Debezium’s vision moving forward (31:32) Why aren’t there more CDC open-sourced...
Highlights from this week’s conversation include: Michael’s journey to co-founding Tecton (0:22) The evolution of MLops and platform teams (3:50) Understanding boundaries between the data platform and the MLops (8:42) Differences in machine learning vs data pipelines (16:58) The systems needed to handle all these types of data (22:22) Developer experience in Tecton (25:15) Automating challenges in ML development (32:30) The most difficult part of the life cycle of prediction (37:24) Exciting new...
Highlights from this week’s conversation include: Will’s background in data (0:28) Privacy dynamics and data anonymization (4:18) Addressing data privacy problems in the space (10:33) Developer experience with Privacy Dynamics (13:49) How does Privacy Dynamics work? (21:09) Update of real-time anonymized data (26:29) The problem of dates and other complexities in data (31:24) Being a data engineer in a startup (34:44) Moving at the speed of a startup (41:01) Connecting with Will and Privacy Dyna...
Highlights from this week’s conversation include: Chase’s journey to where he is today (0:51) Lessons in go-to-market roles which helps in the VC world (2:38) Differentiating between go-to-market and distribution (8:13) Taking an idea to the market (11:33) Hardest part of the pitch (17:08) Playbooks for go-to-market founders to follow (20:25) Focus of sales and marketing in go-to-market strategy (28:01) Answering the what and how of the problem you are solving (32:30) The importance of pricing i...
Highlights from this week’s conversation include: Introducing the team from Featureform (0:31) In the work vs. leading the work (3:01) Difference between MLOps and data ops (7:06) The MLOps cycle (10:12) What is Featureform and what makes it different? (13:30) Is there another layer needed in feature stores? (18:46) Getting in touch with Featureform (23:55) The Data Stack Show is a weekly podcast powered by RudderStack, the CDP for developers. Each week we’ll talk to data engineers, analysts, an...
Highlights from this week’s conversation include: Eric’s journey to becoming CEO of Decoable (0:20) Does real time matter? (2:12) Differences in stream processing systems (7:57) Processing in motion (13:04) Why haven’t there been more open source projects around CDC? (20:34) The Decodable experience and future focuses for the company (24:31) Streaming processing and data lakes (32:54) Data flow processing technologies of today (39:01) The Data Stack Show is a weekly podcast powered by RudderStac...
Highlights from this week’s conversation include: Origins of OtterTune (4:43) The problem of knob tuning (6:25) Roles of machine learning (9:32) OtterTune’s development and industry recognition (12:03) The challenges of database tuning and the role of human expertise (16:15) Tuning in production (20:23) Observability and Data Collection (23:37) Data Security and Privacy (29:59) Optimizing on-prem vs. cloud workloads (35:52) Performance benchmarks (40:20) Future opportunities OtterTune is focusin...
In this bonus episode, Eric and Kostas preview their upcoming conversation with with Andy Pavlo and Dana Van Aken of OtterTune.
Highlights from this week’s conversation include: The journey of H.O. into data and becoming the CEO of FeatureBase (2:37) Characteristics of the super evolution in technology (6:36) ChatGPT as the missionary of AI (9:45) The tension between authenticity and technology (13:12) What is FeatureBase? (17:53) Comparing FeatureBase to feature stores (25:58) Workload capacities and possibilities in FeatureBase (33:20) The importance of developer experience on a platform (38:23) Exciting developments f...
On this bonus episode, Eric and Kostas preview their upcoming conversation with H.O. Maycotte of FeatureBase.
Highlights from this week’s conversation include: Sammy’s background in data and tooling (2:46) Going from multipurpose engineering to a CTO position (5:14) Changes in technology and deep learning models (7:31) The state of self-driving and adoption (13:49) What is Eventual and what are they solving in the space? (20:54) What are daft and data frame and how they work? (28:11) Building a query optimizer (33:42) Sammy’s take on what is going on in data and future possibilities (45:18) Eventual’s f...
In this bonus episode, Eric and Kostas preview their upcoming conversation with Sammy Sidhu, Co-Founder and CEO of Eventual.
Highlights from this week’s conversation include: Chad’s background in data (2:10) Breaking down data quality (4:02) Semantic and logical layers of data (10:04) What are data contracts and how do they work? (17:41) Implicit contracts at companies (24:01) Where do data contracts fit in data infrastructure? (28:14) The value of data contracts to the producer and consumer (31:18) Tools needed in effective data contracts (46:13) The importance of community in data quality (50:53) Getting connected t...
In this bonus episode, Eric and Kostas preview their upcoming conversation with Chad Sanderson of Data Quality Camp.
Highlights from this week’s conversation include: Defining CDPs (2:28) The data team's role in marketing (7:41) Leveraging commonalities across businesses (12:49) Building a CDP with customer data (18:05) Challenges in identity modeling (23:00) CDP lifecycle and one-to-one data (30:06) Segmentation and optimization (33:23) Real-time data in the cloud (40:37) The future of AI and machine learning (43:02) Final thoughts and takeaways (46:42) The Data Stack Show is a weekly podcast powered by Rudde...
In this bonus episode, Eric and Kostas preview their upcoming conversation with Jason Davis of Simon Data.
Highlights from this week’s conversation include: Vijay’s background in data (2:09) The journey of founding ThoughtSpot and its impact in the world of BI (2:49) The maturation of BI (6:34) What is NetSpring.io? (8:21) Bridging the gap of BI and product analytics (14:41) Why data warehouses and not time-series databases? (19:58) The difficulty of using SQL in product analytics (28:35) Challenges in pricing models for product analytics and tooling (35:41) Combining analytics and attribution (42:00...
In this bonus episode, Eric and Kostas preview their upcoming conversation with Vijay Ganesan of NetSpring.io.
Highlights from this week’s conversation include: David’s background and journey to Timescale (2:12) What are time series databases? (14:13) How Timescale would have impacted David’s trajectory early in his career (17:51) Innovation in postgreSQL (21:02) Why does Timescale build their timeseries databases differently? (27:08) The challenges of building a new database on top of an old software (32:22) Writing outside of SQL and Timescale’s secret sauce (37:47) The importance of the developer expe...
In this bonus episode, Eric and Kostas preview their upcoming conversation with David Kohn of Timescale.