Plumbers of Data Science

Andreas Kretz | learndataengineering.com
Data Engineering is the plumbing of data science. Almost invisible, but super important and a big mess when done wrong. We talk about interesting Data Engineering trends and topics. I also teach Data Engineering in my Data Engineering Academy at LearnDataEngineering.com.

Episodes

#90 Taylor McGrath - The Future of the Modern Data Stack

Super happy to have Taylor with me on this stream. She is the VP of Data Labs at Rivery and therefore has a lot of experience with data platforms. We'll talk about the modern data stack and where it's going. I'm excited to hear about her experience with the changes that are happening in the data space, and what they mean for data engineers & data teams.

Jan 25, 2023 | 47 min | Season 5 | Ep. 2

#89 Piyush Sachdeva - Getting Into Google After Eight Rejections from Amazon!

In this video I talk to Piyush, who's an engineer at Google and has his own YouTube channel: "Tech Tutorials with Piyush". He's a really good guy and I love how dedicated he is to teaching engineering. We talk about some awesome topics: Is LinkedIn a must for getting a job? Tips for recording yourself. Cloud Engineering vs Data Engineering. Which cloud platform should you choose right now? The amazing Google work culture explained. Everybody should learn how to use Kubernetes. How getting ...

Jan 16, 2023 | 44 min | Season 5 | Ep. 1

#88 - Wouter Trappers - How to Realize a Data Strategy Like a Pro!

I have seen people get data strategy wrong a few times. Luckily, Wouter Trappers, who helps companies with this professionally, can help. We talked about the steps you need to take from value proposition to dashboards. Wouter is really knowledgeable, and it was super fun talking with him and hearing his approach.

Apr 12, 2022 | 40 min | Season 4 | Ep. 2

#87 - Dhruba Borthakur - From Hadoop to real time analytics

Dhruba Borthakur is CTO at Rockset and a passionate Data Engineer. Before co-founding Rockset he played a big role in the development of Hadoop HDFS at Yahoo as well as HBase and RocksDB at Facebook. His current project is the serverless Rockset platform, where you can gain real-time analytics insight into your data. I tried it out before our talk and really liked it.

Apr 12, 2022 | 1 hr 6 min | Season 4 | Ep. 1

#86 The Ultimate Data Engineering Introduction

The Podcast is back!!!! I promise I am going to keep it up to date this time ;) In this episode I talk about my newest Data Engineering course. I think it's the ultimate 1 hour 15 minute introduction to Data Engineering. There were also a ton of questions from the chat that I answered. I think you'll really enjoy this.

Jan 14, 2021 | 1 hr 15 min | Season 3 | Ep. 1

#085 Big Data and Data Science Landscape plus trying to read Tweets with Nifi

We are looking into the network communication protocol map. I first saw this like 10 years ago and it's awesome. Then we check out the Big Data and Data Science Landscape image. It shows you all the tools available to do data science, machine learning and data engineering, which is very helpful if you are researching tools to use. Before using the Twitter API you have to create a developer account, so I show you how I created one. After that I tried to get Nifi to download Tweets but it is no...

May 28, 2019 | 43 min | Season 2 | Ep. 55

#083 Data Engineering at OLX Case Study

Today, a case study about OLX with a guest. It was super fun! Here are the slides Alexey and I talked about: https://www.slideshare.net/mobile/AlexeyGrigorev/image-models-infrastructure-at-olx

May 27, 2019 | 1 hr 11 min | Season 2 | Ep. 53

#082 Reading Tweets With Apache Nifi & IaaS vs PaaS vs SaaS

In this episode we install the Nifi Docker container and look into how we can extract the Twitter data. We also talk about the differences between infrastructure as a service, platform as a service and software as a service.

May 27, 2019 | 1 hr 19 min | Season 2 | Ep. 52

#081 How to get tweets from the Twitter API

In this episode we look into the Twitter API documentation, which I love by the way. How can we get old tweets for certain hashtags, and how can we get current live tweets for these hashtags?

May 27, 2019 | 1 hr 10 min | Season 2 | Ep. 51

#077 Lambda and Kappa Architecture

In this episode we talk about the Lambda Architecture with stream and batch processing, as well as an alternative, the Kappa Architecture, that consists only of streaming. We also cover data engineer vs data scientist and discuss Andrew Ng's AI Transformation Playbook.

May 27, 2019 | 1 hr 22 min | Season 2 | Ep. 47

#076 Cloud vs On Premise How To Decide

How do you choose between Cloud and On-Premise? We look at the pros and cons and what you have to think about, because there are good reasons not to go cloud. Also, thoughts on how to choose between the cloud providers by just comparing instance prices. Otherwise the comparison will drive you insane.

May 27, 2019 | 1 hr 16 min | Season 2 | Ep. 46

#071 Data Engineering At Spotify Case Study

In this episode we are looking at the data engineering at Spotify, my favorite music streaming service. How do they process all that data?

May 27, 2019 | 43 min | Season 2 | Ep. 41

#070 The Engineering Culture At Spotify

In this podcast we look at the engineering culture at Spotify, my favorite music streaming service. The process behind the development of Spotify is really awesome.

May 27, 2019 | 55 min | Season 2 | Ep. 40

#068 A Budget Data Science PC Build

Configuring a sub 1000 dollar PC for data engineering and machine learning. Links to the builds:
900$ build: https://pcpartpicker.com/list/22ThcY
1500$ build: https://pcpartpicker.com/list/hXJdV6

May 27, 2019 | 21 min | Season 2 | Ep. 38

#065 Data Engineering At CERN Case Study

A look into how CERN is doing Data Engineering. They get huge amounts of data from the Large Hadron Collider. Let's check it out.

May 27, 2019 | 1 hr 16 min | Season 2 | Ep. 35