108: PySpark - Jonathan Rioux - podcast episode cover

108: PySpark - Jonathan Rioux

Apr 09, 202032 minSeason 1Ep. 108
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Apache Spark is a unified analytics engine for large-scale data processing.
PySpark blends the powerful Spark big data processing engine with the Python programming language to provide a data analysis platform that can scale up for nearly any task.

Johnathan Rioux, author of "PySpark in Action", joins the show and gives us a great introduction of Spark and PySpark to help us decide how to get started and decide whether or not to decide if Spark and PySpark are right you.

Special Guest: Jonathan Rioux.

Sponsored By:

Links:

★ Support this podcast on Patreon ★
For the best experience, listen in Metacast app for iOS or Android
Open in Metacast
108: PySpark - Jonathan Rioux | Test & Code podcast - Listen or read transcript on Metacast