Data Pipelines with Apache Airflow - podcast episode cover

Data Pipelines with Apache Airflow

Nov 26, 202444 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

This Book provides a comprehensive guide to Apache Airflow, a powerful open-source workflow management platform commonly used in data-intensive environments. It covers the fundamentals of Airflow, including defining data pipelines as directed acyclic graphs (DAGs), scheduling and executing these pipelines, monitoring their performance, and handling failures. The book also explores advanced topics such as templating tasks, building custom components, integrating with external systems, and designing tests for your pipelines. The authors then demonstrate how to deploy and operate Airflow in production environments, including securing the system, managing resources efficiently, and collecting metrics for monitoring. Finally, the book includes detailed guidance on deploying Airflow in various cloud environments, including AWS, Azure, and GCP.


You can listen and download our episodes for free on more than 10 different platforms:
https://linktr.ee/cyber_security_summary

Get the Book now from Amazon:
https://www.amazon.com/Data-Pipelines-Apache-Airflow-Harenslak/dp/1617296902?&linkCode=ll1&tag=cvthunderx-20&linkId=39a43518fff3b8fca733494faa3cb6df&language=en_US&ref_=as_li_ss_tl




Discover our free courses in tech and cybersecurity, Start learning today:
https://linktr.ee/cybercode_academy
For the best experience, listen in Metacast app for iOS or Android