Название: Data Pipelines with Apache Airflow, Second Edition (MEAP v9)
Автор: Julian de Ruiter, Ismael Cabral, Kris Geusebroek, Daniel van der Ende, Bas Harenslak
Издательство: Manning Publications
Год: 2025
Страниц: 401
Язык: английский
Формат: pdf (true), epub
Размер: 56.2 MB
Simplify, streamline, and scale your data operations with data pipelines built on Apache Airflow. Apache Airflow provides a batteries-included platform for designing, implementing, and monitoring data pipelines. Building pipelines on Airflow eliminates the need for patchwork stacks and homegrown processes, adding security and consistency to the process. Now in its second edition, Data Pipelines with Apache Airflow teaches you to harness this powerful platform to simplify and automate your data pipelines, reduce operational overhead, and seamlessly integrate all the technologies in your stack. This book focuses on Apache Airflow, a batch-oriented framework for building data pipelines. Airflow’s key feature is that it enables you to easily build scheduled data pipelines using Python, while also providing many building blocks that allow you to stitch together the many different technologies encountered in modern technological landscapes. For DevOps, data engineers, Machine Learning engineers, and sysadmins with intermediate Python skills.