
eBook - ePub
Apache Airflow Best Practices
A practical guide to orchestrating data workflow with Apache Airflow
- English
- ePUB (mobile friendly)
- Available on iOS & Android
eBook - ePub
Apache Airflow Best Practices
A practical guide to orchestrating data workflow with Apache Airflow
About this book
Confidently orchestrate your data pipelines with Apache Airflow by applying industry best practices and scalable strategies
Key Features
- Seamlessly migrate from Airflow 1.x to 2.x and explore the key features and improvements in version 2.x
- Learn Apache Airflow workflow authoring through practical, real-world use cases
- Discover strategies to optimize and scale Airflow pipelines for high availability and operational resilience
- Purchase of the print or Kindle book includes a free PDF eBook
Book Description
Data professionals face the challenge of managing complex data pipelines, orchestrating workflows across diverse systems, and ensuring scalable, reliable data processing. This definitive guide to mastering Apache Airflow, written by experts in engineering, data strategy, and problem-solving across tech, financial, and life sciences industries, is your key to overcoming these challenges. Covering everything from Airflow fundamentals to advanced topics such as custom plugin development, multi-tenancy, and cloud deployment, this book provides a structured approach to workflow orchestration. You'll start with an introduction to data orchestration and Apache Airflow 2.x updates, followed by DAG authoring, managing Airflow components, and connecting to external data sources. Through real-world use cases, you'll learn how to implement ETL pipelines and orchestrate ML workflows in your environment, and scale Airflow for high availability and performance. You'll also learn how to deploy Airflow in cloud environments, tackle operational considerations for scaling, and apply best practices for CI/CD and monitoring. By the end of this book, you'll be proficient in operating and using Apache Airflow, authoring high-quality workflows in Python, and making informed decisions crucial for production-ready Airflow implementations.What you will learn
- Explore the new features and improvements in Apache Airflow 2.0
- Design and build scalable data pipelines using DAGs
- Implement ETL pipelines, ML workflows, and advanced orchestration strategies
- Develop and deploy custom plugins and UI extensions
- Deploy and manage Apache Airflow in cloud environments such as AWS, GCP, and Azure
- Plan and execute a scalable deployment strategy for long-term growth
- Apply best practices for monitoring and maintaining Airflow
Who this book is for
This book is ideal for data engineers, developers, IT professionals, and data scientists looking to optimize workflow orchestration with Apache Airflow. It's perfect for those who recognize Airflow's potential and want to avoid common implementation pitfalls. Whether you're new to data, an experienced professional, or a manager seeking insights, this guide will support you. A functional understanding of Python, some business experience, and basic DevOps skills are helpful. While prior experience with Airflow is not required, it is beneficial.
]]>Frequently asked questions
Yes, you can cancel anytime from the Subscription tab in your account settings on the Perlego website. Your subscription will stay active until the end of your current billing period. Learn how to cancel your subscription.
At the moment all of our mobile-responsive ePub books are available to download via the app. Most of our PDFs are also available to download and we're working on making the final remaining ones downloadable now. Learn more here.
Perlego offers two plans: Essential and Complete
- Essential is ideal for learners and professionals who enjoy exploring a wide range of subjects. Access the Essential Library with 800,000+ trusted titles and best-sellers across business, personal growth, and the humanities. Includes unlimited reading time and Standard Read Aloud voice.
- Complete: Perfect for advanced learners and researchers needing full, unrestricted access. Unlock 1.4M+ books across hundreds of subjects, including academic and specialized titles. The Complete Plan also includes advanced features like Premium Read Aloud and Research Assistant.
We are an online textbook subscription service, where you can get access to an entire online library for less than the price of a single book per month. With over 1 million books across 1000+ topics, we’ve got you covered! Learn more here.
Look out for the read-aloud symbol on your next book to see if you can listen to it. The read-aloud tool reads text aloud for you, highlighting the text as it is being read. You can pause it, speed it up and slow it down. Learn more here.
Yes! You can use the Perlego app on both iOS or Android devices to read anytime, anywhere — even offline. Perfect for commutes or when you’re on the go.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Please note we cannot support devices running on iOS 13 and Android 7 or earlier. Learn more about using the app.
Yes, you can access Apache Airflow Best Practices by Dylan Intorf,Dylan Storey,Kendrick van Doorn in PDF and/or ePUB format, as well as other popular books in Computer Science & Data Processing. We have over one million books available in our catalogue for you to explore.
Information
Table of contents
- Apache Airflow Best Practices
- Contributors
- Preface
- Part 1: Apache Airflow: History, What, and Why
- 1
- 2
- Part 2: Airflow Basics
- 3
- 4
- Part 3: Common Use Cases
- 5
- 6
- 7
- 8
- 9
- Part 4: Scale with Your Deployed Instance
- 10
- 11
- 12
- 13
- Index
- Other Books You May Enjoy