Data Pipelines with Apache Airflow, Second Edition: Orchestration for data and AI [9781633436374]

Paperback
SKU: 9781633436374
Buy More - Save More. Below are the available bulk discount rates for each individual item when you purchase a certain amount.
Quantity      Savings
25 - 99       15%
100 - 249     16%
250 - 499     17%
500 - 999     18%
1000+         20%

Format Lightweight and affordable. Perfect for student groups and classrooms, and a versatile option for corporate trainings, team reads, or large-scale events.

Price $59.99

You can purchase this title directly online anytime! If you need a formal quote for budget approval, submit a request and we’ll get it to you quickly.
  • Free shipping over $95
  • Price Match Guarantee. Found a better price? Let us know! We’ll work to match it so you get the best value with BookPal.

Overview



Data Pipelines with Apache Airflow has empowered thousands of data engineers to build more successful data platforms. This new second edition has been fully revised for Airflow 3 with coverage of all the latest features of Apache Airflow, including the TaskFlow API, deferrable operators, and Large Language Model integration. Filled with real-world scenarios and examples, it guides you carefully from Airflow novice to expert.

This book teaches you how to simplify and automate data pipelines, reduce operational overhead, and smoothly integrate all the technologies in your stack. Part reference and part tutorial, it illustrates each technique with engaging hands-on examples, from training machine learning models for generative AI to optimizing delivery routes.

In Data Pipelines with Apache Airflow, Second Edition you'll learn how to:

• Master the core concepts of Airflow architecture and workflow design
• Schedule data pipelines using the Dataset API and timetables, including complex irregular schedules
• Develop custom Airflow components for your specific needs
• Implement comprehensive testing strategies for your pipelines
• Apply industry best practices for building and maintaining Airflow workflows
• Deploy and operate Airflow in production environments
• Orchestrate workflows in container-native environments
• Build and deploy Machine Learning and Generative AI models using Airflow

About the Technology

Apache Airflow provides a unified platform for collecting, consolidating, cleaning, and analyzing data. With its easy-to-use UI, powerful scheduling and monitoring features, plug-and-play options, and flexible Python scripting, Airflow makes it easy to implement secure, consistent pipelines for any data or AI task.

About the book

Data Pipelines with Apache Airflow, Second Edition teaches you how to build, monitor, and maintain effective data workflows. This new edition adds comprehensive coverage of Airflow 3 features, such as event-driven scheduling, dynamic task mapping, DAG versioning, and Airflow’s entirely new UI. The numerous examples address common use cases like data ingestion and transformation and connecting to multiple data sources, along with AI-aware techniques such as building RAG systems.

What's inside

• Deploying data pipelines as Airflow DAGs
• Time and event-based scheduling strategies
• Integrating with databases, LLMs, and AI models
• Deploying Airflow using Kubernetes

About the reader

For data engineers, machine learning engineers, DevOps, and sysadmins with intermediate Python skills.

About the authors

Julian de Ruiter, Ismael Cabral, Kris Geusebroek, Daniel van der Ende, and Bas Harenslak are seasoned data engineers and Airflow experts.

Table of Contents

Part 1
1 Meet Apache Airflow
2 Anatomy of an Airflow DAG
3 Time-based scheduling
4 Asset-aware scheduling
5 Templating tasks using the Airflow context
6 Defining dependencies between tasks
Part 2
7 Triggering workflows with external input
8 Communicating with external systems
9 Extending Airflow with custom operators and sensors
10 Testing
11 Running tasks in containers
Part 3
12 Best practices
13 Project: Finding the fastest way to get around NYC
14 Project: Keeping family traditions alive with Airflow and generative AI
Part 4
15 Operating Airflow in production
16 Securing Airflow
17 Airflow deployment options
A Running code samples
B Prometheus metric mapping

Data Pipelines with Apache Airflow, Second Edition: Orchestration for data and AI (ISBN 9781633436374), a paperback by Julian de Ruiter, Ismael Cabral, Kris Geusebroek, Daniel van der Ende, and Bas Harenslak, may be ordered in bulk quantities. The minimum order is 25 copies. Availability depends on publisher status and the quantity ordered.

Details

Author:
Julian de Ruiter, Ismael Cabral, Kris Geusebroek, Daniel van der Ende, Bas Harenslak
Format:
Paperback
Publication Date:
01/27/2026
ISBN-10:
1633436373
ISBN-13:
9781633436374
Pages:
512
Publisher:
Manning

