Real case: how Airbnb runs their data platform

Module 8 has covered the operational practices that turn a data platform from a collection of pipelines into something a team can actually run. Orchestration, asset-oriented thinking, observability, SLOs, data quality, incidents, on-call. This lesson stitches them together by walking through one company’s published architecture, picking out the artefacts that became industry-standard, and pulling out the lessons that generalise.

The company is Airbnb. Two of the tools in nearly every modern data team’s stack came out of Airbnb’s data platform organisation: Airflow, the orchestrator that defined the category, and Knowledge Repo, the analysis-sharing system that shaped how data teams think about institutional knowledge. Airbnb has also been unusually open about parts of the platform that stayed internal but are still architecturally instructive: Minerva (the metrics layer), Dataportal (data discovery), and the data-quality framework. The combination gives a fuller view of a mature platform than most public case studies offer.

The framing matches the Module 5 Netflix case (lesson 40) and the Module 6 Uber case (lesson 48). One company, a decade of published engineering posts, takeaways that transfer to teams a hundred times smaller. Constants differ; shapes do not.

The scale and the problem

Airbnb’s published numbers shift over time. The rough order of magnitude as of the mid-2020s: hundreds of data engineers and analysts, thousands of dashboards, a petabyte-scale warehouse, tens of thousands of pipelines, and a culture in which every product change goes through an A/B experiment that has to be readable across the organisation.

The pressure points are recognisable. The same metric defined three ways across three teams, with three different numbers in three dashboards, and nobody able to say which was right. Tables nobody could find without asking the person who built them. Pipelines that broke silently and produced wrong dashboards for hours. Analyses emailed around as one-off PDFs and lost when the analyst left. Each problem gets worse with scale, and each is what the investments below were designed to address.

The Airbnb engineering blog (https://medium.com/airbnb-engineering, retrieved 2026-05-01) and the AirbnbEng GitHub organisation are the primary public sources.

Airflow: the orchestrator that defined the category

Airflow was created at Airbnb in 2014 by Maxime Beauchemin and open-sourced in 2015. The original announcement, “Airflow: a workflow management platform” (Maxime Beauchemin, Airbnb Engineering blog, June 2015, https://medium.com/airbnb-engineering/airflow-a-workflow-management-platform-46318b977fd8, retrieved 2026-05-01) is the canonical reference. The project entered the Apache Incubator in 2016 and graduated to top-level in 2019.

The problem was Airbnb-specific in scale and industry-general in shape. Airbnb had hundreds of pipelines whose dependencies were managed by cron, ad-hoc scripts, and bespoke runners. The cron-and-scripts approach has the failure mode lesson 57 covered: it scales badly, debugging is painful, and the team keeps rebuilding the same scheduling primitives. Airflow was the general solution: pipelines as Python code, a DAG abstraction, a scheduler that handles dependencies and retries, a web UI that shows the state of every job, and a plugin model that integrates with arbitrary executors.

The architectural decisions that became defaults for the category:

Pipelines as code. A DAG is a Python file in a Git repo, not a UI configuration. Code review, version control, and refactoring tools apply. The pipeline stops being a separate artefact and becomes normal software.

The DAG abstraction. A directed acyclic graph of tasks with explicit dependencies is the right primitive for batch orchestration. Luigi (the older Spotify project Airflow drew from) used it, and every major orchestrator since has used it. The asset-oriented refinements of lesson 58 assume the DAG as a starting point.

Scheduler and web UI as separate concerns. The scheduler enforces dependencies and triggers tasks; the UI shows what is running and what failed. Operations work happens in the UI; pipeline development happens in the IDE.

A plugin model. Operators (the units of work in a DAG) are pluggable: Bash, Python, Spark, Hive, HTTP, S3, BigQuery, and a long tail of others.

Airflow’s success is measured by how thoroughly the industry adopted it. Within a few years of open-sourcing, it was the default orchestrator for new data platforms. Managed offerings (Astronomer, Google Cloud Composer, AWS MWAA) made it the default even for teams that did not want to operate it themselves. Later competitors (Prefect, Dagster, Argo Workflows) defined themselves partly in contrast to Airflow’s choices, which is the truest measure of a tool’s influence: it became the baseline newer tools argue with.

The lesson is not “build your own orchestrator”. It is that the orchestrator is critical infrastructure, the choice matters, and the problem space is now well-served by mature open-source tools.

Minerva: the metrics layer

Minerva is Airbnb’s internal semantic layer for metrics, described in the 2020 to 2021 sequence of Airbnb engineering posts including “How Airbnb Achieved Metric Consistency at Scale” (https://medium.com/airbnb-engineering/how-airbnb-achieved-metric-consistency-at-scale-f23cc53dea70, retrieved 2026-05-01).

The problem Minerva solved is the most chronic in any analytics organisation past one team. The same metric (active hosts, gross booking value, conversion rate, retention by cohort) gets defined separately in every dashboard, A/B framework, and report. Definitions drift. The same name produces three different numbers across three surfaces. Disputes about which is right consume meeting time. Decisions get made on whichever number was at hand.

Minerva makes the metric definition the source of truth, defined once and consumed everywhere:

A central metric repository. Metrics are defined in a configuration language with SQL generated under the hood. Each has a name, definition, owner, description, and version history. Adding a metric means a file plus code review.

A query layer. Consumers (dashboards, A/B analyses, alerts, the experimentation platform) ask Minerva for a metric by name across a dimension and time range. Minerva generates the SQL and returns the result.

Dimensional consistency. Dimensions (country, host type, listing type, channel, week, cohort) are defined alongside metrics, with the same review process. “By country” means the same thing everywhere.

Pre-computation. Frequently queried metrics get pre-aggregated into denormalised tables. Consumers see fast responses; the platform owns the pre-computation.

The architectural argument: the cost of building the layer is paid once. The cost of inconsistency is paid forever, in every dashboard review and every A/B debrief, distributed across every analytical conversation and therefore invisible. The trade-off favours building the layer at any organisation past a small handful of teams.

Open-source equivalents that arrived after Minerva include dbt’s metrics features, Cube (https://cube.dev/), Lightdash, and Transform’s MetricFlow (acquired by dbt Labs in 2023). As of 2026 the market is mature enough that most teams should adopt one rather than build their own. The lesson is not “every team needs a custom metrics layer”; it is that every team needs a metrics layer, and the build-versus-buy calculus has shifted decisively toward buy.

Dataportal: data discovery as a product

Dataportal is Airbnb’s internal data-discovery system, described in “Democratizing Data at Airbnb” (Chris Williams et al., Airbnb Engineering blog, May 2017, https://medium.com/airbnb-engineering/democratizing-data-at-airbnb-852d76c51770, retrieved 2026-05-01). The pitch: a search UI for tables, dashboards, metrics, and the people who own them.

Small teams do not have this problem; large teams cannot avoid it. With ten tables, an analyst remembers them. With a hundred, the analyst asks a colleague. With ten thousand, asking stops working: the colleagues do not know either. Without a discovery tool, the time-to-answer for “where is the data on this thing” stretches from seconds to days, and analytical productivity collapses long before the platform itself does.

Dataportal treats discovery as a search problem and a product problem. The search problem: index every table, dashboard, and analysis along with its metadata (schema, owner, last update, lineage, tags, descriptions, usage). The product problem: treat the UI as a first-class consumer of the platform, with the same care for usability that a customer-facing product would get.

Open-source successors include LinkedIn DataHub, Lyft’s Amundsen, Apache Atlas, and OpenMetadata. The lesson Dataportal taught the industry is that data discovery cannot be an afterthought: at scale, finding the right table is harder than querying it.

Wall and the data-quality framework

Airbnb’s internal data-quality framework, referred to as Wall in some older posts, is described in “Achieving High Quality Data at Airbnb” and related posts (https://medium.com/airbnb-engineering/data-quality-at-airbnb-e5d5a44f7b5b, retrieved 2026-05-01). The shape is what lesson 61 covered in the abstract: declarative tests on tables, automated alerts when a test breaks, a central system that owns the test results and routes them to the owners.

The Airbnb-specific elements are scale and integration. The framework runs against thousands of tables. Tests live next to the table definitions, so adding a new table comes with the expectation it will have tests. The alert routing is integrated with on-call (lesson 63): a critical test failure pages the team that owns the table, not a generic data-team channel nobody reads.

The lesson is the one lesson 61 made: data quality is not a project, it is a programme. Without the substrate, every issue is an ad-hoc heroic effort. With it, data quality is routine. The open-source equivalents (Great Expectations, dbt tests, Soda Core, Monte Carlo, Datafold) are mature enough that a 2026 team should adopt one rather than build fresh.

Knowledge Repo was open-sourced by Airbnb in 2016 and described in “Scaling Knowledge at Airbnb” (Chetan Sharma, Airbnb Engineering blog, March 2016, https://medium.com/airbnb-engineering/scaling-knowledge-at-airbnb-875d73eff091, retrieved 2026-05-01). The repo is on GitHub (https://github.com/airbnb/knowledge-repo).

The problem is cultural as much as technical. An analytical organisation produces a continuous stream of analyses: A/B write-ups, deep-dive investigations, ad-hoc queries that became insights. Without a shared place to live, each is a one-off email or PDF, and the knowledge it represents disappears within months. New analysts rebuild the same investigations the old ones already did.

The design is deliberately simple: analyses are Markdown files (with embedded code and plots) in a Git repository. A web UI renders them. Analyses go through pull-request review, just like code. Every published analysis has an author, a date, tags, and a searchable body.

The architectural lesson is small. The cultural lesson is large. A team that publishes its analyses internally develops a habit of writing them up well, because the audience is its colleagues. A team that reads its colleagues’ analyses develops a shared vocabulary and a higher analytical floor. The tool is the substrate; the culture is what the substrate enables. Airbnb’s rollout paired the tool with explicit encouragement to write up analyses, with leaders modelling the behaviour.

The lessons

Five takeaways, structured the way the Netflix case (lesson 40) and the Uber case (lesson 48) structured them.

Build the shared layer. When ten teams need the same metric, define it once. The cost of the shared layer is paid once; the cost of inconsistency is paid forever. Minerva is the prototype, and the lesson generalises to catalogues, quality frameworks, and orchestration: the platform’s job is to absorb complexity application teams should not manage.

Discoverability is its own product. At scale, finding the right table is harder than querying it. Teams that ignore this discover their analysts spend more time hunting than analysing.

Open-source what you build, when the calculus favours it. Airflow paid back in community contributions and ecosystem alignment. Knowledge Repo had a similar smaller story. Minerva and Dataportal stayed internal, and open-source equivalents (dbt metrics, DataHub, Amundsen) emerged later. Smaller teams should usually adopt the resulting open-source tools rather than build new ones.

The metrics layer is critical infrastructure. Not a BI feature, not a code-review convention. A platform component with the same operational seriousness as the warehouse or the orchestrator. Define ownership, define SLOs (lesson 60), test it (lesson 61), monitor it (lesson 59), put it in the on-call’s scope (lesson 63).

Culture is a deliverable. Knowledge Repo is the clearest example. The metrics layer only delivers consistency if the team uses it. The data-quality framework only delivers quality if the team treats test failures as real failures. The on-call rotation only delivers reliability if the team treats it as real work. Tools enable culture; tools do not replace it.

Cross-references back into Module 8

The Airbnb stack exercises every primitive Module 8 introduced. Airflow is the orchestrator from lesson 57. The asset-oriented refinements of lesson 58 are the natural evolution Airflow itself has been moving toward through the Datasets feature added in 2.4. Observability (lesson 59) is the substrate that makes the data-quality framework actionable. SLOs (lesson 60) are how the platform team makes promises about Minerva’s freshness. Data quality (lesson 61) is the Wall framework. Incidents (lesson 62) and on-call (lesson 63) are the human layer that catches what the automated layers miss.

The patterns transfer. A team running ten dbt models against a small warehouse is doing the same thing Airbnb is doing at petabyte scale. Constants differ. Shapes do not.

Module 8 closes here

Module 8 walked through the operational practices that turn a data platform from a collection of pipelines into a system a team can run. Module 9 opens with cost optimisation. The connection is direct: a reliable but expensive platform will eventually be cut, and a cheap but unreliable platform costs the business more than it saves. Module 9 covers the levers a mature team has to make the platform sustainable on both axes: storage layout, compute right-sizing, query optimisation, FinOps, and the architectural decisions that compound across years of cloud bills.

Citations and further reading

Maxime Beauchemin, “Airflow: a workflow management platform”, Airbnb Engineering blog, June 2015, https://medium.com/airbnb-engineering/airflow-a-workflow-management-platform-46318b977fd8 (retrieved 2026-05-01). The original announcement of Airflow.
Apache Airflow project, https://airflow.apache.org/ (retrieved 2026-05-01). The current home of the project, including documentation and the Datasets feature relevant to lesson 58.
Airbnb Engineering, “How Airbnb Achieved Metric Consistency at Scale”, https://medium.com/airbnb-engineering/how-airbnb-achieved-metric-consistency-at-scale-f23cc53dea70 (retrieved 2026-05-01). The Minerva description, with follow-on posts on the experimentation platform integration.
Chris Williams et al., “Democratizing Data at Airbnb”, Airbnb Engineering blog, May 2017, https://medium.com/airbnb-engineering/democratizing-data-at-airbnb-852d76c51770 (retrieved 2026-05-01). The Dataportal description.
Airbnb Engineering, “Data Quality at Airbnb”, https://medium.com/airbnb-engineering/data-quality-at-airbnb-e5d5a44f7b5b (retrieved 2026-05-01). The data-quality framework description.
Chetan Sharma, “Scaling Knowledge at Airbnb”, Airbnb Engineering blog, March 2016, https://medium.com/airbnb-engineering/scaling-knowledge-at-airbnb-875d73eff091 (retrieved 2026-05-01). Knowledge Repo announcement and motivation.
Airbnb, Knowledge Repo on GitHub, https://github.com/airbnb/knowledge-repo (retrieved 2026-05-01). The open-source project.
Airbnb Engineering blog, https://medium.com/airbnb-engineering (retrieved 2026-05-01). The broader collection of public posts that the summary above draws from.