Airbnb shares their production-tested approach to migrating a large-scale metrics pipeline from StatsD to OpenTelemetry and Prometheus. The migration prioritizes collecting metrics at full scale first to expose bottlenecks before validating the new system. They reduce CPU overhead from 10% to under 1%, address high-cardinality performance issues with delta temporality, and implement streaming aggregation via vmagent to achieve an order of magnitude cost reduction.
Infrastructure
Moving a large-scale metrics pipeline from StatsD to OpenTelemetry / Prometheus
Airbnb reduced their metrics pipeline CPU overhead from 10% to <1% by migrating to OpenTelemetry/Prometheus with delta temporality and streaming aggregation, cutting costs by an order of magnitude.
Thursday, April 16, 2026 12:00 PM UTC2 MIN READSOURCE: Hacker NewsBY sys://pipeline
Tags
infrastructure