从供应商到先锋:Airbnb 在 observability 所有权方面的来之不易的经验教训

How a complex, large-scale migration to an in-house observability platform led to superior tooling, consistent data, and a fundamental reset of the developer experience.

一场复杂的大规模迁移到内部 observability 平台如何带来了优越的工具、一致的数据以及开发者体验的根本重置。

By: Callum Jones, Rong Hu

作者: Callum JonesRong Hu

Observability — the function of providing visibility into the performance and reliability of applications using metrics, logs and traces — is one of the most important tools of the Infrastructure group at any company. Without a reliable, cost-effective, and user-friendly observability platform, you limit an organization’s ability to empower engineers to assess, support, and improve the reliability of their application.

Observability — 使用 metrics、logs 和 traces 提供对应用程序性能和可靠性的可见性的功能 — 是任何公司 Infrastructure 组最重要的工具之一。没有一个可靠、经济有效且用户友好的 observability 平台,你就会限制组织赋予工程师评估、支持和改进其应用程序可靠性的能力。

Like many of its peers, Airbnb started out by outsourcing its observability needs to vendors. But, as the company matured, our needs diverged from the typical vendor’s incentives. Vendors charge by the amount of data ingested, so Airbnb’s costs were rising, but more data does not automatically lead to faster insights or reduced mean time to detect (MTTD) or repair (MTTR). Also, by being out of the feedback loop of how observability data is consumed, our ability to enhance the monitoring workflows of our customers or pursue cost optimizations on observability spend was severely hampered.

像许多同类公司一样,Airbnb 最初将可观测性需求外包给供应商。但是,随着公司的发展,我们的需求与典型供应商的激励机制产生了分歧。供应商按摄入的数据量收费,因此 Airbnb 的成本不断上升,但更多数据并不一定能带来更快的洞察或减少平均检测时间 (MTTD) 或修复时间 (MTTR)。此外,由于脱离了可观测性数据消耗方式的反馈循环,我们提升客户监控工作流或追求可观测性支出成本优化的能力受到了严重限制。

To meet our objectives, Airbnb embarked on a complex migration, overhauling every part of our metrics infrastructure. This challenging journey involved replacing our instrumentation, collection, storage, and visualization systems as we transitioned from a third-party, vendor-managed observability platform to a custom, in-house solution built on open-source technology based on Prome...

开通本站会员,查看完整译文。

首页 - Wiki
Copyright © 2011-2026 iteam. Current version is 2.155.0. UTC+08:00, 2026-03-19 20:14
浙ICP备14020137号-1 $访客地图$