从监控到可观察性:我们通往云原生平台的超马之旅
Managing a global corporate network at Uber’s scale can feel a bit like running an ultra-marathon. There are long stretches of smooth sailing, but you’re always preparing for the unexpected mountain pass or sudden change in weather. For years, our engineering teams have navigated this terrain with a traditional, monolithic monitoring system. Frankly, it felt like running in heavy hiking boots—sturdy, but slow, inflexible, and exhausting to scale up any hill.
在 Uber 的规模下管理全球企业网络,感觉有点像在进行超马拉松。虽然有长时间的顺利航行,但你总是在为意外的山口或突如其来的天气变化做准备。多年来,我们的工程团队一直在用传统的单体监控系统来应对这种环境。坦率地说,这就像穿着沉重的登山靴——结实,但缓慢、缺乏灵活性,并且在攀登任何山丘时都令人疲惫。
We knew we needed to switch to a modern pair of carbon-fiber running shoes. This meant a complete overhaul: a journey to replace our legacy system with a cloud-native observability platform built for speed, flexibility, and endurance on an open-source stack.
我们知道需要换一双现代的碳纤维跑鞋。这意味着彻底改革:一场用云原生可观察性平台替换我们遗留系统的旅程,该平台旨在速度、灵活性和耐久性上基于开源技术栈。
Before diving deeper, it’s important to clarify where this system operates and what we wanted to achieve.
在深入探讨之前,重要的是要澄清该系统的运行范围以及我们想要实现的目标。
The CorpNet Observability Platform focuses exclusively on Uber’s corporate network—the infrastructure that connects offices, data centers, cloud environments, and internal services.
CorpNet可观察性平台专注于Uber的企业网络——连接办公室、数据中心、云环境和内部服务的基础设施。
It’s not a production telemetry platform; instead, it monitors and analyzes:
这不是一个生产遥测平台;相反,它监控和分析:
- Network and infrastructure devices like switches, routers, PDUs, and IoT sensors
- 网络和基础设施设备,如交换机、路由器、PDU和物联网传感器
- Connectivity, latency, and device health across Uber’s internal regions
- Uber内部区域的连接性、延迟和设备健康
- Operational data flows supporting enterprise networking and internal applications
- 支持企业网络和内部应用的操作数据流
The mission is simple: make Uber’s internal network as observable, reliable, and automated as the systems it supports.
使命很简单:使Uber的内部网络与其支持的系统一样可观察、可靠和自动化。
Our vision was to build a new system on the pillars of data quality, scalability, and actionable data. We chose a foundation of best-in-class open-source tools.
我们的愿景是...