Automated Incident Management Through Slack

How Airbnb automates incident management in a world of complex, rapidly evolving ensembles of microservices.

Vlad Vassiliouk

Incidents are unforeseeable events that disrupt normal business operations and are inevitable in complex systems that must be up and running 24/7. This is why it’s important to prepare and to train people to handle incidents in a timely and organized manner. Although each incident is unique, we follow the same procedure for detection, escalation, management, and resolution of incidents.

At Airbnb, we use a service-oriented infrastructure made up of many interconnected services managed by small teams. Quickly figuring out which service is in trouble and whom to page is paramount to timely incident resolution. We found that our teams spent a lot of time switching between applications such as Slack, PagerDuty, and Jira to raise an incident, page responders, and provide context. To resolve incidents quickly, we developed an incident management bot, a centralized automation tool for incident management.

Incident Management Slack bot

Our goal was to centralize incident management in Slack. Everyone at Airbnb is familiar with and has access to Slack, and it’s easy to bring people and resources together in an incident channel. In addition, the incident channel acts as a timeline of events, which makes putting together a postmortem report easy.

Our requirements were as follows:
