Containerizing Apache Hadoop Infrastructure at Uber

Introduction

As Uber’s business grew, we scaled our Apache Hadoop (referred to as ‘Hadoop’ in this article) deployment to 21,000+ hosts in 5 years to support a variety of analytical and machine learning use cases. We built a team with varied expertise to address the challenges of running Hadoop on bare metal: host lifecycle management, deployment and automation, Hadoop core development, and customer-facing portals.

With the growing complexity and size of Hadoop infrastructure, it became increasingly difficult for the team to keep up with the various responsibilities around managing such a large fleet. Fleet-wide operations using scripts and tooling consumed a lot of engineering time. Bad hosts started piling up without timely repairs.

As we continued to maintain our own bare-metal deployment for Hadoop, the rest of the company made significant progress in the microservices world. Solutions for container orchestration, host lifecycle management, service mesh, and security laid the foundations, making management of microservices far more efficient and less cumbersome.

In 2019, we started a journey to re-architect the Hadoop deployment stack. Two years later, over 60% of Hadoop runs in Docker containers, bringing major operational benefits to the team. As a result of the initiative, the team handed off many of its responsibilities to other infrastructure teams and was able to focus more on core Hadoop development.

Figure 1: Team Responsibilities Shift

This article provides a summary of problems we faced, and how we solved them ...
