话题 › 公司 › Netflix

公司：Netflix

关联话题: 奈飞、网飞

Netflix（/ˈnɛtflɪks/）（官方中文译名网飞，非官方中文译名奈飞）是起源于美国、在世界各地提供网络视频点播的OTT服务公司，并同时在美国经营单一费率邮寄影像光盘出租服务，后者是使用回邮信封寄送DVD和Blu-ray出租光盘至消费者指定的收件地址。公司由里德·哈斯廷斯和马克·兰多夫在1997年8月29日成立，总部位于加利福尼亚州的洛斯盖图，1999年开始推出订阅制的服务。2009年，Netflix已可提供超过10万部电影DVD，订阅者数超过1000万人。另一方面，截至2022年6月的数据，Netflix的流服务已经在全球拥有2.20亿个订阅用户，在美国的订户已达到7330万。其主要的竞争对手有Disney+、Hulu、HBO Max、Amazon Prime Video、YouTube Premium及Apple TV+等。

Netflix在多个排行榜上均榜上有名：2017年6月6日，《2017年BrandZ最具价值全球品牌100强》公布，Netflix名列第92位。2018年10月，《财富》未来公司50强排行榜发布，Netflix排名第八。2018年12月，世界品牌实验室编制的《2018世界品牌500强》揭晓，排名第88。在《财富》2018年世界500大排名261位，并连年增长。2019年10月，位列2019福布斯全球数字经济100强榜第46名。2019年10月，Interbrand发布的全球品牌百强榜排名65。2020年1月22日，名列2020年《财富》全球最受赞赏公司榜单第16位。2022年2月，按市值计算，Netflix为全球第二大的媒体娱乐公司。2019年，Netflix加入美国电影协会（MPA）。另外，Netflix也被部分媒体列为科技巨擘之一。

Streaming SQL in Data Mesh

Data powers much of what we do at Netflix. On the Data Platform team, we build the infrastructure used across the company to process data at scale.

In our last blog post, we introduced “Data Mesh” — A Data Movement and Processing Platform. When a user wants to leverage Data Mesh to move and transform data, they start by creating a new Data Mesh pipeline. The pipeline is composed of individual “Processors” that are connected by Kafka topics. The Processors themselves are implemented as Flink jobs that use the DataStream API.

Since then, we have seen many use cases (including Netflix Graph Search) adopt Data Mesh for stream processing. We were able to onboard many of these use cases by offering some commonly used Processors out of the box, such as Projection, Filtering, Unioning, and Field Renaming.

netflix技术

Zero Configuration Service Mesh with On-Demand Cluster Discovery

Netflix’s service mesh adoption: history, motivations, and how we worked with the Envoy community on a feature to streamline mesh adoption.

netflix技术

AVA Discovery View: Surfacing Authentic Moments

At Netflix, we have created millions of artwork to represent our titles. Each artwork tells a story about the title it represents. From our testing on promotional assets, we know which of these assets have performed well and which ones haven’t. Through this, our teams have developed an intuition of what visual and thematic artwork characteristics work well for what genres of titles. A piece of promotional artwork may resonate more in certain regions, for certain genres, or for fans of particular talent. The complexity of these factors makes it difficult to determine the best creative strategy for upcoming titles.

Our assets are often created by selecting static image frames directly from our source videos. To improve it, we decided to invest in creating a Media Understanding Platform, which enables us to extract meaningful insights from media that we can then surface in our creative tools. In this post, we will take a deeper look into one of these tools, AVA Discovery View.

netflix技术

Curbing Connection Churn in Zuul

When Zuul was designed and developed, there was an inherent assumption that connections were effectively free, given we weren’t using mutual TLS (mTLS). It’s built on top of Netty, using event loops for non-blocking execution of requests, one loop per core. To reduce contention among event loops, we created connection pools for each, keeping them completely independent. The result is that the entire request-response cycle happens on the same thread, significantly reducing context switching.

There is also a significant downside. It means that if each event loop has a connection pool that connects to every origin (our name for backend) server, there would be a multiplication of event loops by servers by Zuul instances. For example, a 16-core box connecting to an 800-server origin would have 12,800 connections. If the Zuul cluster has 100 instances, that’s 1,280,000 connections. That’s a significant amount and certainly more than is necessary relative to the traffic on most clusters.

As streaming has grown over the years, these numbers multiplied with bigger Zuul and origin clusters. More acutely, if a traffic spike occurs and Zuul instances scale up, it exponentially increases connections open to origins. Although this has been a known issue for a long time, it has never been a critical pain point until we moved large streaming applications to mTLS and our Envoy-based service mesh.

netflix技术

Detecting Scene Changes in Audiovisual Content

本文提出了两种基于场景边界检测的方法，利用场景文本、音频和视频等多种模态信息。第一种方法使用预训练的句子级嵌入和动态时间规整方法在场景文本与时间戳文本的对齐信息上确定场景边界；第二种方法使用已标注的场景变化数据训练双向GRU序列模型，并利用预训练的多模态镜头嵌入来丰富模型。实验结果表明，这些方法在视频摘要、内容检索、配音质量评估和视频编辑等任务上的效果与最先进方法相当甚至更好。作者还提出了将场景边界检测与场景分类和关键时刻识别等任务相结合，进一步提高多模态机器学习模型在长形式内容中的表现。

netflix技术

Migrating Critical Traffic At Scale with No Downtime — Part 2

Picture yourself enthralled by the latest episode of your beloved Netflix series, delighting in an uninterrupted, high-definition streaming experience. Behind these perfect moments of entertainment is a complex mechanism, with numerous gears and cogs working in harmony. But what happens when this machinery needs a transformation? This is where large-scale system migrations come into play. Our previous blog post presented replay traffic testing — a crucial instrument in our toolkit that allows us to implement these transformations with precision and reliability.

netflix技术

Migrating Netflix to GraphQL Safely

In 2022, a major change was made to Netflix’s iOS and Android applications. We migrated Netflix’s mobile apps to GraphQL with zero downtime, which involved a total overhaul from the client to the API layer.

Until recently, an internal API framework, Falcor, powered our mobile apps. They are now backed by Federated GraphQL, a distributed approach to APIs where domain teams can independently manage and own specific sections of the API.

Doing this safely for 100s of millions of customers without disruption is exceptionally challenging, especially considering the many dimensions of change involved. This blog post will share broadly-applicable techniques (beyond GraphQL) we used to perform this migration. The three strategies we will discuss today are AB Testing, Replay Testing, and Sticky Canaries.

netflix技术

Ensuring the Successful Launch of Ads on Netflix

In November 2022, we introduced a brand new tier — Basic with ads. This tier extended existing infrastructure by adding new backend components and a new remote call to our ads partner on the playback path. As we were gearing up for launch, we wanted to ensure it would go as smoothly as possible. To do this, we devised a novel way to simulate the projected traffic weeks ahead of launch by building upon the traffic migration framework described here. We used this simulation to help us surface problems of scale and validate our Ads algorithms.

netflix技术

Native Frame Rate Playback

This article talks about a novel HDMI technology and how it is used within the Netflix Application to improve a user’s experience.

netflix技术

ABAC on SpiceDB: Enabling Netflix’s Complex Identity Types

The authorization team at Netflix recently sponsored work to add Attribute Based Access Control (ABAC) support to AuthZed’s open source Google Zanzibar inspired authorization system, SpiceDB. Netflix required attribute support in SpiceDB to support core Netflix application identity constructs. This post discusses why Netflix wanted ABAC support in SpiceDB, how Netflix collaborated with AuthZed, the end result–SpiceDB Caveats, and how Netflix may leverage this new feature.

netflix技术

Debugging a FUSE deadlock in the Linux kernel

The Compute team at Netflix is charged with managing all AWS and containerized workloads at Netflix, including autoscaling, deployment of containers, issue remediation, etc. As part of this team, I work on fixing strange things that users report.

This particular issue involved a custom internal FUSE filesystem: ndrive. It had been festering for some time, but needed someone to sit down and look at it in anger. This blog post describes how I poked at /procto get a sense of what was going on, before posting the issue to the kernel mailing list and getting schooled on how the kernel’s wait code actually works!

netflix技术

Migrating Critical Traffic At Scale with No Downtime — Part 1

Hundreds of millions of customers tune into Netflix every day, expecting an uninterrupted and immersive streaming experience. Behind the scenes, a myriad of systems and services are involved in orchestrating the product experience. These backend systems are consistently being evolved and optimized to meet and exceed customer and product expectations.

When undertaking system migrations, one of the main challenges is establishing confidence and seamlessly transitioning the traffic to the upgraded architecture without adversely impacting the customer experience. This blog series will examine the tools, techniques, and strategies we have utilized to achieve this goal.

netflix技术

Improved Alerting with Atlas Streaming Eval

Netflix采用基于实时流式度量的告警方式，用于处理高维度/基数限制的时间序列数据库。这种方法可以支持更多的用例，提高系统的可靠性，避免误报和漏报。另外，流式评估可以放松高基数限制，监控并警报一些高基数用例，如从自由形式日志数据中派生的指标。Netflix还将其应用健康监控系统Telltale从轮询指标缓存转换为实时Atlas流式处理，以检测异常并计算相关性，减少故障恢复平均时间（MTTR）。此外，Netflix发现流式度量方法不受监视查询数量的限制，且支持更高的度量维度/基数，这为更精确的警报和更好的可观察性提供了机会。

netflix技术

Building a Media Understanding Platform for ML Innovations

The media understanding platform serves as an abstraction layer between Machine Learning (ML) algos and various applications.

netflix技术

NTS: Reliable Device Testing at Scale

At Netflix, we test hundreds of different device types every day, ranging from streaming sticks to smart TVs, to ensure that new version releases of the Netflix SDK continue to provide the exceptional Netflix experience that our customers expect. We also collaborate with our Partners to integrate the Netflix SDK onto their upcoming new devices, such as TVs and set top boxes. This program, known as Partner Certification, is particularly important for the business because device expansion historically has been crucial for new Netflix subscription acquisitions. The Netflix Test Studio (NTS) platform was created to support Netflix SDK testing and Partner Certification by providing a consistent automation solution for both Netflix and Partner developers to deploy and execute tests on “Netflix Ready” devices.

Over the years, both Netflix SDK testing and Partner Certification have gradually transitioned upstream towards a shift-left testing strategy. This requires the automation infrastructure to support large-scale CI, which NTS was not originally designed for. NTS 2.0 addresses this very limitation of NTS, as it has been built by taking the learnings from NTS 1.0 to re-architect the system into a platform that significantly improves reliable device testing at scale while maintaining the NTS user experience.

netflix技术

Data Reprocessing Pipeline in Asset Management Platform @Netflix

At Netflix, we built the asset management platform (AMP) as a centralized service to organize, store and discover the digital media assets created during the movie production. Studio applications use this service to store their media assets, which then goes through an asset cycle of schema validation, versioning, access control, sharing, triggering configured workflows like inspection, proxy generation etc. This platform has evolved from supporting studio applications to data science applications, machine-learning applications to discover the assets metadata, and build various data facts.

During this evolution, quite often we receive requests to update the existing assets metadata or add new metadata for the new features added. This pattern grows over time when we need to access and update the existing assets metadata. Hence we built the data pipeline that can be used to extract the existing assets metadata and process it specifically to each new use case. This framework allowed us to evolve and adapt the application to any unpredictable inevitable changes requested by our platform clients without any downtime. Production assets operations are performed in parallel with older data reprocessing without any service downtime. Some of the common supported data reprocessing use cases are listed below.

netflix技术

公司：Netflix的相关资料

公司：Netflix