Streaming SQL in Data Mesh

出处：netflixtechblog.com

存档：存档

译文：中文

摘要

Data powers much of what we do at Netflix. On the Data Platform team, we build the infrastructure used across the company to process data at scale.

In our last blog post, we introduced “Data Mesh” — A Data Movement and Processing Platform. When a user wants to leverage Data Mesh to move and transform data, they start by creating a new Data Mesh pipeline. The pipeline is composed of individual “Processors” that are connected by Kafka topics. The Processors themselves are implemented as Flink jobs that use the DataStream API.

Since then, we have seen many use cases (including Netflix Graph Search) adopt Data Mesh for stream processing. We were able to onboard many of these use cases by offering some commonly used Processors out of the box, such as Projection, Filtering, Unioning, and Field Renaming.

阅读原文

逅天蓝蓝于 2023-11-07 分享

14938

关联话题： #Netflix #SQL

欢迎在评论区写下你对这篇文章的看法。

Streaming SQL in Data Mesh

Streaming SQL in Data Mesh

摘要

评论

文库