Database File Format Optimization: Per Column Dictionary

摘要

At Mixpanel, our proprietary multi-tenant columnar store database, known as ARB, serves as the backbone of our analytics platform. Following the ingestion of customer events, these data sets undergo a compaction process to create columnar files, which are then uploaded to Google Cloud Storage (GCS). In this blog post, we will delve into the recent updates we’ve implemented in this columnar file format. These updates enhance our query servers’ ability to accommodate more ARB files in their local SSDs, all without actually reducing file sizes. In some instances, these changes may even result in larger files.

欢迎在评论区写下你对这篇文章的看法。

评论

首页 - Wiki
Copyright © 2011-2024 iteam. Current version is 2.125.1. UTC+08:00, 2024-05-18 16:59
浙ICP备14020137号-1 $访客地图$