
Inside Apache Druid Storage and Query Engine

Apr 21 2021, 12:00 PM PDT

Virtual SF Big Analytics
120 RSVPs

This event is hosted by the SF Big Analytics Group: https://www.meetup.com/SF-Big-Analytics

Apache Druid is an open-source columnar database known for high performance at scale; its largest deployments comprise thousands of servers. But no matter the scale, high performance starts with good fundamentals. This talk will dive into those fundamentals by exploring the inner workings of a single data server. We’ll cover how Apache Druid stores data, what kinds of compression it uses, how it indexes data, how the storage engine is linked with the query processing engine, and how the system handles resource management and multithreading. Together, all these pieces enable Apache Druid to process billions of records per second on a single data server.
