Are you tired of the hype around GenAI? Ready to dive into the latest trends in data infrastructure? Join us for an in-person event to connect with data infrastructure experts and gain insights into data streaming and lakehouse technologies from industry leaders at Databricks, StreamNative, and RisingWave.
Agenda:
- 6:00pm~6:30pm: Checkin, food and Networking
- 6:30pm~7:30pm: Tech talks and Q&A
- 7:30pm~8:00pm: Open discussion and mixer
Tech Talk: Ursa: Kafka-compatible data streaming on Lakehouse
Speaker: Sijie Guo (StreamNative)
Abstract: Ursa is a Kafka-compatible data streaming engine built on top of a lakehouse, enabling users to store their topics and associated schemas directly in lakehouse tables. Ursa utilizes the innovations that StreamNative has developed to evolve Pulsar's storage layer from a disk-based shared storage layer to an object storage-based tiered storage system and to integrate with the lakehouse ecosystem. The Ursa engine simplifies the integration between data streams and lakehouse tables, drastically reducing the complexity of using bespoke integrations. In this talk, we will dive deeper into the details of the Ursa engine and how it leverages the lakehouse as a storage backend.
Tech Talk: The Streaming Lakehouse Era: Is Kafka the New Data Lake?
Speaker: Yingjun Wu (RisingWave)
Abstract: Apache Kafka plays a pivotal role in the technology stack of numerous data-driven corporations. Widely perceived as a “repository for recent data,” many organizations use Kafka to hold recently ingested data for durations ranging from 7 days to a month before transferring it to data lakes. However, there is increasing evidence suggesting that data persists in Kafka for longer periods, indicating that Kafka itself is evolving into a new form of data lake. In this talk, I will discuss whether Kafka can be considered the new data lake and how we can build a streaming lakehouse using open-source technologies like Kafka, RisingWave, and Iceberg
Tech Talk: The (Open) Interface is Everything
Speaker: Jason Reid (Databricks)
Abstract: SQL may be the universal language of data, but the emergence of a number of prominent open source standards over the past 15 years has helped revolutionize the way that our society interacts with data. Apache Arrow, Apache Iceberg, Apache Kafka, Apache Parquet, and Apache Spark are just some of the projects that have fueled this transition. In this talk, we explore the power of open, standard interfaces by recounting the steps we have taken to this point in an effort to cast light into what lies ahead on this journey.
Venue:
Plug and Play Tech Center, 440 N Wolfe Rd · Sunnyvale, CA