The Best of Both Worlds: Unlocking the Power of Apache Beam with Apache Flink


May 27, 10:00 AM PDT
  • Virtual
  • 210 RSVP
Description
Speaker
Welcome to the session 4 of the Beam Learning Months!

Apache Beam is a framework for writing stream and batch processing pipelines using multiple languages such as Java, Python, SQL, or Go. Apache Beam does not come with an execution engine of its own. Instead, it defers the execution to its Runners which translate Beam pipelines for any supported execution engine. Thus, users have complete control over the language and the execution engine they use, without having to rewrite their code.
In this talk, we will look at running Apache Beam pipelines with Apache Flink. We will explain the concepts behind Apache Beams portability framework for multi-language support, and then show how to get started running Java, Python, and SQL pipelines.

This is the 4th talk in the series. Do not forget to sign up to the other sessions on Beam learning months:

  • May 6 - Interactive Introduction to Apache Beam. Session 1
  • May 13 - Best practices towards a production-ready pipeline. Session 2
  • May 20 - Introduction to the Spark Runner. Session 3
  • Jun 3 - Feature Powered by Apache Beam – Beyond Lambda. Session 5
  • All the resources used during the presentation and links to the recording will be posted here: https://github.com/aijamalnk/beam-learning-month

    Maximilian Michels

    a software engineer and consultant for open-source data processing systems. Max is a PMC member of Apache Beam and Apache Flink and a member at the Apache Software Foundation. Max is one of the organizers of beamsummit.org which aims to spread the knowledge around the Apache Beam ecosystem.
    The event ended.
    Watch Recording
    *Recordings hosted on Youtube, click the link will open the Youtube page.
    Contact Organizer