What do Open Source, Java, Data Pipelines, and data from Oslo Bysykkel have in common?

Feb 20, 2019, 4:00 – 6:00 PM

Key Themes

Cloud, Data

About this event

Using Open Source Tech to Swap Out Components of Your Data Pipeline

A few years ago, moving data between applications and datastores meant relying on expensive, inflexible monolithic stacks from large software vendors. Now, with frameworks such as Apache Beam and Apache Airflow, we can schedule and run data processing jobs for both streaming and batch with the same underlying code. This presentation demonstrates how these frameworks can glue your applications together, and shows how a data pipeline running from Apache Kafka through Hadoop Flink to Hive can be moved to Pub/Sub, Dataflow, and BigQuery by changing a few lines of Java in the Apache Beam code.
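
The idea of swapping out pipeline components can be sketched in plain Java. This is an illustration under stated assumptions, not Apache Beam's API and not the speaker's code: the class `SwappablePipeline`, the methods `normalize` and `run`, and the sample station names are all hypothetical. The point is only that the transform step stays untouched while the source and sink are exchanged at the edges, which is the property the talk attributes to Beam pipelines.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.function.Consumer;
import java.util.function.Function;
import java.util.function.Supplier;

// Hypothetical sketch: plain Java, no Apache Beam dependency.
public class SwappablePipeline {

    // The transform step. It knows nothing about where records come from
    // or where they go, so it survives a change of source and sink.
    public static String normalize(String stationName) {
        return stationName.trim().toUpperCase(Locale.ROOT);
    }

    // Reads every record from the source, applies the transform,
    // and hands each result to the sink.
    public static <A, B> void run(Supplier<List<A>> source,
                                  Function<A, B> transform,
                                  Consumer<B> sink) {
        for (A record : source.get()) {
            sink.accept(transform.apply(record));
        }
    }

    public static void main(String[] args) {
        // Stand-in for the first setup (for example a Kafka topic and a Hive table).
        Supplier<List<String>> firstSource = () -> List.of(" majorstuen ", "aker brygge");
        List<String> firstSink = new ArrayList<>();
        run(firstSource, SwappablePipeline::normalize, firstSink::add);

        // Stand-in for the second setup (for example Pub/Sub and BigQuery).
        // Only these two lines differ; normalize() and run() are unchanged.
        Supplier<List<String>> secondSource = () -> List.of("toyen ", " sentrum");
        List<String> secondSink = new ArrayList<>();
        run(secondSource, SwappablePipeline::normalize, secondSink::add);

        System.out.println(firstSink);
        System.out.println(secondSink);
    }
}
```

In an actual Beam pipeline the source and sink would be I/O connectors and the execution engine would be chosen through pipeline options; the sketch above only mirrors that separation of concerns.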

Speaker
Rustam Mehmandarov
https://twitter.com/rmehmandarov

Passionate computer scientist. Leader of JavaZone. Frequent speaker. Java Champion.

Organizers

  • Rustam Mehmandarov

    Computas

    GDG Organizer

  • Leonard Sheng Sheng Lee

    Co-organiser