Snakes on a Plane: Interactive Data Exploration with PyFlink and Zeppelin Notebooks

A presentation at ApacheCon by Marta Paes

Stream processing has fundamentally changed the way we build and think about data pipelines, but the technologies that unlock the value of this powerful paradigm haven’t always been friendly to non-Java/Scala developers. Apache Flink has recently introduced PyFlink, allowing developers to tap into streaming data in real time with the flexibility of Python and its wide ecosystem for data analytics and machine learning. In this talk, we will explore the basics of PyFlink and showcase how developers can use a simple tool like interactive notebooks to harness the full power of an advanced stream processor like Apache Flink.

Video

Resources

The following resources were mentioned during the presentation or provide useful additional information.