Snakes on a Plane: Interactive Data Exploration with PyFlink and Zeppelin Notebooks

A presentation at ODSC Europe in September 2020 in by Marta Paes

Slide 1

Slide 1

© 2020 Ververica

Slide 2

Slide 2

2

Slide 3

Slide 3

open source distributed stateful stream processing Flink Runtime 3

Slide 4

Slide 4

open source distributed stateful stream processing High Performance Fault Tolerance Stateful Processing Flink Runtime 4 Flexible APIs

Slide 5

Slide 5

Streaming Analytics & ML Stateful Stream Processing Flink Runtime 5 Event-Driven Applications

Slide 6

Slide 6

streams state Streaming Analytics & ML Stateful Stream Processing Flink Runtime 6 time Event-Driven Applications

Slide 7

Slide 7

streams state ● ● ● 7 performance reliability time

Slide 8

Slide 8

Streaming Analytics & ML Stateful Stream Processing Flink Runtime 8 Event-Driven Applications

Slide 9

Slide 9

● ● ● 9 developer speed autonomy

Slide 10

Slide 10

10

Slide 11

Slide 11

11

Slide 12

Slide 12

Mature fast 12 intuitive

Slide 13

Slide 13

1995 2008 Mature 2003 2015 2001 13 fast intuitive

Slide 14

Slide 14

1995 2008 Mature 2003 2015 fast intuitive 2001 restricted fits in memory single core 14

Slide 15

Slide 15

15

Slide 16

Slide 16

16

Slide 17

Slide 17

17

Slide 18

Slide 18

18

Slide 19

Slide 19

Expose 19

Slide 20

Slide 20

Distribute 20 scale

Slide 21

Slide 21

Double 11 / Singles Day 21

Slide 22

Slide 22

● ● ● 22

Slide 23

Slide 23

● ● ● Execution Streaming Batch UDF Support 23 Python UDF Pandas UDF +UDAF (WIP) +UDAF (WIP)

Slide 24

Slide 24

● ● ● Execution Streaming Native Connectors Batch FileSystems Apache Kafka ML Library (WIP) FLIP-39 Notebooks UDF Support Kinesis Python UDF Pandas UDF +UDAF (WIP) +UDAF (WIP) HBase JDBC Elasticsearch

  • 24 Formats Apache Zeppelin

Slide 25

Slide 25

● ● ● Execution Streaming Native Connectors Batch FileSystems Apache Kafka ML Library (WIP) FLIP-39 Notebooks UDF Support Kinesis Python UDF Pandas UDF +UDAF (WIP) +UDAF (WIP) HBase JDBC Elasticsearch

  • 25 Formats Apache Zeppelin

Slide 26

Slide 26

26

Slide 27

Slide 27

… Advantages ● ● ● ● ● 27

Slide 28

Slide 28

DEMO 28

Slide 29

Slide 29

DEMO 29

Slide 30

Slide 30

DEMO 30

Slide 31

Slide 31

DEMO 31

Slide 32

Slide 32

DEMO 32

Slide 33

Slide 33

DEMO 33

Slide 34

Slide 34

34

Slide 35

Slide 35

© 2020 Ververica