Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

13
INTERACTIVE VISUALIZATION OF STREAMING DATA POWERED BY SPARK

Transcript of Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Page 1: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

INTERACTIVE VISUALIZATION OF STREAMING DATA POWERED BY SPARK

Page 2: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Streaming @Zoomdata

Visualizations react to new data delivered

Users start, stop, pause the stream

Users select a rolling window or pin a start time to capture cumulative metrics

Page 3: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Drivers for Streaming Data

Data Freshness Time to Analytic Business Context

Page 4: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Challenges• Time

• Frequency

• Retention

• Synchronization

• Order

• Updates

Page 5: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Addressing streaming @Zoomdata

Historical Revised

Receive Data JMS Kafka

Manipulate Stream Single JVM in Memory Spark Streaming

Hold Data in Buffer MongoDB Pluggable

Interact with Data Custom Code Pluggable

Page 6: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Technology Cast• The Stream - Kafka, Kinesis, JMS• Processing Fabric - Spark Streaming• Landing Area - MemSQL, Solr, Kudu,

Others

Page 7: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

How it looks

Page 8: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

With the rest of the app

Page 9: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Scale Out

Page 10: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Demo• Twitter Producer• Spark Streaming• MemSQL & Solr

Sinks

Page 11: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Benefits• Contextual Expressiveness with Streaming

Data• Independent scalability (scale-up, scale-

around)• Expressiveness powered by Spark -- using

Windowing (dataframe API with stream)• DR COOP, other Data management concerns

Page 12: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Future Work• Cross stream synchronization & fusion

• On-demand scale out and resource management via Mesos

• Schema evolution

• More extensible landing strategies

Page 13: Interactive Visualization of Streaming Data Powered by Spark by Ruhollah Farchtchi

Thanks