Spark Seattle meetup - Breaking ETL barrier with Spark Streaming
Massive Streaming Analytics with Spark Streaming
16
Mattia Bertorello
-
Upload
paolo-platter -
Category
Software
-
view
433 -
download
1
Transcript of Massive Streaming Analytics with Spark Streaming
Why streaming matters
DataReal Time Processing
FASTER REACTIONS MORE PROFITS
Business Reaction
Streaming BigData Workflow
Why prediction?
• Rule based categorization and clustering is obsolete
• Pattern discovery
• Adaptation to fast changing data
• Smart thinking: no dummies
• Prediction is more valuable
Card transaction analysis
PAN CIFRATO | AMOUNT | DESCRIPTION | TIMESTAMP
Classificazione delle transazioni
online/offline
PAN CIFRATO | AMOUNT | DESCRIPTION | TIMESTAMP | ISONLINE
fraud detection algorithm
SQL aggregation
Generazione di allarmi in tempo reale