Master Project Presentation
-
Upload
sayantani-manna -
Category
Documents
-
view
34 -
download
0
Transcript of Master Project Presentation
Problem Statement ● According to Royal Embassy of Saudi Arabia, 10
million pilgrims visited Mecca in 2011, . ● Amro & Nijem states 30,000 missing cases in 2011.
Mo.va.on ● Traditional search method. ● Inadequate means of identification – Plastic ID cards ● Delay in tracking lost Pilgrims
State of the art ● Splunk : Capture ,Index, correlate real time log data generating
report
● Infosphere (IBM): Stream Analytic platform.
● Amazon Kinesis: Real-time processing Serviceof streaming data
● h=p://en.wikipedia.org/wiki/List_of_web_analy.cs_soFware
Technology in our Project Apache Storm – Why?
● Multiple use cases ● Data types, size, velocity ● Mission critical data Fault-tolerance ● Time series / pattern analysis Reliability Ref:-‐h(p://storm.incubator.apache.org/
Processing, computation, etc Scalability
Technologies in our Project Apache KaMa
● A distributed publish-subscribe messaging system
Why Kafka… ● High throughput ● Persistent ● Distributed
http://baljeetsandhu.wordpress.com/page/2/
Technology Used (contd.) CASSANDRA-‐real >me opera>onal data store
• Optimal for time series data
• Near-linear scalable
• Low read/write latency
ref:-http://upload.wikimedia.org/wikipedia/commons/5/5e/Cassandra_logo.svg
INGEST PROCESS
VISUALIZE
ANALYZE
STORE
Sensor data
Sensor data
Sensor data
Streaming Data Processing
Conclusion ● A prototype of the tracking system on streaming data ● Real Time Analysis reduces tracking delay ● Potential to broaden the scope of software industry
in Tourism.