Eric Fan Insight Project Demo

download Eric Fan Insight Project Demo

If you can't read please download the document

Transcript of Eric Fan Insight Project Demo

SpentAtAGlance

By Eric FanInsight Data Engineering (Silicon Valley 2016)

Help Track Your Expense!

SpentAtAGlance

A budgeting tool (similar to mint.com) that summarizes your current months and recent credit card transaction history

How much have I spent this month?

A quick visualization of your acrued expenses before making your next purchase

SpentAtAGlance

Gather statistical data for the credit card company for marketing strategies

Total/average monthly transactions for all users

Front-End

For card holderhttp://spentglance.onlinehttp://ec2-52-41-11-131.us-west-2.compute.amazonaws.com

For card companyhttp://spentglance.online/card_companyhttp://ec2-52-41-11-131.us-west-2.compute.amazonaws.com/card_company

https://youtu.be/miDFxvX_4eQ

Data Pipeline

Producer.py

Camus

{"date":string,day_of_week: string,time: string,transaction_type:stringname:string,"amount":string}

50 GB of data

Data Pipeline Cost

Kafka1 x m4.large(spot)$0.04

Spark/HDFS4 x m4.xlarge(1 on-demand,3 spot)$0.239+3 * $0.06=$0.419

Cassandra4 x m4.large(spot)$0.06 x4=$0.24

Flask1 x m4.large(spot) 5000 w/s for the 4 nodes configuration

[1] T. Rabl et al. (2012), Solving Big Data Challenges for Enterprise Application Performance Management