Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venkatesh S

10
Data Infrastructur e on Hadoop Venkatesh S Architect, Hadoop Data

description

 

Transcript of Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venkatesh S

Page 1: Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venkatesh S

Data Infrastructure on HadoopVenkatesh SArchitect, Hadoop Data

Page 2: Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venkatesh S

Outline• Big Picture• Data Infrastructure –Now–Next Wave

• Questions

Page 3: Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venkatesh S

BIG Data is here.

Page 4: Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venkatesh S

Managing BIG Data

Page 5: Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venkatesh S

Ads Optimization

Content Optimization

Search Index

Machine Learning

(e.g. Spam filters)

RSS Feeds

Site thumbnails

Who is using this Data?

Page 6: Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venkatesh S

Next Wave!

Page 7: Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venkatesh S

Hadoop Analytics Warehouse

Page 8: Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venkatesh S

Utilization

Page 9: Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venkatesh S

Storage Efficiency

Page 10: Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venkatesh S

Questions?