Posted on 08-Aug-2015
Volume
Volume refers to the huge amount of data being generated every minute.
90% of the data we have now was created in just the past 2 years.
Cisco estimates internet traffic at 4.8 ZB per year.
3 billion people were expected to be online by 2015.
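To put the 4.8 ZB per year figure in perspective, it can be converted to an average per-second rate. The sketch below is illustrative arithmetic only; it assumes decimal (SI) units and a 365-day year, and the function name is made up for this example.

```python
# Convert an annual traffic volume in zettabytes to an average per-second rate.
ZB = 10**21  # bytes in one zettabyte (decimal SI definition, an assumption)
TB = 10**12  # bytes in one terabyte (decimal SI definition)
SECONDS_PER_YEAR = 365 * 24 * 60 * 60  # 31,536,000 seconds; leap years ignored


def average_bytes_per_second(zettabytes_per_year: float) -> float:
    """Return the average number of bytes per second for a yearly volume."""
    return zettabytes_per_year * ZB / SECONDS_PER_YEAR


rate = average_bytes_per_second(4.8)
print(f"{rate / TB:.1f} TB per second on average")
```

Under these assumptions the yearly figure works out to roughly 152 TB every second.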
Velocity
Velocity refers to the speed at which new data is generated and moves around.
It includes real-time systems such as online banking.
Such systems need low response times.
Variety
Variety refers to the various data types we can now use.
Earlier, the focus was on neat, structured data kept in tables in an RDBMS.
80% of the data available now is unstructured.
It comes in the form of text, video, audio, and pictures.
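The difference between structured and unstructured data can be illustrated with a small sketch. The table, the customer name, the message text, and the regular expression are all invented for this example; real unstructured data usually needs far more robust processing than a single pattern.

```python
import re
import sqlite3

# Structured data: a fixed schema lets us query a field directly.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE payments (customer TEXT, amount REAL)")
conn.execute("INSERT INTO payments VALUES (?, ?)", ("alice", 120.50))
structured_total = conn.execute(
    "SELECT SUM(amount) FROM payments WHERE customer = ?", ("alice",)
).fetchone()[0]

# Unstructured data: the same fact is buried in free text and has to be
# extracted before it can be analyzed.
message = "Hi, this is Alice. I paid 120.50 yesterday but have no receipt yet."
match = re.search(r"paid\s+(\d+(?:\.\d+)?)", message)
extracted_amount = float(match.group(1)) if match else None

print(structured_total, extracted_amount)
```

The query against the table needs no interpretation, while the text message only yields the amount because the pattern happens to match its wording.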
Why Big Data
– Increase of storage capacities
– Increase of processing power
– Availability of data
– 90% of the data in the world today has been created in the last two years alone
Big Data Analytics
• Examining large amount of data
• Appropriate information
• Competitive advantage
• Better business decisions
• Effective marketing, customer satisfaction, increased revenue
Applications for Big Data Analytics
• Finance
• Smarter Healthcare
• Multi-channel sales
• Telecom
• Manufacturing
• Traffic Control
• Trading Analytics
• Log Analysis
• Search Quality
NoSQL: non-relational, or at least non-SQL, database solutions such as HBase (also part of the Hadoop ecosystem), Cassandra, MongoDB, Riak, CouchDB, and many others.
Hadoop: an ecosystem of software packages, including MapReduce, HDFS, and many others.
What is Hadoop?
• An open source, Java-based programming framework
• Processes and stores large data sets
• Runs in a distributed computing environment
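The MapReduce model that Hadoop includes can be sketched with a word count, the usual introductory example. This is a single-machine illustration in plain Python, not the Hadoop API: the function names are hypothetical, and a real cluster would run the map and reduce steps in parallel across many nodes with the input stored in HDFS.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on a list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


lines = ["big data needs big storage", "hadoop stores big data"]
counts = word_count(lines)
print(counts["big"], counts["data"], counts["hadoop"])
```

Because each line is mapped independently and each key is reduced independently, the same logic can be split across machines, which is what makes the model suitable for large data sets.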