Big data

13
Big Data

Transcript of Big data

Page 1: Big data

Big Data

Page 2: Big data

What is Big data?

Big Data refers to the massive amounts of data that collect over time that are difficult to analyze and handle using common database management tools.

The data are analyzed for marketing trends in business as well as in the fields of manufacturing, medicine and science.

The types of data include business transactions, e-mail messages, photos, surveillance videos, activity logs and unstructured text from blogs and social media, as well as the huge amounts of data that can be collected from sensors of all varieties

Page 3: Big data
Page 4: Big data

Who's Generating Big Data?

Social media and networks(all of us are generating data)

Scientific instruments(collecting all sorts of data)

Mobile devices (tracking all objects all the time)

Sensor technology and networks(measuring all kinds of data)

Most analysts and practitioners currently refer to data sets from 30-50 terabytes(1000 gigabytes per terabyte) to multiple petabytes (1000 terabytes per petabyte) as big data.

Page 5: Big data

Big data: 3V's

Volume:The massive scale and growth of unstructured data outstrips traditional storage and analytical solutions

Velocity:Data is generated in real time, with demands for usable information to be served up immediately

Variety: Data is getting generated in the form of relational data, text data, semi structured data ,Graph data etc.

Page 6: Big data

Examples of Big Data Projects

Consumer product companies and retail organizations are monitoring social media like Facebook and Twitter to get an unprecedented view into customer behavior, preferences, and product perception.

Manufacturers are monitoring minute vibration data from their equipment, which changes slightly as it wears down, to predict the optimal time to replace or maintain. Replacing it too soon wastes money; replacing it too late triggers an expensive work stoppage

Advertising and marketing agencies are tracking social media to understand responsiveness to campaigns, promotions, and other advertising mediums.

Page 7: Big data

- - one of largest Destinations on the web

80% of the U.S.Internet population uses Yahoo!

Global network of content,commerce ,media ,search and access products.

100+ properties including mail ,TV, news ,shopping ,finance,autos ,travels,games ,movies, healths ,etc.

25+ terabytes of data collected each day Representing 1000's of cataloged consumer

behaviours

Page 8: Big data

Yahoo!Big Data-A league of its own

Grand challenge problems of data processing

Travel,Credit card processing ,Stock exchange ,Retail,Internet

Y!Data challenge exceeds others by 2 orders of magnitude

Page 9: Big data

Behavioral Targeting(BT)

Page 10: Big data

Yahoo!User DNA

On a per consumer basis: maintain a behavioral/interests profile andprofitability (user value and LTV) metrics

Page 11: Big data
Page 12: Big data

Row 1 Row 2 Row 3 Row 40

2

4

6

8

10

12

Column 1

Column 2

Column 3

Page 13: Big data

Thank you