7 Fun Things to do with MapReduce Chris Hillman – Teradata Data
![Page 1: 7 Fun Things to do with MapReduce Chris Hillman – Teradata Data](https://reader036.fdocuments.net/reader036/viewer/2022062412/5a4d1af67f8b9ab059981963/html5/thumbnails/1.jpg)
7 Fun Things to do with MapReduce
Chris Hillman – Teradata Data Scientist, @chillax7
![Page 2: 7 Fun Things to do with MapReduce Chris Hillman – Teradata Data](https://reader036.fdocuments.net/reader036/viewer/2022062412/5a4d1af67f8b9ab059981963/html5/thumbnails/2.jpg)
Agenda
Map Tasks
• Face Detection
• Character Recognition
• Speech to Text
Shuffling
• Mass Spectrometer processing
Reducers
• Text Mining
• Actual Mining
Cluster Building
![Page 3: 7 Fun Things to do with MapReduce Chris Hillman – Teradata Data](https://reader036.fdocuments.net/reader036/viewer/2022062412/5a4d1af67f8b9ab059981963/html5/thumbnails/3.jpg)
Face Detection in Images
Step 1. Get a good Open Source Library
Step 2. Check the Example Code
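The two steps above amount to wrapping a library call in a map-only task: each image is processed independently, so nothing needs to be shuffled or reduced. The sketch below shows that shape only. The slide does not name a library, so the detector is passed in as a plain callable; `fake_detector`, the `(x, y, width, height)` box layout, and the record format are all assumptions made for illustration.

```python
from typing import Callable, Iterable, Iterator, List, Tuple

# A bounding box as (x, y, width, height); this layout is an assumption.
Box = Tuple[int, int, int, int]
# Hypothetical detector signature: raw image bytes in, face boxes out.
Detector = Callable[[bytes], List[Box]]


def map_detect_faces(
    records: Iterable[Tuple[str, bytes]], detect: Detector
) -> Iterator[Tuple[str, Box]]:
    """Map-only task: emit one (image_id, box) pair per detected face.

    Each image is handled on its own, so no shuffle or reduce is needed.
    """
    for image_id, image_bytes in records:
        for box in detect(image_bytes):
            yield image_id, box


def fake_detector(image_bytes: bytes) -> List[Box]:
    """Stand-in for a real library call: 'finds' one face per 0xFF byte."""
    return [(i, 0, 10, 10) for i, b in enumerate(image_bytes) if b == 0xFF]


if __name__ == "__main__":
    images = [("a.jpg", b"\x00\xff\x00"), ("b.jpg", b"\x00\x00")]
    for row in map_detect_faces(images, fake_detector):
        print(row)
```

In practice `fake_detector` would be replaced by a thin wrapper around whichever open source library's example code you start from.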
![Page 4: 7 Fun Things to do with MapReduce Chris Hillman – Teradata Data](https://reader036.fdocuments.net/reader036/viewer/2022062412/5a4d1af67f8b9ab059981963/html5/thumbnails/4.jpg)
Character Recognition
More Complex Task than Face Detection
SELECT *
FROM RecognizeNumberPlate(
    ON anpr.vehiclelogs
    imagecol('recognizedobject')
);
![Page 5: 7 Fun Things to do with MapReduce Chris Hillman – Teradata Data](https://reader036.fdocuments.net/reader036/viewer/2022062412/5a4d1af67f8b9ab059981963/html5/thumbnails/5.jpg)
Speech to Text
Fed up with word count examples?
How about counting words in a recorded wav file?
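One way to picture this: the map task turns each recording into text and emits `(word, 1)` pairs, and the usual word count shuffle and reduce do the rest. The sketch below assumes the speech-to-text step is available as a callable; `fake_transcribe` is a hypothetical stand-in that simply decodes bytes, not a real recognizer, and the record layout is invented for the example.

```python
import re
from collections import defaultdict
from typing import Callable, Dict, Iterable, Iterator, List, Tuple


def map_words(
    recordings: Iterable[Tuple[str, bytes]], transcribe: Callable[[bytes], str]
) -> Iterator[Tuple[str, int]]:
    """Map: transcribe each recording, then emit (word, 1) for every word."""
    for _recording_id, audio in recordings:
        text = transcribe(audio)
        for word in re.findall(r"[a-z']+", text.lower()):
            yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by key."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(groups: Dict[str, List[int]]) -> Dict[str, int]:
    """Reduce: sum the ones for each word."""
    return {word: sum(ones) for word, ones in groups.items()}


def fake_transcribe(audio: bytes) -> str:
    """Hypothetical stand-in for a speech-to-text engine."""
    return audio.decode("utf-8")


if __name__ == "__main__":
    recordings = [("r1", b"the cat sat"), ("r2", b"The cat")]
    print(reduce_counts(shuffle(map_words(recordings, fake_transcribe))))
```

The expensive part, transcription, sits entirely in the map step, which is why the agenda files this under map tasks.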
![Page 6: 7 Fun Things to do with MapReduce Chris Hillman – Teradata Data](https://reader036.fdocuments.net/reader036/viewer/2022062412/5a4d1af67f8b9ab059981963/html5/thumbnails/6.jpg)
Proteomics
Mass Spectrometers
Create a lot of data…
In XML format…
It’s nasty to work with
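A common first move with this kind of data is to flatten the XML into keyed rows, so that everything after it can group by scan. The sketch below does that with the standard library. The `<run>/<scan>/<peak>` layout in `SAMPLE` is a deliberately simplified, hypothetical schema; real instrument formats are more involved, and the slide does not say which one was used.

```python
import xml.etree.ElementTree as ET
from typing import Dict, Iterable, Iterator, Tuple

# Simplified, hypothetical schema used only for this illustration.
SAMPLE = """
<run>
  <scan id="1">
    <peak mz="100.5" intensity="20.0"/>
    <peak mz="200.25" intensity="75.0"/>
  </scan>
  <scan id="2">
    <peak mz="150.0" intensity="5.0"/>
  </scan>
</run>
"""

Peak = Tuple[float, float]  # (m/z, intensity)


def map_peaks(xml_text: str) -> Iterator[Tuple[int, Peak]]:
    """Map: flatten the XML into (scan_id, (mz, intensity)) pairs."""
    root = ET.fromstring(xml_text)
    for scan in root.iter("scan"):
        scan_id = int(scan.get("id"))
        for peak in scan.iter("peak"):
            yield scan_id, (float(peak.get("mz")), float(peak.get("intensity")))


def reduce_base_peak(pairs: Iterable[Tuple[int, Peak]]) -> Dict[int, Peak]:
    """Reduce: keep the most intense peak seen for each scan."""
    best: Dict[int, Peak] = {}
    for scan_id, (mz, intensity) in pairs:
        if scan_id not in best or intensity > best[scan_id][1]:
            best[scan_id] = (mz, intensity)
    return best


if __name__ == "__main__":
    print(reduce_base_peak(map_peaks(SAMPLE)))
```

Keying on the scan id is what makes the shuffle useful here: all peaks from one scan land on the same reducer.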
![Page 7: 7 Fun Things to do with MapReduce Chris Hillman – Teradata Data](https://reader036.fdocuments.net/reader036/viewer/2022062412/5a4d1af67f8b9ab059981963/html5/thumbnails/7.jpg)
Text Mining
First phases are map tasks
Text Extraction and Parsing
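As a rough illustration of that split, the sketch below does extraction and parsing per document in the map step, then leaves a single aggregate (document frequency) to the reducer. The choice of HTML as the input format, the tokenizer regex, and document frequency as the statistic are all assumptions; the slide does not specify any of them. Note that this simple extractor also keeps text inside `<script>` and `<style>` elements.

```python
import re
from collections import defaultdict
from html.parser import HTMLParser
from typing import Dict, Iterable, Iterator, List, Set, Tuple


class _TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML document."""

    def __init__(self) -> None:
        super().__init__()
        self.chunks: List[str] = []

    def handle_data(self, data: str) -> None:
        self.chunks.append(data)


def extract_text(html: str) -> str:
    """Text extraction: strip the markup and keep the text."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def map_terms(docs: Iterable[Tuple[str, str]]) -> Iterator[Tuple[str, str]]:
    """Map: extract and parse each document, emit (term, doc_id) once per term."""
    for doc_id, html in docs:
        tokens = re.findall(r"[a-z0-9]+", extract_text(html).lower())
        for term in set(tokens):
            yield term, doc_id


def reduce_document_frequency(pairs: Iterable[Tuple[str, str]]) -> Dict[str, int]:
    """Reduce: count how many distinct documents contain each term."""
    seen: Dict[str, Set[str]] = defaultdict(set)
    for term, doc_id in pairs:
        seen[term].add(doc_id)
    return {term: len(doc_ids) for term, doc_ids in seen.items()}


if __name__ == "__main__":
    docs = [
        ("d1", "<p>Map tasks parse <b>text</b></p>"),
        ("d2", "<p>Reducers count text</p>"),
    ]
    print(reduce_document_frequency(map_terms(docs)))
```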
![Page 8: 7 Fun Things to do with MapReduce Chris Hillman – Teradata Data](https://reader036.fdocuments.net/reader036/viewer/2022062412/5a4d1af67f8b9ab059981963/html5/thumbnails/8.jpg)
Actual Mining
Comparing Seismic surveys taken at different points in time??
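The slide only poses the question, so what follows is one possible reading rather than the method from the talk: key every measurement by its grid cell, let the shuffle bring both surveys' values for a cell together, and have the reducer compute the change. The grid-cell keys, the amplitude values, and the survey labels are all invented for the example.

```python
from collections import defaultdict
from itertools import chain
from typing import Dict, Iterable, Iterator, Tuple

Cell = Tuple[int, int]  # hypothetical (x, y) grid position


def map_survey(
    survey_id: str, cells: Dict[Cell, float]
) -> Iterator[Tuple[Cell, Tuple[str, float]]]:
    """Map: key each measurement by its grid cell."""
    for cell, amplitude in cells.items():
        yield cell, (survey_id, amplitude)


def reduce_change(
    pairs: Iterable[Tuple[Cell, Tuple[str, float]]], earlier: str, later: str
) -> Dict[Cell, float]:
    """Reduce: for cells present in both surveys, compute later minus earlier."""
    grouped: Dict[Cell, Dict[str, float]] = defaultdict(dict)
    for cell, (survey_id, amplitude) in pairs:
        grouped[cell][survey_id] = amplitude
    return {
        cell: by_survey[later] - by_survey[earlier]
        for cell, by_survey in grouped.items()
        if earlier in by_survey and later in by_survey
    }


if __name__ == "__main__":
    s2010 = {(0, 0): 1.0, (0, 1): 2.0}
    s2015 = {(0, 0): 1.5, (0, 1): 2.0, (1, 1): 4.0}
    pairs = chain(map_survey("2010", s2010), map_survey("2015", s2015))
    print(reduce_change(pairs, "2010", "2015"))
```

Cells that appear in only one survey are dropped here; whether that is the right choice depends on how the surveys were gridded.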
![Page 9: 7 Fun Things to do with MapReduce Chris Hillman – Teradata Data](https://reader036.fdocuments.net/reader036/viewer/2022062412/5a4d1af67f8b9ab059981963/html5/thumbnails/9.jpg)
Cluster Building
Why Build your own cluster?
• It’s fun
• You learn lots
• It gets you invited to parties
Physical or Virtual?
Physical – more fun, looks impressive, harder to build, maintain, use, cost of power
Virtual – performance? Easier to test, try different versions, configurations