Intro to Hadoop
-
Upload
quang-nguyen -
Category
Technology
-
view
346 -
download
2
description
Transcript of Intro to Hadoop
![Page 1: Intro to Hadoop](https://reader033.fdocuments.net/reader033/viewer/2022051817/5480cb445806b5d3108b45bd/html5/thumbnails/1.jpg)
INTRODUCTION TO HADOOPPresented by
Quang Nguyen & Hoang Le
![Page 2: Intro to Hadoop](https://reader033.fdocuments.net/reader033/viewer/2022051817/5480cb445806b5d3108b45bd/html5/thumbnails/2.jpg)
CONTENT
• Introduction to Hadoop
• Scalability on AWS / Azure
• Reality
• First 100-hour Award
• Second 100-hour Overview
• Career Path
• Q & A
![Page 3: Intro to Hadoop](https://reader033.fdocuments.net/reader033/viewer/2022051817/5480cb445806b5d3108b45bd/html5/thumbnails/3.jpg)
INTRODUCTION TO HADOOP
![Page 4: Intro to Hadoop](https://reader033.fdocuments.net/reader033/viewer/2022051817/5480cb445806b5d3108b45bd/html5/thumbnails/4.jpg)
SCALABILITY ON AWS / AZURE
![Page 5: Intro to Hadoop](https://reader033.fdocuments.net/reader033/viewer/2022051817/5480cb445806b5d3108b45bd/html5/thumbnails/5.jpg)
DISTRIBUTED SYSTEMS
MPI vs Hadoop
![Page 6: Intro to Hadoop](https://reader033.fdocuments.net/reader033/viewer/2022051817/5480cb445806b5d3108b45bd/html5/thumbnails/6.jpg)
DRIVERIGHT PROJECT
• Target: collect driving experience by mobile application for analyzing driving habits
• Purposes:
• Improve driving ability
• Supply driver’s information to
needed companies
• Market: China
• Scale-out Problem:
• Millions of users with rich data resources (records in milliseconds)
• MySQL database is not efficient for Big Data Analytics
![Page 7: Intro to Hadoop](https://reader033.fdocuments.net/reader033/viewer/2022051817/5480cb445806b5d3108b45bd/html5/thumbnails/7.jpg)
R - Python
Tableau
Mobile Apps
Mobile Platform
DRIVE-RIGHT
ARCHITECTURE
![Page 8: Intro to Hadoop](https://reader033.fdocuments.net/reader033/viewer/2022051817/5480cb445806b5d3108b45bd/html5/thumbnails/8.jpg)
REALITY
![Page 9: Intro to Hadoop](https://reader033.fdocuments.net/reader033/viewer/2022051817/5480cb445806b5d3108b45bd/html5/thumbnails/9.jpg)
FIRST 100-HOUR AWARD
![Page 10: Intro to Hadoop](https://reader033.fdocuments.net/reader033/viewer/2022051817/5480cb445806b5d3108b45bd/html5/thumbnails/10.jpg)
SECOND 100-HOUR OVERVIEW
#101 – Java & IntelliJ setup#102 – Java programming part 1#103 – Java programming part 2#104 – Java programming part 3#105 – Java programming part 4#106 – Single-node Hadoop#107 – Multi-node Hadoop#108 – Map Reduce basis
#109 – Intro to Map Reduce programming
#110 – Map Reduce Design Pattern part 1
#111 – Map Reduce Design Pattern part 2
#112 – Apache Mahout 1 – Setting Up
#113 – Apache Mahout 2 –Building Recommenders
#114 – Apache Mahout 3 –Building Clustering Systems
#115 – Final Project II
![Page 11: Intro to Hadoop](https://reader033.fdocuments.net/reader033/viewer/2022051817/5480cb445806b5d3108b45bd/html5/thumbnails/11.jpg)
PLAN FOR THE YEAR
Internship Program
Club
Sponsorship
Capable & young
data scientists
$$$
Effort
Smart students
![Page 12: Intro to Hadoop](https://reader033.fdocuments.net/reader033/viewer/2022051817/5480cb445806b5d3108b45bd/html5/thumbnails/12.jpg)
Q & A