Big data: Loading your data with flume and sqoop

Loading data in Hadoop 2 with SQOOP and Flume Christophe Marchal | Software Architect

description

While studying the Hortonworks stack, I created this 10-minute presentation. http://hortonworks.com

Transcript of Big data: Loading your data with flume and sqoop

Page 1

Loading data in Hadoop 2 with SQOOP and Flume

Christophe Marchal | Software Architect

Page 2

Problem to solve

Page 3

Hortonworks stack

Page 4

Batch Loading vs Stream Loading

Page 5

SQOOP

HCatalog

Page 6

SQOOP 1: Import

Page 7

SQOOP 1: Export

Page 8

SQOOP 2

Page 9

Flume

[Diagram labels: Web Server (x3), Agent (x4, each made of Source -> Channel -> Sink), HDFS]
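The Source -> Channel -> Sink chain in the diagram can be illustrated with a toy model. This is a minimal sketch, not Flume's API: the `Channel` class, `run_sink` function and the sample log lines are all hypothetical names made up for illustration. It only shows the idea that a sink removes events from the channel for good once delivery succeeds, and puts them back otherwise.

```python
from collections import deque


class Channel:
    """Toy in-memory channel: a queue whose takes can be rolled back."""

    def __init__(self):
        self._events = deque()

    def put(self, event):
        # Called by the source for every event it receives.
        self._events.append(event)

    def take_batch(self, size):
        # Remove and return up to `size` events (the "open transaction").
        batch = []
        while self._events and len(batch) < size:
            batch.append(self._events.popleft())
        return batch

    def rollback(self, batch):
        # Put an undelivered batch back at the head, preserving order.
        self._events.extendleft(reversed(batch))

    def __len__(self):
        return len(self._events)


def run_sink(channel, deliver, batch_size=2):
    """Drain the channel; events are only lost from it if `deliver` succeeds."""
    while len(channel):
        batch = channel.take_batch(batch_size)
        try:
            deliver(batch)  # e.g. write to HDFS or forward to the next agent
        except IOError:
            channel.rollback(batch)  # keep the events for a later retry
            return False
    return True


# Source side: a web server log reader would call put() once per line.
channel = Channel()
for line in ["GET /a", "GET /b", "GET /c"]:
    channel.put(line)

stored = []  # stands in for the HDFS destination
ok = run_sink(channel, stored.extend)
```

In a multi-agent setup, the `deliver` callable of one agent would simply be the `put` side of the next agent.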

Page 10

Multi agent flow

Page 11

Consolidation flow

Page 12

Flume vs SQOOP

Flume:

● distributed

● reliable (transaction)

● available (backup routes)

● collecting data

● aggregating data

SQOOP:

● Data imports

● Parallelizes data transfer

● Copies data quickly
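To make "Parallelizes data transfer" concrete: SQOOP looks up the minimum and maximum of a split column (by default the primary key) and gives each map task its own slice of that range, with 4 map tasks by default. The sketch below is a simplified model of that idea, not SQOOP's actual splitter; `split_ranges`, `mapper_queries` and the `orders` table are hypothetical names.

```python
def split_ranges(min_id, max_id, num_mappers=4):
    """Cut the inclusive range [min_id, max_id] into one slice per mapper.

    Assumes an integer split column and min_id <= max_id.
    """
    total = max_id - min_id + 1
    num = min(num_mappers, total)  # never more slices than rows in range
    base, extra = divmod(total, num)
    ranges = []
    lo = min_id
    for i in range(num):
        size = base + (1 if i < extra else 0)  # spread the remainder
        hi = lo + size - 1
        ranges.append((lo, hi))
        lo = hi + 1
    return ranges


def mapper_queries(table, column, ranges):
    """Build the bounded SELECT that each mapper would run on its slice."""
    return [
        f"SELECT * FROM {table} WHERE {column} >= {lo} AND {column} <= {hi}"
        for lo, hi in ranges
    ]


ranges = split_ranges(1, 1000)
queries = mapper_queries("orders", "id", ranges)
for query in queries:
    print(query)
```

Each query can run independently against the database, which is why the transfer scales with the number of map tasks until the database itself becomes the bottleneck.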

Page 13

Flume example

Page 14

Flume example

Page 15

Flume example
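The example shown on these slides did not survive in the transcript. As a stand-in, here is a hedged sketch that renders a minimal single-agent Flume configuration: an exec source tailing a log file, a memory channel, and an HDFS sink. The component names (`src1`, `ch1`, `sink1`), the agent name `a1` and both paths are assumptions chosen for illustration; the property keys are standard Flume ones.

```python
def flume_agent_config(agent, log_path, hdfs_path):
    """Render a one-source, one-channel, one-sink Flume properties file."""
    lines = [
        # Declare the components of this agent.
        f"{agent}.sources = src1",
        f"{agent}.channels = ch1",
        f"{agent}.sinks = sink1",
        # Source: follow a log file and emit one event per line.
        f"{agent}.sources.src1.type = exec",
        f"{agent}.sources.src1.command = tail -F {log_path}",
        f"{agent}.sources.src1.channels = ch1",
        # Channel: buffer events in memory between source and sink.
        f"{agent}.channels.ch1.type = memory",
        f"{agent}.channels.ch1.capacity = 1000",
        # Sink: write events to HDFS as plain text.
        f"{agent}.sinks.sink1.type = hdfs",
        f"{agent}.sinks.sink1.hdfs.path = {hdfs_path}",
        f"{agent}.sinks.sink1.hdfs.fileType = DataStream",
        f"{agent}.sinks.sink1.channel = ch1",
    ]
    return "\n".join(lines) + "\n"


config = flume_agent_config(
    "a1",
    "/var/log/httpd/access_log",      # hypothetical web server log
    "hdfs://namenode/flume/weblogs",  # hypothetical HDFS directory
)
print(config)
```

Saved as, say, `example.conf`, such a file would typically be started with `flume-ng agent --conf conf --conf-file example.conf --name a1`. Note that a source lists its `channels` (plural) while a sink has exactly one `channel`.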

Page 16

SQOOP: import HDFS

Page 17

SQOOP: import HDFS

Page 18

SQOOP: import HDFS
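The commands from these slides are not in the transcript, so the following is only an illustrative sketch of what a SQOOP 1 import into HDFS usually looks like, built as an argument list. The JDBC URL, user name, table and target directory are hypothetical; the flags are standard `sqoop import` options.

```python
import shlex


def sqoop_import_hdfs(jdbc_url, username, table, target_dir, num_mappers=4):
    """Build the argument list for importing one table into an HDFS directory."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "-P",                       # prompt for the password on the console
        "--table", table,
        "--target-dir", target_dir,
        "--num-mappers", str(num_mappers),
    ]


args = sqoop_import_hdfs(
    "jdbc:mysql://db.example.com/shop",  # hypothetical database
    "etl",
    "orders",
    "/data/shop/orders",
)
# Print the command as it would be typed in a shell.
print(" ".join(shlex.quote(a) for a in args))
```

On a machine with SQOOP installed, the same list could be handed to `subprocess.run(args, check=True)`.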

Page 19

SQOOP: import Hive

Page 20

SQOOP: import Hive

Page 21

SQOOP: import Hive
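Again the slide content itself is missing from the transcript. A Hive import differs from the HDFS import mainly by the `--hive-import` family of flags, which make SQOOP load the imported files into a Hive table. The sketch below assumes a hypothetical database, table and Hive table name; it is not the command used in the talk.

```python
def sqoop_import_hive(jdbc_url, username, table, hive_table, create_table=True):
    """Build the argument list for importing one table into Hive."""
    args = [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "-P",                        # prompt for the password on the console
        "--table", table,
        "--hive-import",             # load the imported data into Hive
        "--hive-table", hive_table,
    ]
    if create_table:
        # With this flag the job fails if the Hive table already exists.
        args.append("--create-hive-table")
    return args


hive_args = sqoop_import_hive(
    "jdbc:mysql://db.example.com/shop",  # hypothetical database
    "etl",
    "orders",
    "shop.orders",                       # hypothetical Hive table
)
print(" ".join(hive_args))
```

Passing `create_table=False` leaves the flag out, which suits repeated loads into a table that already exists.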

Page 22

Thanks

Christophe Marchal | Software Architect @toff63