MapR Converged Data Platform

63
ยฉ 2017 MapR Technologies 1 Converged Data Platform MapR Converged Data Platform

Transcript of MapR Converged Data Platform

Page 1: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 1 Converged Data Platform

MapR Converged Data Platform

Page 2: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 2 Converged Data Platform

๋น…๋ฐ์ดํ„ฐ ์‹œ์žฅ๋™ํ–ฅ

Page 3: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 3 Converged Data Platform

๊ธฐ์—… IT ํ™˜๊ฒฝ์˜ ๋ณ€ํ™”

โ€œ1980๋…„๋Œ€ ๋ชจ๋“  ๋ฐ์ดํ„ฐ๋ฅผ ํ”Œ๋žซ ํŒŒ์ผ๋กœ ๊ด€๋ฆฌํ•˜๋˜ ์–ด๋ ค์›€์„ ๊ทน๋ณตํ•˜๊ณ ์ž ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์‹œ์Šคํ…œ์ด ์‹œ์žฅ์— ์ถœ์‹œ ๋œ ์ดํ›„๋กœ ๊ธฐ์—…์šฉ

์–ดํ”Œ๋ฆฌ์ผ€์ด์…˜ ๋“ฑ์žฅ, ์ธํ„ฐ๋„ท์˜ ๋“ฑ์žฅ, ๋””์ง€ํ„ธ ๋ณ€ํ˜ ์ ‘๋ชฉ ๋“ฑ ๊ธฐ์—… ํ˜์‹ ์˜ ํ•ต์‹ฌ์—๋Š” ํ•ญ์ƒ ๋ฐ์ดํ„ฐ๊ฐ€ ์ค‘์š”ํ•œ ์—ญํ• ์„ ํ•จโ€

1980s Now

Early

databases

Enterprise

applications

The Internet Digital

Transformation

๋น…๋ฐ์ดํ„ฐ ์‹œ์žฅ๋™ํ–ฅ

Page 4: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 4 Converged Data Platform

๊ธฐ์—… IT ํ™˜๊ฒฝ์˜ ๋ณ€ํ™”์™€ ๋ฐ์ดํ„ฐ

๋น…๋ฐ์ดํ„ฐ ์‹œ์žฅ๋™ํ–ฅ

(100,000)

(80,000)

(60,000)

(40,000)

(20,000)

-

20,000

40,000

60,000

80,000

100,000

120,000

2013 2014 2015 2016 2017 2018 2019 2020

์ฐจ์„ธ๋Œ€ IT ๋น„์šฉ vs. ๋ ˆ๊ฑฐ์‹œ IT ๋น„์šฉ ($M)

$120

100

80

60

40

20

(20)

(40)

(60)

(80)

(100)

In Billions

IT ์‹œ์žฅ์˜ ์ „์ฒด ํˆฌ์ž ๊ทœ๋ชจ($) ์ฐจ์„ธ๋Œ€ IT ์‹œ์žฅ์˜ ํˆฌ์ž ๊ทœ๋ชจ($) ๋ ˆ๊ฑฐ์‹œ IT ์‹œ์žฅ์˜ ํˆฌ์ž/์‚ญ๊ฐ ๊ทœ๋ชจ($)

โ€œ๊ฐ€ํŠธ๋„ˆ์™€ ๊ฐ™์€ ์‹œ์žฅ ์กฐ์‚ฌ ๊ธฐ๊ด€๋“ค์€ ํ–ฅํ›„ 4๋…„ ์ด๋‚ด์— ๊ธฐ์—…๋‚ด ๋ฐ์ดํ„ฐ์˜ 90%๊ฐ€ Hadoop๊ณผ ๊ฐ™์€ ์ฐจ์„ธ๋Œ€ IT ํ™˜๊ฒฝ์— ์ €์žฅ๋  ๊ฒƒ์œผ๋กœ ์˜ˆ์ธก.โ€

Source: IDC, Gartner; Analysis & Estimates: MapR

Next-gen consists of cloud, big data, software and hardware related expenses

Page 5: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 5 Converged Data Platform

๊ธฐ์กด IT ํ™˜๊ฒฝ์˜ ํŠน์ง•

๋น…๋ฐ์ดํ„ฐ ์‹œ์žฅ๋™ํ–ฅ

โ€œ๊ธฐ์กด IT ํ™˜๊ฒฝ์€ ์ „ํ†ต์ ์ธ ๋ฐ์ดํ„ฐ ์ ‘๊ทผ๋ฒ•์— ์˜ํ•ด ๋‹จ์œ„๋ถ€์„œ ๋ณ„๋กœ ๊ฐ๊ธฐ ๋‹ค๋ฅธ ์–ดํ”Œ๋ฆฌ์ผ€์ด์…˜์ด ๊ฐ๊ธฐ ๋‹ค๋ฅธ ๋ฐ์ดํ„ฐ ์†Œ์Šค๋ฅผ ํ™œ์šฉํ•˜๋Š” ๊ตฌ์กฐโ€

< ๊ธฐ์กด IT ์•„ํ‚คํ…์ฒ˜ >

๊ธฐ์กด ํ™˜๊ฒฝ์˜ ํŠน์ง•

๋‹จ์œ„ ์‚ฌ์—…๋ณ„๋กœ ๊ตฌ์ถ•

์‚ฌ์ผ๋กœ ํ˜•ํƒœ์˜ ๊ตฌ์„ฑ

์ด๊ธฐ์ข… ์‹œ์Šคํ…œ๋“ค๊ฐ„์˜ ๋ณต์žกํ•œ

์ธํ„ฐํŽ˜์ด์Šค ๊ด€๊ณ„

์ •ํ˜• ๋ฐ์ดํ„ฐ ์œ„์ฃผ์˜ ์ฒ˜๋ฆฌ

IT ์—์„œ ๋ฐ์ดํ„ฐ๋ฅผ ๋ฐœ์ƒ ๋ฐ ์ฒ˜๋ฆฌ

Oracle, SAP, Microsoft ๋“ฑ์˜

์ „ํ†ต์  ์–ดํ”Œ๋ฆฌ์ผ€์ด์…˜

ํšŒ๊ณ„ํŒ€

์˜์—…ํŒ€

๋งˆ์ผ€ํŒ…ํŒ€

๊ฒฝ์—‰๋ถ„์„ํŒ€

๊ฐœ๋ฐœํŒ€

์ธ์‚ฌํŒ€

ITํŒ€

Page 6: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 6 Converged Data Platform

์ฐจ์„ธ๋Œ€ IT ํ™˜๊ฒฝ์˜ ํŠน์ง•

๋น…๋ฐ์ดํ„ฐ ์‹œ์žฅ๋™ํ–ฅ

โ€œ์ฐจ์„ธ๋Œ€ IT ํ™˜๊ฒฝ์€ ๊ณ ๊ฐ๋‹ˆ์ฆˆ ๋˜๋Š” ๊ฒฝ์˜์ง€ํ‘œ๋ฅผ ๋ณด๋‹ค ์‹ฌ๋„์žˆ๊ฒŒ ๋ถ„์„ํ•˜๊ธฐ ์œ„ํ•œ ๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜์˜ ํ†ตํ•ฉ๋œ ํ”Œ๋žซํผ ํ˜•ํƒœ์˜ ๊ตฌ์กฐโ€

๋น…๋ฐ์ดํ„ฐ ๊ธฐ๋ฐ˜ ํ”Œ๋žซํผ

์ƒ๊ถŒ๋ถ„์„ ๋งž์ถคํ˜• ์ƒํ’ˆ ์ถ”์ฒœ ์‹ค์‹œ๊ฐ„ ์ƒํ’ˆ ์ถ”์ฒœ ๊ณ ๊ฐ ์„ธ๋ถ„ํ™”

ํŠธ๋ Œ๋“œ๋ถ„์„

๊ฒฝ์Ÿ์‚ฌ๋ถ„์„

SNS๋ถ„์„

์‹œ์žฅ๋ถ„์„

์†Œํ†ต๊ธฐ๋ฐ˜ ์ƒํ’ˆ

๊ฐœ์ธํ™” ์ƒํ’ˆ

์—ฐ๊ณ„์‚ฐ์—… ์ƒํ’ˆ

360 ยฐ ๊ณ ๊ฐ ๋ถ„์„ ์œ„์น˜๊ธฐ๋ฐ˜ ์„œ๋น„์Šค

์‚ฌ๋ฌผ์ธํ„ฐ๋„ท ์—ฐ๊ณ„

์ด์Šˆ๊ธฐ๋ฐ˜ ์„œ๋น„์Šค

O2O ์„œ๋น„์Šค

์ˆ˜์š”์˜ˆ์ธก ๊ฐœ์ธํ™” ๋ถ„์„ ์‹ค์‹œ๊ฐ„ ํƒ์ง€ ์‹ค์‹œ๊ฐ„ ๋งˆ์ผ€ํŒ… ์ƒํ’ˆ์„ค๊ณ„

< ์ฐจ์„ธ๋Œ€ IT ์•„ํ‚คํ…์ฒ˜ >

์ด์ƒ๊ฑฐ๋ž˜ ํƒ์ง€

์‹ค์‹œ๊ฐ„ ์†Œ๋น„ ํƒ์ง€

SNS ์ด์Šˆ ํƒ์ง€

์†Œ๋น„ํŒจํ„ด ๋ถ„์„

์ฐจ์„ธ๋Œ€ ํ™˜๊ฒฝ์˜ ํŠน์ง•

ํ†ตํ•ฉ ์‚ฌ์—…์œผ๋กœ ๊ตฌ์ถ•

์ด๊ธฐ์ข… ์‹œ์Šคํ…œ๋“ค์„ ๋ฐ์ดํ„ฐ

๊ธฐ๋ฐ˜์œผ๋กœ ํ†ตํ•ฉํ•˜๋Š” ํ˜•ํƒœ

์ •ํ˜•/๋ฐ˜์ •ํ˜•/๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ๋ฅผ

๋ชจ๋‘ ํ†ตํ•ฉ

IT/OT ๋ชจ๋“  ์˜์—ญ์—์„œ ๋ฐ์ดํ„ฐ๋ฅผ

๋ฐœ์ƒ ๋ฐ ์ฒ˜๋ฆฌ

๊ฐœ์ธํ™” ๋ถ„์„, ์‹ค์‹œ๊ฐ„ ์˜คํผ๋“ฑ์˜

์ƒˆ๋กœ์šด ํŒจ๋Ÿฌ๋‹ค์ž„

Page 7: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 7 Converged Data Platform

ํ†ตํ•ฉ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์˜ ํ•„์š”์„ฑ

๋น…๋ฐ์ดํ„ฐ ์‹œ์žฅ๋™ํ–ฅ

โ€œ์ƒˆ๋กœ์šด ์ฐจ์„ธ๋Œ€ IT ์•„ํ‚คํ…์ฒ˜๋Š” ์ „ํ†ต์ ์ธ ์‹œ์Šคํ…œ๊ณผ ์ƒˆ๋กœ์šด ์‹œ์Šคํ…œ ๋ชจ๋‘์™€ ํ˜ธํ™˜์ด ๊ฐ€๋Šฅ ํ•˜๋ฉด์„œ ์ „์‚ฌ ํ†ตํ•ฉ์ด ๊ฐ€๋Šฅํ•œ ์•„ํ‚คํ…์ฒ˜.โ€

์ƒˆ๋กœ์šด ์–ดํ”Œ๋ฆฌ์ผ€์ด์…˜ ์ „ํ†ต์  ์–ดํ”Œ๋ฆฌ์ผ€์ด์…˜

์˜คํ”ˆ์†Œ์Šค ์‹œ์Šคํ…œ ๋ถ„์„ ์‹œ์Šคํ…œ ๊ธฐ์กด ๋„์ž… ์‹œ์Šคํ…œ

์ƒˆ๋กœ์šด ํ†ตํ•ฉ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ

์˜จํ”„๋ ˆ๋ฏธ์Šค ํ”„๋ผ์ด๋น— ํด๋ผ์šฐ๋“œ ํผ๋ธ”๋ฆญ ํด๋ผ์šฐ๋“œ

๋‹ค์–‘ํ•œ ํ•˜๋“œ์›จ์–ด

Page 8: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 8 Converged Data Platform

๋น…๋ฐ์ดํ„ฐ ์‹œ๋Œ€์˜ ์‚ฐ์—… ๋ฐฉํ–ฅ

๋น…๋ฐ์ดํ„ฐ ์‹œ์žฅ๋™ํ–ฅ

โ€œMcKinsey๋“ฑ์˜ ์‹œ์žฅ์กฐ์‚ฌ ๊ธฐ๊ด€๋“ค์€ ๊ธˆ์œต์—…์€ ๋น…๋ฐ์ดํ„ฐ ํ™œ์šฉ์˜ ์ž ์žฌ ๊ฐ€์น˜๊ฐ€ ๋†’์€ ์‚ฐ์—…์œผ๋กœ, ๊ธฐ์กด ๋ฐ์ดํ„ฐ ๋ณด์œ ๋Ÿ‰์ด ๋งŽ๊ณ  ์ƒˆ๋กœ ์œ ์ž…๋˜๋Š”

๋ฐ์ดํ„ฐ ์–‘์ด ๋งŽ๊ธฐ ๋•Œ๋ฌธ์— VoC ๋ถ„์„, 360ยฐ ๊ณ ๊ฐ ์‹ฑ๊ธ€๋ทฐ ๊ตฌ์„ฑ ๋“ฑ ํ™œ์šฉ๊ฐ€์น˜๊ฐ€ ๋งค์šฐ ๋†’์€ ์‚ฐ์—…์œผ๋กœ ๋ถ„๋ฅ˜ ํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.โ€

๊ฐœ์ธํ™” ์ƒํ’ˆ ์„ค๊ณ„

AI ์ž๋™์ƒ๋‹ด

360ยฐ ๊ณ ๊ฐ ์‹ฑ๊ธ€๋ทฐ

DW ํ™•์žฅ VoC ๋ถ„์„

์œตํ•ฉ ์‚ฐ์—… ๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ ํ†ตํ•ฉ ์ ‘๊ทผ์ด๋ ฅ ๊ฐ์‚ฌ

์‚ฌ๋ฌผ์ธํ„ฐ๋„ท ์—ฐ๊ณ„ ์ด์ƒ๊ฑฐ๋ž˜ ํƒ์ง€

Page 9: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 9 Converged Data Platform

๊ธฐ์—…๋“ค์ด MapR ํ”Œ๋žซํผ์„ ์„ ํƒํ•˜๋Š” ์ด์œ 

๋น…๋ฐ์ดํ„ฐ ์‹œ์žฅ๋™ํ–ฅ

โ€œMapR ํ”Œ๋žซํผ์€ ๋‹จ์ˆœํ•œ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์œผ๋กœ์„œ์˜ ๊ธฐ๋Šฅ์ด ์•„๋‹Œ ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ์ˆ˜์ค€์˜ ๊ฐ€์šฉ์„ฑ๊ณผ ์•ˆ์ „์„ฑ์„ ์ œ๊ณตํ•˜๋Š” ๊ธฐ์—…์šฉ ํ”Œ๋žซํผโ€

ํ•ต์‹ฌ ๊ธฐ์ˆ  ์š”์†Œ

โ€ข ๋ถ„์‚ฐ ํŒŒ์ผ ์‹œ์Šคํ…œ โ€“ MapR-FS

โ€ข NoSQL DB ์‹œ์Šคํ…œ โ€“ MapR-DB

โ€ข ๊ธ€๋กœ๋ฒŒ ๋ฉ”์„ธ์ง• ์‹œ์Šคํ…œ โ€“ MapR- Streams

๊ณ ์†์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ ์ตœ์ ํ™”

โ€ข ๋Œ€์šฉ๋Ÿ‰ ๋ถ„์‚ฐ ๋ณ‘๋ ฌ ์ฒ˜๋ฆฌ ์ง€์›

โ€ข ์ •ํ˜• ๋ฐ ๋น„์ •ํ˜•๋“ฑ์˜ ๋‹ค์–‘ํ•œ ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•œ ๋Œ€๊ทœ๋ชจ ๋ถ„์„ ๋ฐ ๊ธฐ๊ณ„ ํ•™์Šต ๊ตฌํ˜„

๊ทœ๋ชจ์— ๋”ฐ๋ฅธ ํ™•์žฅ

โ€ข ์ˆ˜ ์กฐ ๋‹จ์œ„๊นŒ์ง€ ํ™•์žฅ๋˜๋Š” ์ œํ•œ์—†๋Š” ํŒŒ์ผ ์‹œ์Šคํ…œ

โ€ข ๋Œ€๊ทœ๋ชจ ๋ถ„์‚ฐ ์ฒ˜๋ฆฌ ํ™˜๊ฒฝ์—์„œ ๋Œ€์šฉ๋Ÿ‰ ๋ฐ์ดํ„ฐ ๊ณ ์† ์ฒ˜๋ฆฌ

์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ์ˆ˜์ค€์˜ ์•ˆ์ •์„ฑ

โ€ข ๋ฌด์ค‘๋‹จ ๋ฐ์ดํ„ฐ ์•ก์„ธ์Šค๋ฅผ ์ง€์›ํ•˜๋Š” ๋‹ค์–‘ํ•œ ์ž๋™ ๋ณต๊ตฌ ๊ธฐ๋Šฅ

โ€ข ์žฌํ•ด ๋ณต๊ตฌ์™€ ์Šค๋ƒ…์ƒท์„ ํฌํ•จํ•œ ๊ณ ๊ฐ€์šฉ์„ฑ ์ œ๊ณต

โ€ข ์•ˆ์ •์ ์ธ ๊ณ ๊ฐ ์ง€์› ์ฒด๊ณ„

Page 10: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 10 Converged Data Platform

MapR ํ”Œ๋žซํผ์„ ํ†ตํ•œ ๋น„์ฆˆ๋‹ˆ์Šค ๋ชฉํ‘œ

๋น…๋ฐ์ดํ„ฐ ์‹œ์žฅ๋™ํ–ฅ

โ€œ๋””์ง€ํ„ธ ๋ณ€ํ˜์— ๋Œ€ํ•œ ํ˜์‹  ์„ฑ๊ณผ์™€ ๋น„์šฉ ์ ˆ๊ฐ์— ๋Œ€ํ•œ ๊ฒฝ์˜ ์„ฑ๊ณผ๋ฅผ ๋ชจ๋‘ ๋‹ฌ์„ฑํ•  ์ˆ˜ ์žˆ๋Š” ํ˜์‹  ํ”Œ๋žซํผ์œผ๋กœ์„œ MapR์„ ์„ ํƒโ€

Financial Services Telco & Media

Ad tech

Government

Retail UnitedHealth Group์€ ๊ฑด๊ฐ• ๋ณดํ—˜๋ฃŒ ์ง€๋ถˆ๊ณผ ํด๋ ˆ์ž„ ๊ด€๋ฆฌ ๋“ฑ 80์—ฌ๊ฐ€์ง€ ์œ ์Šค์ผ€์ด์Šค๋ฅผ MapR ๊ธฐ๋ฐ˜์œผ๋กœ ๋„์ž…ํ•˜์—ฌ ์ง€๋ถˆ์˜ค๋ฅ˜, ์‚ฌ๊ธฐ ๋“ฑ ์›” 2๋ฐฑ๋งŒ ๋‹ฌ๋Ÿฌ์˜ ๋น„์šฉ ์ ˆ๊ฐ

IRi์‚ฌ๋Š” ๊ธฐ์กด ์šด์˜์ค‘์ด๋˜ ๋ฉ”์ธํ”„๋ ˆ์ž„์—์„œ MapR ํ”Œ๋žซํผ๊ณผ์˜ DW ์˜คํ”„๋กœ๋“œ๋ฅผ ํ†ตํ•ด ์—ฐ๊ฐ„ 2.5๋ฐฑ๋งŒ ๋‹ฌ๋Ÿฌ์˜ ๋น„์šฉ์„ ์ ˆ๊ฐ

Experian์‚ฌ๋Š” ๊ธฐ์กด DB2 ํ™˜๊ฒฝ์˜ ์‹ ์šฉํ‰๊ฐ€ ์Šค์ฝ”์–ด๋ง ์‹œ์Šคํ…œ์„ MapR ํ”Œ๋žซํผ์— ์ด์‹ํ•˜์—ฌ 20๋ฐฐ์˜ ๋น„์šฉ ์ ˆ๊ฐ

AADHAAR ์‚ฌ๋Š” ์ธ๋„์˜ 12์–ต5์ฒœ๋งŒ๋ช…์˜ ์ƒ์ฒด์ธ์‹ ์‹œ์Šคํ…œ์„ ๊ฐœ๋ฐœํ•˜์—ฌ ๋ถˆ๋ฒ•์ ์œผ๋กœ ๋ณดํ—˜๊ธˆ์„ ์ˆ˜๋ นํ•˜๋Š” ์‚ฌ๊ธฐํƒ์ง€ ์‹œ์Šคํ…œ์„ ๊ฐœ๋ฐœํ•ด 1.3B$๋ฅผ ์ ˆ๊ฐ

TransUnion ์‚ฌ๋Š” MapR ํ”Œ๋žซํผ์—์„œ ์ƒˆ๋กœ์šด ์…€ํ”„์„œ๋น„์Šค ๋ถ„์„ ํ”Œ๋žซํผ์„ ํ†ตํ•ด ๊ณ ๊ฐ์—๊ฒŒ ๋” ๋‚˜์€ ์‹œ์žฅ ํ†ต์ฐฐ๋ ฅ์„ ์ œ๊ณตํ•˜์—ฌ ์˜์‚ฌ๊ฒฐ์ •์— ๋„์›€

AMEX ์นด๋“œ์‚ฌ๋Š” MapR ํ”Œ๋žซํผ ๊ธฐ๋ฐ˜์˜ ์ด์ƒ๊ธˆ์œต๊ฑฐ๋ž˜ ํƒ์ง€ ์‹œ์Šคํ…œ์œผ๋กœ ๋งค๋…„ 1์กฐ ๋‹ฌ๋Ÿฌ์˜ ๊ฑฐ๋ž˜์—์„œ ์‚ฌ๊ธฐ๋ฅผ ๋ณดํ˜ธ

๋น„์ฆˆ๋‹ˆ์Šค ํ˜์‹ 

๋น„์šฉ ์ ˆ๊ฐ

Page 11: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 11 Converged Data Platform

1. MapR Technologies ์†Œ๊ฐœ

Page 12: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 12 Converged Data Platform

MapR Technologies ์—ฐํ˜

1. MapR Technologies ์†Œ๊ฐœ

โ€œMapR Technologies๋Š” 2009๋…„ ์„ค๋ฆฝ๋˜์—ˆ์œผ๋ฉฐ, ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ํ™˜๊ฒฝ์ด ์š”๊ตฌํ•˜๋Š” ์‹ ๋ขฐ์„ฑ ๋†’์€ ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์„ ์ œ๊ณตํ•˜๋Š” ํšŒ์‚ฌ์ž…๋‹ˆ๋‹ค.

๋ณธ์‚ฌ๋Š” ๋ฏธ๊ตญ ์‚ฐํ˜ธ์„ธ์— ์œ„์น˜ํ•ด ์žˆ์œผ๋ฉฐ, ๋น… ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ ์‹œ์žฅ์—์„œ ๊ฐ€์žฅ ๋งŽ์€ ์œ ๋ฃŒ๊ณ ๊ฐ์„ ํ™•๋ณดํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.โ€

MapR ์ฃผ์š” ์—ฐํ˜ ๋ฐ ์ œํ’ˆ ์ถœ์‹œ ํ˜„ํ™ฉ

2009 2011 2012 2013 2014 2015 2016

MapR Technologies ๋ฏธ๊ตญ ์„ค๋ฆฝ

MapR ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ ์ถœ์‹œ

ํ‘œ์ค€ API๋ฅผ ํ™œ์šฉํ•œ Unified ํ•˜๋‘ก ํ”Œ๋žซํผ ์ถœ์‹œ

MapR Enterprise Hadoop V1

ENTERPRISE EDITION

์„ธ๊ณ„ ์ตœ๋Œ€ ๋น…๋ฐ์ดํ„ฐ ์—…์ฒด์™€ ํŒŒํŠธ๋„ˆ ์ฒด๊ฒฐ

MapR Enterprise Hadoop V2

MapR ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ์—๋””์…˜ ์ถœ์‹œ

์—…๊ณ„ ์ตœ์ดˆ ํ•˜๋‘กํŒŒ์ผ์‹œ์Šคํ…œ๊ณผ NoSQL ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ํ†ตํ•ฉ

MapR Enterprise Hadoop V3

ENTERPRISE DATABASE EDITION

MapR ํ•œ๊ตญ์ง€์‚ฌ ์„ค๋ฆฝ

MapR Enterprise Hadoop V4

์—…๊ณ„์œ ์ผ์˜ ํ•˜๋‘ก ๋ฒ„์ „ ํ˜ธํ™˜์„ฑ(MR1/MR2) ์ œ๊ณต

ํ•˜๋‘ก์—…์ฒด ์ตœ์ดˆ ๋กœ์ปฌ ์ง€์› ์ธ๋ ฅ ๋ฐ ํ•œ๊ตญ์–ด ์„œ๋น„์Šค ์ œ๊ณต

MapR Enterprise Hadoop V5

์—…๊ณ„์œ ์ผ ์‹ค์‹œ๊ฐ„ NoSQL ๋ฐ์ดํ„ฐ๋ฒ ์ด์Šค ๋ณต์ œ ๊ตฌํ˜„

MapR Converged Data Platform V5.2

MapR ์ปจ๋ฒ„์ง€๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ ์ถœ์‹œ

๋ถ„์‚ฐ ๋ฉ”์‹œ์ง• ์‹œ์Šคํ…œ์„ ํฌํ•จํ•˜๋Š” ์ปจ๋ฒ„์ง€๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ

Page 13: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 13 Converged Data Platform

MapR ๊ธ€๋กœ๋ฒŒ ํŒŒํŠธ๋„ˆ

1. MapR Technologies ์†Œ๊ฐœ

APPLICATIONS & OS

ANALYTICS & BUSINESS INTELLIGENCE

DATA WAREHOUSE & INTEGRATION

INFRASTRUCTURE & CLOUD

CONSULTANTS & INTEGRATORS

Page 14: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 14 Converged Data Platform

MapR ์‚ฐ์—…๋ณ„ ๊ธ€๋กœ๋ฒŒ ๊ณ ๊ฐ

1. MapR Technologies ์†Œ๊ฐœ

Top

Fashion

Retailer

FINANCIAL SERVICES RETAIL CPG &

MANUFACTURING ONLINE SERVICES &

SECURITY MEDIA &

ENTERTAINMENT

MARKET RESEARCH ADVERTISING HEALTH COMMUNICATIONS GOVERMENT

Page 15: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 15 Converged Data Platform

MapR Converged Data Platform

1. MapR Technologies ์†Œ๊ฐœ

Open Source Engines & Tools Commercial Engines & Applications

Enterprise-Grade Platform Services

Data

P

roces

sin

g

Web-Scale Storage MapR-FS MapR-DB

Search and

Others

Real Time Unified Security Multi-tenancy Disaster Recovery Global Namespace High Availability

MapR Streams

Cloud and Managed Services

Search and

Others

Un

ified

Man

ag

em

en

t an

d M

on

itorin

g

Search and

Others

Event Streaming Database

Custom Apps

HDFS API POSIX, NFS HBase API JSON API Kafka API

Page 16: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 16 Converged Data Platform

MapR ์ปจ๋ฒ„์ง€๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ (Read/Write)

NFS HDFS API

MapR-FS (POSIX)

HBase API JSON API

MapR-DB (High-Performance NoSQL)

Kafka API OJAI API

MapR Streams (Global Event Streaming)

MapR Core & MapR Ecosystem Pack

1. MapR Technologies ์†Œ๊ฐœ

์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ๋ฐ์ดํ„ฐ๋ณดํ˜ธ ํ”Œ๋žซํผ ๊ด€๋ฆฌ ์„ฑ๋Šฅ ๋™์‹œ์‚ฌ์šฉ์„ฑ ์ƒํ˜ธํ˜ธํ™˜์„ฑ

๊ธฐ์กด์˜ DB๋‚˜ ๋กœ๊ทธ์„œ๋ฒ„ ๋กœ๋ถ€ํ„ฐ ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘

Storm

Spark Steaming

Pig

Sqoop

MapR-NFS

Flume

MapR Ecosystem Pack

Hive

Spark SQL

Drill

Zookeeper

Sentry

Oozie

R / Python / Scala

Mahout

Spark MLlib

๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ๋ฐ์ดํ„ฐ ์ €์žฅ ๋ฐ ์ฒ˜๋ฆฌ ๊ฐœ๋ฐœ ๋ฐ ๊ด€๋ฆฌ ๋ฐ์ดํ„ฐ ๋ถ„์„

HUE

๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์— ์ €์žฅ๋˜์–ด ์žˆ๋Š” ๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ๋ฅผ ์ฒ˜๋ฆฌ

๋Œ€๋Ÿ‰์˜ ์ •ํ˜• ๋ฐ์ดํ„ฐ๋ฅผ SQL ๊ธฐ๋ฐ˜์œผ๋กœ ์ฒ˜๋ฆฌ

์ŠคํŠธ๋ฆฌ๋ฐ์œผ๋กœ ์ƒ์„ฑ๋˜๋Š” ๋ฐ์ดํ„ฐ๋ฅผ ์‹ค์‹œ๊ฐ„์œผ๋กœ ์ฒ˜๋ฆฌ

๋ฐ์ดํ„ฐ ๋งˆ์ด๋‹, ๊ธฐ๊ณ„ ํ•™์Šต์„ ํ†ตํ•œ ๋ฐ์ดํ„ฐ ๋ถ„์„

์–ดํ”Œ๋ฆฌ์ผ€์ด์…˜ ๊ฐœ๋ฐœ, ํด๋Ÿฌ์Šคํ„ฐ ๊ด€๋ฆฌ

YARN Framework

Page 17: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 17 Converged Data Platform

2. Hadoop ๊ธฐ๋ฐ˜ ํ”Œ๋žซํผ ๊ตฌ์ถ•๊ณผ ํ•œ๊ณ„

Page 18: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 18 Converged Data Platform

Hadoop ์ด๋ž€?

2. Hadoop ๊ธฐ๋ฐ˜ ํ”Œ๋žซํผ ๊ตฌ์ถ•๊ณผ ํ•œ๊ณ„

โ€œHadoop์ด๋ž€ ๋Œ€๋Ÿ‰์˜ ๋ฐ์ดํ„ฐ๋ฅผ ๋ณด๊ด€ ๋ฐ ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์žˆ๋Š” ๋ถ„์‚ฐ ์ €์žฅ์†Œ ๋ฐ ๋ถ„์‚ฐ์ฒ˜๋ฆฌ ํ”„๋ ˆ์ž„์›Œํฌ ์ด๋ฉฐ Hadoop์— ์ €์žฅ๋˜์–ด ์žˆ๋Š” ๋ฐ์ดํ„ฐ๋ฅผ

์ˆ˜์ง‘, ์ €์žฅ, ์ฒ˜๋ฆฌ, ๋ถ„์„ ํ•˜๊ธฐ ์œ„ํ•œ ๋‹ค์–‘ํ•œ ์—์ฝ”์‹œ์Šคํ…œ์ด ํด๋Ÿฌ์Šคํ„ฐ ์ปดํ“จํŒ… ๊ธฐ๋ฐ˜์œผ๋กœ ๊ตฌ์„ฑ๋˜์–ด ์žˆ๋Š” ํ™˜๊ฒฝโ€

๋ถ„์‚ฐ ์ €์žฅ์†Œ

๋ถ„์‚ฐ ์ฒ˜๋ฆฌ ํ”„๋ ˆ์ž„์›Œํฌ

์˜คํ”ˆ์†Œ์Šค ์—์ฝ” ์‹œ์Šคํ…œ

Page 19: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 19 Converged Data Platform

๋„ค์ž„๋…ธ๋“œ

NoSQL ํด๋Ÿฌ์Šคํ„ฐ

Impala ํด๋Ÿฌ์Šคํ„ฐ

์ŠคํŠธ๋ฆฌ๋ฐ ํด๋Ÿฌ์Šคํ„ฐ

ํ•˜์ง€๋งŒ ์‹ค์ œ Hadoop ํ”Œ๋žซํผ์„ ๊ตฌ์ถ•ํ•ด ๋ณด๋ฉดโ€ฆ

2. Hadoop ๊ธฐ๋ฐ˜ ํ”Œ๋žซํผ ๊ตฌ์ถ•๊ณผ ํ•œ๊ณ„

โ€œ๊ธฐ์—…ํ™˜๊ฒฝ์—์„œ Apache Hadoop ๊ธฐ๋ฐ˜์˜ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์„ ๊ตฌ์ถ•ํ•˜์—ฌ ๋ณด๋ฉด, ๋‹ค์–‘ํ•œ ๊ธฐ์—…๋‹ˆ์ฆˆ์— ๋Œ€์‘ํ•˜๊ธฐ ์œ„ํ•ด ์ˆ˜๋งŽ์€ ์ปดํฌ๋„ŒํŠธ๋“ค์ด

ํ™œ์šฉ๋˜๊ฒŒ ๋˜๊ณ  ์ด๋Ÿฌํ•œ ์ปดํฌ๋„ŒํŠธ์˜ ๋ถˆ์™„์ „ํ•œ ๊ฒฐํ•ฉ์œผ๋กœ ์ธํ•ด ๋น…๋ฐ์ดํ„ฐ ํ†ตํ•ฉ ํ”Œ๋žซํผ์˜ ๋ณต์žก๋„๊ฐ€ ์˜คํžˆ๋ ค ์ฆ๊ฐ€ํ•˜๋Š” ํ˜„์ƒ์ด ๋ฐœ์ƒโ€

๋ฐ์ดํ„ฐ๋…ธ๋“œ(HDFS)

์–€(YARN) ํ”„๋ ˆ์ž„์›Œํฌ

์˜คํ”ˆ์†Œ์Šค ์—์ฝ” ์‹œ์Šคํ…œ

Page 20: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 20 Converged Data Platform

๋„ค์ž„๋…ธ๋“œ

NoSQL ํด๋Ÿฌ์Šคํ„ฐ

Impala ํด๋Ÿฌ์Šคํ„ฐ

์ŠคํŠธ๋ฆฌ๋ฐ ํด๋Ÿฌ์Šคํ„ฐ

ํ•˜์ง€๋งŒ ์‹ค์ œ Hadoop ํ”Œ๋žซํผ์„ ๊ตฌ์ถ•ํ•ด ๋ณด๋ฉดโ€ฆ

2. Hadoop ๊ธฐ๋ฐ˜ ํ”Œ๋žซํผ ๊ตฌ์ถ•๊ณผ ํ•œ๊ณ„

โ€œ๊ธฐ์—…ํ™˜๊ฒฝ์—์„œ Apache Hadoop ๊ธฐ๋ฐ˜์˜ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์„ ๊ตฌ์ถ•ํ•˜์—ฌ ๋ณด๋ฉด, ๋‹ค์–‘ํ•œ ๊ธฐ์—…๋‹ˆ์ฆˆ์— ๋Œ€์‘ํ•˜๊ธฐ ์œ„ํ•ด ์ˆ˜๋งŽ์€ ์ปดํฌ๋„ŒํŠธ๋“ค์ด

ํ™œ์šฉ๋˜๊ฒŒ ๋˜๊ณ  ์ด๋Ÿฌํ•œ ์ปดํฌ๋„ŒํŠธ์˜ ๋ถˆ์™„์ „ํ•œ ๊ฒฐํ•ฉ์œผ๋กœ ์ธํ•ด ๋น…๋ฐ์ดํ„ฐ ํ†ตํ•ฉ ํ”Œ๋žซํผ์˜ ๋ณต์žก๋„๊ฐ€ ์˜คํžˆ๋ ค ์ฆ๊ฐ€ํ•˜๋Š” ํ˜„์ƒ์ด ๋ฐœ์ƒโ€

๋ฐ์ดํ„ฐ๋…ธ๋“œ(HDFS)

์–€(YARN) ํ”„๋ ˆ์ž„์›Œํฌ

์˜คํ”ˆ์†Œ์Šค ์—์ฝ” ์‹œ์Šคํ…œ

๋‹จ์ผ ์žฅ์• ์  ๋ฐœ์ƒ

์™ธ๋ถ€ ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ์ง€์—ฐ ์„œ๋น„์Šค ๊ฐ€์šฉ์„ฑ ์ถ•์†Œ

๋Œ€๋ฉด/๋น„๋Œ€๋ฉด ๊ฒฐํ•ฉ ์ €ํ•ด ๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ์–ด๋ ค์›€

๋…ธ๋“œ ์ˆ˜ ์ฆ๊ฐ€์— ๋”ฐ๋ฅธ ๋น„์šฉ ์ฆ๊ฐ€

๋Œ€๊ณ ๊ฐ ์„œ๋น„์Šค ํ’ˆ์งˆ ์ €ํ•˜ ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ์ˆ˜์ค€ ๊ฐ€์šฉ์„ฑ ํ™•๋ณด ๋ถˆ๊ฐ€

Page 21: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 21 Converged Data Platform

์–ด๋Š ๋น…๋ฐ์ดํ„ฐ ํ”„๋กœ์ ํŠธ ๋งค๋‹ˆ์ €์˜ ๊ณ ๋ฏผ

2. Hadoop ๊ธฐ๋ฐ˜ ํ”Œ๋žซํผ ๊ตฌ์ถ•๊ณผ ํ•œ๊ณ„

โ€œ๋ง‰์ƒ Hadoop ํ”„๋กœ์ ํŠธ๋ฅผ ๋‹ค ๋๋‚ด๋†“๊ณ  ๋ณด๋‹ˆโ€ฆโ€

์ˆ˜๋งŽ์€ ์ปดํฌ๋„ŒํŠธ๋“คโ€ฆ ์ˆ˜๋งŽ์€ ํด๋Ÿฌ์Šคํ„ฐ๋“คโ€ฆ

๋Œ€์ฒด ์ด ๋งŽ์€ ๊ด€๋ฆฌํฌ์ธํŠธ๋ฅผ ์–ด๋–ป๊ฒŒ ๊ด€๋ฆฌํ•˜๋ผ๋Š”๊ฑฐ์•ผโ€ฆ

๋ญ๊ฐ€ ์ด๋ ‡๊ฒŒ ์–ด๋ ค์šด๊ฑฐ์•ผ ์ต์ˆ™ํ•œ ๊ธฐ์ˆ ์ด๋ผ๊ณ ๋Š” ํ•˜๋‚˜๋„ ์—†๊ณ โ€ฆํ•˜์•„โ€ฆ

ํŒ€์› ๊ต์œก์ด ๊ฑฑ์ •์ด๋‹คโ€ฆ

๊ทธ๋ฆฌ๊ณ  Hadoop ์ด๋ผ๋Š”๊ฒŒ ์›๋ž˜ ์ด๋ ‡๊ฒŒ ๋Š๋ฆฐ๊ฑฐ์˜€์–ด? ์ด๊ฑด ๋‚ด ์˜ˆ์ƒ๋ณด๋‹ค ํ›จ์”ฌ ์„ฑ๋Šฅ์ด ์•ˆ๋‚˜์˜ค์ž–์•„โ€ฆ

์ด๊ฑฐ ์ •๋ง ๊ธฐ์—…์—์„œ ์‚ฌ์šฉํ•˜๋ผ๊ณ  ๋งŒ๋“ ๊ฑฐ์•ผ? ์žฌ๋‚œ๋Œ€์ฑ…์€? ๋ณด์•ˆ์€?

๊ฑฑ์ •์ด๋‹คโ€ฆ

Page 22: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 22 Converged Data Platform

3. MapR Converged Data Platform ํŠน์žฅ์ 

Page 23: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 23 Converged Data Platform

๋‹จ์ผ ์žฅ์• ์  ์ œ๊ฑฐ๋กœ ์„œ๋น„์Šค ๋ ˆ๋ฒจ ํ’ˆ์งˆ ํ™•๋ณด

3. MapR Converged Data Platform ํŠน์žฅ์ 

Page 24: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 24 Converged Data Platform

๋‹จ์ผ ์žฅ์• ์  ์ œ๊ฑฐ๋กœ ์„œ๋น„์Šค ๋ ˆ๋ฒจ ํ’ˆ์งˆ ํ™•๋ณด

3. MapR Converged Data Platform ํŠน์žฅ์ 

Page 25: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 25 Converged Data Platform

๋‹จ์ผ ์žฅ์• ์  ์ œ๊ฑฐ๋กœ ์„œ๋น„์Šค ๋ ˆ๋ฒจ ํ’ˆ์งˆ ํ™•๋ณด

3. MapR Converged Data Platform ํŠน์žฅ์ 

Page 26: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 26 Converged Data Platform

๋‹จ์ผ ์žฅ์• ์  ์ œ๊ฑฐ๋กœ ์„œ๋น„์Šค ๋ ˆ๋ฒจ ํ’ˆ์งˆ ํ™•๋ณด

3. MapR Converged Data Platform ํŠน์žฅ์ 

Page 27: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 27 Converged Data Platform

๋‹จ์ผ ์žฅ์• ์  ์ œ๊ฑฐ๋กœ ์„œ๋น„์Šค ๋ ˆ๋ฒจ ํ’ˆ์งˆ ํ™•๋ณด

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œApache Hadoop์˜ ํ•„์ˆ˜์ ์ธ ์š”์†Œ์ธ ๋„ค์ž„๋…ธ๋“œ๋Š” ๋ถ„์‚ฐํ™˜๊ฒฝ์—์„œ ๋ฐ์ดํ„ฐ ๋ธ”๋Ÿญ์˜ ์œ„์น˜๊ณผ ๋ณ€๊ฒฝ์‚ฌํ•ญ์„ ๊ด€์žฅํ•˜๋Š” ํ•ต์‹ฌ ์š”์†Œ์ด๋‚˜ ๋‹จ์ผ

๊ตฌ์„ฑ์œผ๋กœ ์žฅ์• ์™€ ๋ณ‘๋ชฉ์˜ ์ฃผ์š” ์š”์†Œ. MapR์€ ๋„ค์ž„๋…ธ๋“œ๋ฅผ ์ œ๊ฑฐํ•œ ์•„ํ‚คํ…์ฒ˜๋กœ ์„œ๋น„์Šค ๋ ˆ๋ฒจ์˜ ํ’ˆ์งˆ ํ™•๋ณด๊ฐ€ ๊ฐ€๋Šฅํ•œ ์œ ์ผํ•œ ํ”Œ๋žซํผโ€

๊ธฐ์กด Hadoop File System MapR File System

๋„ค์ž„ ๋…ธ๋“œ

๋ฐ์ดํ„ฐ ๋…ธ๋“œ

๋„ค์ž„ ๋…ธ๋“œ(HA)

๋ฐ์ดํ„ฐ ๋…ธ๋“œ

๋„ค์ž„๋…ธ๋“œ ์žฅ์• ์‹œ ์ „์ฒด ํด๋Ÿฌ์Šค๊ฐ€ ์ค‘์ง€๋˜๋Š” ๋‹จ์ผ ์žฅ์• ์  ๋ฐœ์ƒ

๋Œ€๊ธฐ(Stand-by) ๋„ค์ž„๋…ธ๋“œ๋Š” ๋ถ„์‚ฐ์ฒ˜๋ฆฌ๊ฐ€ ์•„๋‹Œ ์žฅ์•  ๋Œ€๋น„

๋Œ€๋Ÿ‰์˜ ๋…ธ๋“œ๊ฐ€ ๊ตฌ์„ฑ๋œ ํด๋Ÿฌ์Šคํ„ฐ ํ™˜๊ฒฝ์—์„œ ๋„ค์ž„๋…ธ๋“œ๊ฐ€ ๋ถ€ํ•˜์‹œ

์ „์ฒด ํด๋Ÿฌ์Šคํ„ฐ์˜ ์„ฑ๋Šฅ์ด ์ €ํ•˜๋˜๋Š” ์น˜๋ช…์ ์ธ ๊ตฌ์กฐ

๋„ค์ž„๋…ธ๋“œ๋ฅผ ์ œ๊ฑฐํ•˜์—ฌ ๋‹จ์ผ ์žฅ์• ์ (SPOF) ์ œ๊ฑฐ

CLDB(Container Location Database) ๊ธฐ์ˆ ๊ธฐ๋ฐ˜ ์šด์˜

๋Œ€๋Ÿ‰์˜ ๋…ธ๋“œ๊ฐ€ ๊ตฌ์„ฑ๋œ ํด๋Ÿฌ์Šคํ„ฐ ํ™˜๊ฒฝ์—์„œ๋„ ์ปจํ…Œ์ด๋„ˆ ์œ„์น˜์™€

๋ณ€๊ฒฝ์‚ฌํ•ญ์„ ๋ถ„์‚ฐ์ฒ˜๋ฆฌ ํ•˜์—ฌ ์„ฑ๋Šฅ์— ์˜ํ–ฅ์„ ๋ฏธ์น˜์ง€ ์•Š๋Š” ๊ตฌ์กฐ

Page 28: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 28 Converged Data Platform

์‹ค์‹œ๊ฐ„ ๋น…๋ฐ์ดํ„ฐ์™€ ์„œ๋น„์Šค ์„ฑ๋Šฅ ํ’ˆ์งˆ

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œ๊ธˆ์œต์‚ฐ์—…์—์„œ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์„ ํ†ตํ•œ ์„ ์ œ์  ์˜์‚ฌ๊ฒฐ์ • ํ™˜๊ฒฝ์„ ๊ตฌํ˜„ํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ์ ์‹œ์„ฑ์„ ํ™•๋ณดํ•˜๋Š” ๊ฒƒ์ด ํ•ต์‹ฌ. MapR ํ”Œ๋žซํผ์€

๊ธฐ์กด Hadoop์˜ ์•„ํ‚คํ…์ฒ˜๋ฅผ ์™„์ „ํžˆ ํ˜์‹ ํ•˜์—ฌ ๊ธฐ์—…์ˆ˜์ค€์˜ ์ ์‹œ์„ฑ ํ™•๋ณด์™€ ๊ธฐ์กด ๊ฐœ๋ฐœ ํ™˜๊ฒฝ๊ณผ ์œ ์‚ฌํ•œ ํ™˜๊ฒฝ์„ ์ œ๊ณตโ€

๊ธฐ์กด Hadoop File System MapR File System

Disk

Linux File System

Java Virtual Machine

Hadoop File System (Java Code)

Disk

MapR File System (C Code)

๋‹ค์ค‘ ๊ณ„์ธต

์œผ๋กœ ์ธํ•œ

์„ฑ๋Šฅ ์ €ํ•˜

๊ณ„์ธต

๊ฐ„์†Œํ™”๋กœ

์„ฑ๋Šฅ ํ–ฅ์ƒ

POSIX

์ง€์›

HDFS

API

Direct

NFS

8 KB

I/O

Volume

๊ด€๋ฆฌ

HDFS

API

128 MB

I/O

Page 29: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 29 Converged Data Platform

๋น…๋ฐ์ดํ„ฐ ์ €์žฅ์— ์ตœ์ ํ™”๋œ MapR-FS์˜ ์‹œ์Šคํ…œ ๊ตฌ์กฐ

3. MapR Converged Data Platform ํŠน์žฅ์ 

8KB 8KB (Fixed)

10-30GB (Self-adjusting)

3 disks (default)

Physical/Virtual

Physical/Virtual

User-Defined

Sizes vary

Sizes vary

Volume

Container

Chunks Tablets Partitionlets

Blocks

Storage Pools

Disks

Data Node

MapR-FS Files MapR-DB Tables MapR Streams

Page 30: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 30 Converged Data Platform

๋น…๋ฐ์ดํ„ฐ ์ €์žฅ์— ์ตœ์ ํ™”๋œ MapR-FS์˜ ์‹œ์Šคํ…œ ๊ตฌ์กฐ

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œMapR-FS๋Š” Random Read/Write๋ฅผ ์œ„ํ•œ 8KB IO๋‹จ์œ„๋ฅผ ๊ฐ–์œผ๋ฉด์„œ๋„ ๋ฐ์ดํ„ฐ๋Š” ํšจ๊ณผ์ ์œผ๋กœ ๋ถ„๋ฐฐ๋˜์–ด ๋ณ‘๋ ฌ์ฒ˜๋ฆฌ๋ฅผ ์ง„ํ–‰ํ•˜๋ฉฐ GB ๋‹จ์œ„์˜

๋ณต์ œ๋ฅผ ํ†ตํ•ด ๋ฉ”ํƒ€๊ด€๋ฆฌ๋ฅผ ๋ณด๋‹ค ๊ฐ„์†Œํ•˜๊ฒŒ ๊ด€๋ฆฌํ•˜๋Š” ํ˜์‹ ์ ์ธ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ ์ž…๋‹ˆ๋‹ค.โ€

๊ธฐ์กด Hadoop File System MapR File System

Name Node CLDB

Block Containers

Chunks

Blocks 8

KB

๋ณต์ œ

๋ถ„์‚ฐ ๋ฉ”ํƒ€

๋ณ‘๋ ฌ ๋ถ„์‚ฐ

8KB IO

๋ณต์ œ ๋น„ํšจ์œจ ๋ถ„์‚ฐ 64MB < IO

์‹ฑ๊ธ€ ๋ฉ”ํƒ€

IO ์ฒ˜๋ฆฌ ๋‹จ์œ„, ์ €์žฅ ๋‹จ์œ„, ๋ณต์ œ ๋‹จ์œ„๊ฐ€ ๋ชจ๋‘ ๋™์ผํ•˜์—ฌ

๋น…๋ฐ์ดํ„ฐ๋ฅผ ํšจ๊ณผ์ ์œผ๋กœ ์ฒ˜๋ฆฌ๊ฐ€ ๋ถˆ๊ฐ€๋Šฅ

๋ฐ์ดํ„ฐ ์‚ฌ์ด์ฆˆ ์ฆ๊ฐ€๋กœ ๋Œ€๋Ÿ‰์˜ Block์ด ์ƒ์„ฑ๋˜๊ณ  ์ฒ˜๋ฆฌ

๋ ๋•Œ Name Node์˜ ๋ถ€ํ•˜๋กœ ์ธํ•œ ๊ธ‰๊ฒฉํ•œ ์„ฑ์ฆ์ €ํ•˜

์˜ค๋กœ์ง€ Read๋งŒ์„ ์œ„ํ•œ ๊ตฌ์กฐ

Page 31: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 31 Converged Data Platform

๋™์ผํ•œ ์‚ฌ์šฉ์ž ๊ฒฝํ—˜์„ ์œ„ํ•œ MapR NFS

3. MapR Converged Data Platform ํŠน์žฅ์ 

๋“œ๋ž˜๊ทธ&๋“œ๋กญ์„ ํ†ตํ•œ ํŒŒ์ผ์˜ ํ™œ์šฉ ๊ฐ€๋Šฅ

์ผ๋ฐ˜์ ์ธ ํŒŒ์ผ ๋ธŒ๋ผ์šฐ์ €๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ MapR-FS์˜ ๋ฐ์ดํ„ฐ๋ฅผ

์†์‰ฝ๊ฒŒ ํ™œ์šฉ

๋กœ๊ทธ์™€ ๊ฐ™์€ ํŒŒ์ผ์„ MapR-FS ์— ์ง์ ‘ Write

๊ฐœ๋ณ„ ์„œ๋ฒ„ ๋‚ด๋ถ€์— ๋กœ๊ทธ๋ฅผ ์ €์žฅํ•˜๊ณ  Hadoop ์œผ๋กœ ์ด๊ด€ํ•˜๋Š” ๋ฐฉ๋ฒ•์ด ์•„๋‹Œ

MapR-FS์— ๋ฐ”๋กœ Write

$ find . | grep log $ cp /mapr/cluster $ scp /mapr/cluster $ vi results.csv $ tail -f part-00000

์ปค์Šคํ„ฐ ๋งˆ์ด์ง• ์—†์ด ์–ดํ”Œ๋ฆฌ์ผ€์ด์…˜์—์„œ ํ™œ์šฉ

MapR-FS ๋Š” POSIX ๊ธฐ๋ฐ˜ ํŒŒ์ผ ์‹œ์Šคํ…œ์œผ๋กœ ์ผ๋ฐ˜์ ์ธ ๋ฆฌ๋ˆ…์Šค ์œ ํ‹ธ๋ฆฌํ‹ฐ๋ฅผ ๊ทธ๋Œ€๋กœ ํ™œ์šฉ ๊ฐ€๋Šฅ

ํ‘œ์ค€ OS ์œ ํ‹ธ๋ฆฌํ‹ฐ ์‚ฌ์šฉ

Read/Write ํŒŒ์ผ ์‹œ์Šคํ…œ์„ ํ†ตํ•ด Hadoop ์ „์šฉ ์ปค๋„ฅํ„ฐ๊ฐ€ ์—†๋Š” ์–ดํ”Œ๋ฆฌ์ผ€์ด์…˜๋„ ํ™œ์šฉ

โ€œMapR์˜ NFS ๊ธฐ๋Šฅ์€ IT ๊ด€๋ฆฌ์ž, ๊ฐœ๋ฐœ์ž, ๋ฐ์ดํ„ฐ ์‚ฌ์ด์–ธํ‹ฐ์ŠคํŠธ์ด ํŠน๋ณ„ํ•œ Hadoop ์ปค๋งจ๋“œ๋ฅผ ์ตํžˆ์ง€ ์•Š๊ณ ๋„ ๊ธฐ์กด์˜ ํŒŒ์ผ์‹œ์Šคํ…œ๊ณผ

๋™์ผํ•œ ๋ฐฉ๋ฒ•์œผ๋กœ MapR-FS๋ฅผ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ์ง€์›ํ•˜๋Š” ๊ธฐ๋Šฅ ์ž…๋‹ˆ๋‹ค. ์ด ๊ธฐ๋Šฅ์„ ํ†ตํ•˜์—ฌ ๋ ˆ๊ฑฐ์‹œ ์†”๋ฃจ์…˜๊ณผ๋„ ์—ฐ๋™์ด ๊ฐ€๋Šฅ ํ•ฉ๋‹ˆ๋‹คโ€

Page 32: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 32 Converged Data Platform

๋ฐ์ดํ„ฐ ์••์ถ•

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œMapR File System์€ ๊ธฐ๋ณธ์ ์œผ๋กœ ์••์ถ• ๊ธฐ๋Šฅ์ด ์ ์šฉ. Hadoop ํŒŒ์ผ ์‹œ์Šคํ…œ ์••์ถ• ๊ธฐ๋Šฅ ์‚ฌ์šฉ์‹œ ๊ณผ๋„ํ•œ ์„ฑ๋Šฅ์ €ํ•˜๊ฐ€ ๋ฐœ์ƒํ•˜๋Š” ๊ฒฝ์Ÿ์‚ฌ

์ œํ’ˆ๊ณผ ๋‹ฌ๋ฆฌ MapR ํŒŒ์ผ ์‹œ์Šคํ…œ์€ IO ๋ ˆ์ด์–ด์˜ ๊ฐ„์†Œํ™”๋ฅผ ํ†ตํ•ด ์••์ถ• ๊ธฐ๋Šฅ์„ ์‚ฌ์šฉํ•˜์—ฌ๋„ ์ตœ์†Œํ•œ์˜ ์˜ค๋ฒ„ํ—ค๋“œ๋งŒ ๋ฐœ์ƒโ€

์••์ถ• ํƒ€์ž… ์••์ถ•๋ฅ  ์••์ถ• ์†๋„ ์••์ถ•ํ•ด์ œ ์†๋„

LZ4 2.084 330 MB/s 915 MB/s

LZF 2.076 197 MB/s 465 MB/s

ZLIB 3.095 14 MB/s 210 MB/s

Page 33: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 33 Converged Data Platform

์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ์ˆ˜์ค€์˜ ๊ฐ€์šฉ์„ฑ ํ™•๋ณด

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œ๊ธˆ์œต์—…์€ ์—…์˜ ํŠน์„ฑ์ƒ DW ๋ฐ์ดํ„ฐ๋ฅผ ํฌํ•จํ•˜์—ฌ ๋ชจ๋“  ๋ฐ์ดํ„ฐ๊ฐ€ ๋ฏธ์…˜ํฌ๋ฆฌํ‹ฐ์ปฌ ๋ฐ์ดํ„ฐ๋กœ ๋ถ„๋ฅ˜๋˜๋Š” ์‚ฐ์—…์œผ๋กœ ๋น…๋ฐ์ดํ„ฐ ์—ญ์‹œ ๋ฐ์ดํ„ฐ์˜

์•ˆ์ „์„ฑ ํ™•๋ณด๊ฐ€ ๋งค์šฐ ์ค‘์š”. MapR์€ ์ด๋Ÿฌํ•œ ๊ธฐ์—…์š”๊ฑด์„ ์ œํ’ˆ์— ๋ฐ˜์˜ํ•˜์—ฌ ์ œํ’ˆ ์—”์ง„๋ ˆ๋ฒจ์—์„œ ๋ฐฑ์—… ๋ฐ HA/DR ํ™˜๊ฒฝ์„ ๊ตฌ์„ฑโ€

๊ธฐ์กด Hadoop File System MapR File System

? Active

Active

ํƒœ์ƒ์ ์œผ๋กœ HA/DR ๊ตฌ์„ฑ์ด ๊ณ ๋ ค๋˜์ง€ ์•Š์€ ์•„ํ‚คํ…์ฒ˜

๋‹จ์ˆœ ํŒŒ์ผ ๋ณต์ œ ์ˆ˜์ค€์˜ DR ์„ผํ„ฐ ๊ตฌ์„ฑ์œผ๋กœ ๋ฐ์ดํ„ฐ ์œ ์‹ค ๊ฐ€๋Šฅ

Active โ€“ Standby ์ˆ˜์ค€์˜ DR ์„ผํ„ฐ ๊ตฌ์„ฑ

์Šค๋ƒ…์ƒท / ๋ฏธ๋Ÿฌ๋ง ๊ธฐ๋Šฅ์„ ํ†ตํ•œ ๋‚ด/์™ธ๋ถ€๋กœ์˜ ๋ฐ์ดํ„ฐ ๋ณดํ˜ธ

์••์ถ• ๊ธฐ๋Šฅ์„ ํ†ตํ•œ ์ €๋น„์šฉ ๊ณ ์„ฑ๋Šฅ DR ์„ผํ„ฐ ๊ตฌ์ถ• ๊ฐ€๋Šฅ

Active โ€“ Active HA/DR ์„ผํ„ฐ ๊ตฌ์„ฑ์œผ๋กœ ํด๋Ÿฌ์Šคํ„ฐ ํšจ์œจ ๊ทน๋Œ€ํ™”

Sanpshot

Sanpshot

Page 34: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 34 Converged Data Platform

์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ์ˆ˜์ค€์˜ ๋ณด์•ˆ ๊ธฐ๋Šฅ ์ œ๊ณต

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œ๊ธฐ์—…์˜ ๋ณด์•ˆ ์š”๊ฑด์„ ์ถฉ์กฑํ•˜๊ธฐ ์œ„ํ•ด MapR์€ ๊ถŒํ•œ, ์ธ์ฆ, ์•”ํ˜ธํ™”, ๊ฐ์‚ฌ ๋ณด์•ˆ ๊ธฐ๋Šฅ์„ ์ œ๊ณต. ํด๋Ÿฌ์Šคํ„ฐ์—์„œ ํŒŒ์ผ ๋‹จ์œ„๊นŒ์ง€ ๋ฏธ์„ธํ•œ

๋ณด์•ˆ์„ค์ •์œผ๋กœ ๊ธฐ์—… ๋‚ด/์™ธ๋ถ€์˜ ํ”Œ๋žซํผ ์‚ฌ์šฉ์ž๋“ค์— ๋Œ€ํ•œ ์•ˆ์ „ํ•œ ๋ฐ์ดํ„ฐ ํ™œ์šฉ ํ™˜๊ฒฝ์„ ์ œ๊ณตโ€

โ€ข Access Control Lists (ACLs) ๊ด€๋ฆฌ

โ€ข ๋ณผ๋ฅจ ๋‹จ์œ„ ๊ถŒํ•œ ์ œ์–ด

โ€ข MapR Table ์ ‘๊ทผ ๊ถŒํ•œ ์ œ์–ด

โ€ข ํ‘œ์ค€ UNIX ํŒŒ์ผ ์‹œ์Šคํ…œ ํผ๋ฏธ์…˜ ์ง€์›

Authorization

โ€ข ๋ชจ๋“  ์ด๋ฒคํŠธ JSON ๋กœ๊ทธํŒŒ์ผ๋กœ ์ €์žฅ

โ€ข ๋ฐ์ดํ„ฐ ์•ก์„ธ์Šค ๋ฐ ๊ด€๋ฆฌ ํฌํ•จ

โ€ข SQL๊ณผ ํ‘œ์ค€ BIํˆด์„ ํ†ตํ•œ ๊ฐ์‹œ๋กœ๊ทธ ์งˆ์˜

๋ฐ ์ปค์Šคํ…€ ๋ฆฌํฌํŠธ ์ƒ์„ฑ

Auditing Authentication

โ€ข MapR Ticket ์ ์šฉ

โ€ข UserID & Password (PAM, LDAP)

โ€ข Kerberos ์—ฐ๋™

Encryption

โ€ข Wire-level ๋ฐ์ดํ„ฐ ์•”ํ˜ธํ™”

โ€ข NSA Suite B ์•”ํ˜ธํ™” (AES-256 ๋ฐ SHA-256)

๋ฅผ ํ™œ์šฉํ•˜์—ฌ ํ‘œ์ค€ ๊ธฐ๋ฐ˜ ๊ธฐ๋ณธ ์ธ์ฆ์„ ์ง€์›

โ€ข 3rd Party ์†”๋ฃจ์…˜์—ฐ๊ณ„๋ฅผ ํ†ตํ•œ ํŒŒ์ผ/์ปฌ๋Ÿผ ๋ ˆ๋ฒจ

โ€ข ์•”ํ˜ธํ™”

Page 35: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 35 Converged Data Platform

NoSQL ํด๋Ÿฌ์Šคํ„ฐ

Impala ํด๋Ÿฌ์Šคํ„ฐ

์ŠคํŠธ๋ฆฌ๋ฐ ํด๋Ÿฌ์Šคํ„ฐ

MapR Converged Data Platform ํ†ตํ•ฉ

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œMapR Converged Data Platform์€ ๊ธฐ์—…์š”๊ฑด์ด ๋ฐ˜์˜๋˜์ง€ ์•Š์€ ๋‹จ์ˆœํ•œ ์˜คํ”ˆ์†Œ์Šค ๊ธฐ๋ฐ˜์˜ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์•„๋‹Œ ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ

ํ™˜๊ฒฝ์—์„œ ํ•„์š”๋กœ ๋˜๋Š” ์•ˆ์ „์„ฑ, ๋ณด์•ˆ, ์„ฑ๋Šฅ์„ ๊ฐ•ํ™”ํ•œ ์—…๊ณ„ ์œ ์ผ์˜ ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ๋Œ€์‘์ด ๊ฐ€๋Šฅํ•œ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผโ€

MapR File System

์–€(YARN) ํ”„๋ ˆ์ž„์›Œํฌ

์˜คํ”ˆ์†Œ์Šค ์—์ฝ” ์‹œ์Šคํ…œ

Page 36: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 36 Converged Data Platform

MapR Streams ๊ฐœ์š”

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œ์ŠคํŠธ๋ฆผ ๋ฐ์ดํ„ฐ๋Š” ์•„์ง ์ €์žฅ๋˜์ง€๋Š” ์•Š์•˜์œผ๋‚˜ ์œ ์ž…์ด ๋˜๊ณ  ์žˆ๋Š” ๋ฐ์ดํ„ฐ๋กœ ๋น…๋ฐ์ดํ„ฐ ํ™˜๊ฒฝ์—์„œ ํ˜„์žฌ ๋ฐœ์ƒ๋˜๊ณ  ์žˆ๋Š” ์ƒํ™ฉ์ด๋‚˜ ์ƒํƒœ๋ฅผ

์‹ค์‹œ๊ฐ„์œผ๋กœ ํ™œ์šฉํ•˜๊ณ ์ž ํ•˜๋Š” ์š”๊ฑด์ด ์ฆ๊ฐ€. ํ•˜์ง€๋งŒ ์˜คํ”ˆ์†Œ์Šค Kafka ๊ธฐ๋ฐ˜์˜ ์ŠคํŠธ๋ฆผ ์ฒ˜๋ฆฌ๋Š” ๊ตฌ์กฐ์  ํ•œ๊ณ„๋กœ ๋งŽ์€ ์ด์Šˆ๋ฅผ ๋ฐœ์ƒโ€

๊ธฐ์กด Kafka ๊ตฌ๋™ ๋ฐฉ์‹ MapR Streams ๊ตฌ๋™ ๋ฐฉ์‹

Disk

Linux File System

Java Virtual Machine

Hadoop File System (Java Code)

Disk

MapR File System (C Code)

JVM

Kafka (Java Code)

MapR Streams (C Code)

๊ณ„์ธต

๊ฐ„์†Œํ™”๋กœ

์„ฑ๋Šฅ ํ–ฅ์ƒ

Event Processing

Local File System

Event Processing

Page 37: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 37 Converged Data Platform

์ŠคํŠธ๋ฆผ ๋ฐ์ดํ„ฐ์˜ ์ ์‹œ์„ฑ ํ™•๋ณด

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œ๊ธˆ์œต์—…๊ณผ๊ฐ™์€ ์„œ๋น„์Šค ์ƒํ’ˆ์„ฑ ์ œํ’ˆ๋“ค์€ ์˜คํผ๋ง์˜ ์ ์‹œ์„ฑ์ด ๋งค์šฐ ์ค‘์š”ํ•œ ์š”์†Œ๋กœ ์ž‘์šฉ ํ•ฉ๋‹ˆ๋‹ค. ํŠนํžˆ ์‹œ์žฅ ๋˜๋Š” ๊ฐœ์ธ์˜ ์ƒํ™ฉ์„ ์‹ค์‹œ๊ฐ„์œผ๋กœ

ํŒŒ์•…ํ•  ์ˆ˜ ์žˆ์–ด์•ผ ํ•ฉ๋‹ˆ๋‹ค. SNS, IoT, Mobile ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•œ ์œ ์‹ค์—†๋Š” ์•ˆ์ •์  ์ŠคํŠธ๋ฆผ ์ฒ˜๋ฆฌ์™€ ๊ณ ์† ์ฒ˜๋ฆฌ๋กœ ์ ์‹œ์„ฑ ํ™•๋ณด๊ฐ€ ๊ฐ€๋Šฅโ€

Producers

Consumers

Producers & Consumers

Producers & Consumers

๊ธฐ์กด Kafka ๊ตฌ๋™ ๋ฐฉ์‹ MapR Streams ๊ตฌ๋™ ๋ฐฉ์‹

์™ธ๋ถ€ ๋ชจ๋“ˆ์ธ MirrorMaker ์— ์˜ํ•œ ๋ณต์ œ ๊ตฌ์กฐ

๋ฉ”์‹œ์ง€ ์˜คํ”„์…‹๋“ค์€ ์ƒˆ ํด๋Ÿฌ์Šคํ„ฐ์— ์ €์žฅ

๋ฉ”์‹œ์ง€ ๋ฃจํ•‘์„ ํ”ผํ•˜๊ธฐ ์œ„ํ•œ ๋ณต์žกํ•œ 2-stage ํด๋Ÿฌ์Šคํ„ฐ

์ฝ”์–ด ํ”Œ๋žซํผ์— ์˜ํ•œ ๋ณต์ œ ๊ตฌ์กฐ

๋ฉ”ํƒ€๋ฐ์ดํ„ฐ/์˜คํ”„์…‹๋“ค๊ณผ ํ•จ๊ป˜ ๋ฉ”์‹œ์ง€ ๋ณต์ œ

๋ฉ”์‹œ์ง€ ๋ฃจํ•‘ ๋ฐฉ์ง€ ๊ธฐ์ˆ  ๋‚ด์žฅ

Page 38: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 38 Converged Data Platform

์ŠคํŠธ๋ฆผ ๋ฐ์ดํ„ฐ์˜ ๋ถ„์‚ฐ ์ฒ˜๋ฆฌ์™€ ๊ฐ€์šฉ์„ฑ

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œ๊ธฐ์กด Kafka ์ŠคํŠธ๋ฆผ ํด๋Ÿฌ์Šคํ„ฐ๋Š” Hadoop ์™ธ๋ถ€์— ์œ„์น˜ํ•˜์—ฌ ๋„คํŠธ์›Œํฌ ํ†ต์‹ ์— ์˜ํ•ด ๋ฉ”์‹œ์ง• ์ฒ˜๋ฆฌ๋ฅผ ์ˆ˜ํ–‰ ํ•ฉ๋‹ˆ๋‹ค. ๋งŒ์•ฝ ์ŠคํŠธ๋ฆผ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ

CEP์™€ ๊ฐ™์€ ์ŠคํŠธ๋ฆผ ๋ถ„์„ ์†”๋ฃจ์…˜์˜ ์žฅ์• ๊ฐ€ ๋ฐœ์ƒ์‹œ Kafka ์„œ๋ฒ„ ๋‚ด๋ถ€ ํŒŒ์ผ ๋ฒ„ํผ์˜ ์‚ฌ์ด์ง• ํ•œ๊ณ„๋กœ ๋ฐ์ดํ„ฐ์˜ ์œ ์‹ค ๊ฐ€๋Šฅ์„ฑ์ด ๋ฐœ์ƒํ•ฉ๋‹ˆ๋‹ค.โ€

๊ธฐ์กด Kafka ๊ตฌ๋™ ๋ฐฉ์‹ MapR Streams ๊ตฌ๋™ ๋ฐฉ์‹

ํŒŒํ‹ฐ์…˜์„ ๋‹จ์ผ ๋””์Šคํฌ์— ํ• ๋‹น

ํŒŒํ‹ฐ์…˜์˜ ์œ ์ž…๋ฅ , ๋ฆฌํ…์…˜ ์ •์ฑ…๋“ฑ์„ ๊ณ ๋ คํ•˜์ง€ ์•Š๊ณ  ํ• ๋‹น

์ˆ˜์ž‘์—…์— ์˜ํ•œ ํŒŒํ‹ฐ์…˜ ์ด๋™

ํŒŒํ‹ฐ์…˜์„ MapR ํŒŒ์ผ์‹œ์Šคํ…œ์— ๋ถ„ํ• /๋ถ„์‚ฐํ•˜์—ฌ ํ• ๋‹น

๋ถ„ํ• ๋œ ํŒŒํ‹ฐ์…˜์„ ํ™œ์šฉ์œจ๊ธฐ๋ฐ˜์œผ๋กœ ํ• ๋‹น

์ฃผ๊ธฐ์ ์œผ๋กœ ๋ถ„ํ• ๋œ ํŒŒํ‹ฐ์…˜์˜ ๋ฆฌ๋ฐธ๋Ÿฐ์‹ฑ ์ˆ˜ํ–‰

Page 39: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 39 Converged Data Platform

์˜คํ”ˆ์†Œ์Šค ์—์ฝ” ์‹œ์Šคํ…œ

NoSQL ํด๋Ÿฌ์Šคํ„ฐ

Impala ํด๋Ÿฌ์Šคํ„ฐ

MapR Converged Data Platform ํ†ตํ•ฉ

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œMapR Converged Data Platform์€ ๊ธฐ์—…์š”๊ฑด์ด ๋ฐ˜์˜๋˜์ง€ ์•Š์€ ๋‹จ์ˆœํ•œ ์˜คํ”ˆ์†Œ์Šค ๊ธฐ๋ฐ˜์˜ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์•„๋‹Œ ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ

ํ™˜๊ฒฝ์—์„œ ํ•„์š”๋กœ ๋˜๋Š” ์•ˆ์ „์„ฑ, ๋ณด์•ˆ, ์„ฑ๋Šฅ์„ ๊ฐ•ํ™”ํ•œ ์—…๊ณ„ ์œ ์ผ์˜ ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ๋Œ€์‘์ด ๊ฐ€๋Šฅํ•œ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผโ€

MapR File System | MapR Streams

์–€(YARN) ํ”„๋ ˆ์ž„์›Œํฌ

Page 40: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 40 Converged Data Platform

MapR-DB ๊ฐœ์š”

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œHbase๋Š” Hadoop ํ™˜๊ฒฝ์—์„œ ๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ๋ฅผ ์ €์žฅํ•˜๋Š” NoSQL ์†”๋ฃจ์…˜. ํ•˜์ง€๋งŒ Hbase ๊ตฌ์กฐ์  ํ•œ๊ณ„๋กœ ์ธํ•ด ์„ฑ๋Šฅ ๋ฐ ์•ˆ์ „์„ฑ ํ™•๋ณด๊ฐ€ ๋งค์šฐ

ํฐ ์ด์Šˆ. MapR-DB๋Š” ์ด๋Ÿฌํ•œ ์ œ์•ฝ ์‚ฌํ•ญ์„ ์ œ๊ฑฐํ•˜์—ฌ MapR FS์™€ DB๋ฅผ ๊ฒฐํ•ฉ์‹œ์ผœ ๊ณ ์†์œผ๋กœ Schema less ๋ฐ์ดํ„ฐ๋ฅผ ์ฒ˜๋ฆฌโ€

Disk

Linux File System

Java Virtual Machine

Hadoop File System (Java Code)

Disk

MapR File System (C Code)

Java Virtual Machine

Hbase (Java Code) MapR-DB (C Code)

๋‹ค์ค‘ ๊ณ„์ธต

์œผ๋กœ ์ธํ•œ

์„ฑ๋Šฅ ์ €ํ•˜

๊ณ„์ธต

๊ฐ„์†Œํ™”๋กœ

์„ฑ๋Šฅ ํ–ฅ์ƒ

๊ธฐ์กด Hbase ๊ตฌ๋™ ๋ฐฉ์‹ MapR-DB ๊ตฌ๋™ ๋ฐฉ์‹

Page 41: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 41 Converged Data Platform

๊ธฐ์—…์ˆ˜์ค€์˜ ๋ฐ˜์ •ํ˜• NoSQL ๋ฐ์ดํ„ฐ๋ฅผ ์ฒ˜๋ฆฌํ•˜๋Š” MapR-DB

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œ๊ณ ๊ฐ ๋Œ€์‘ ํ›„ ์ƒ์„ฑ๋˜๋Š” ๋ฐ์ดํ„ฐ๋ฅผ ๊ธฐ์กด ์ •ํ˜• DBMS๋กœ ์ฒ˜๋ฆฌํ•˜๋Š” ๊ฒƒ์€ ์Šคํ‚ค๋งˆ ๊ตฌ์„ฑ ๋ฐ ๊ด€๋ฆฌ์— ๋Œ€ํ•œ ๋ถ€๋‹ด์ด ๊ฐ€์ค‘. ์›น์ด๋‚˜ ๋ชจ๋ฐ”์ผ์•ฑ, ARS,

VoC๋“ฑ์œผ๋กœ ์ƒ์„ฑ๋˜๋Š” ๋ฐ์ดํ„ฐ๋“ค์— ๋Œ€ํ•ด ๊ณ ๊ฐ์ค‘์‹ฌ์˜ ๋ฐ์ดํ„ฐ๋ฅผ ๊ตฌ์„ฑํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ์„ฑ๋Šฅ๊ณผ ์•ˆ์ „์„ฑ์ด ํ™•๋ณด๋œ NoSQL DB๊ฐ€ ํ•„์š”.โ€

๊ธฐ์กด Hbase ๊ตฌ๋™ ๋ฐฉ์‹ MapR-DB ๊ตฌ๋™ ๋ฐฉ์‹

๋งˆ์Šคํ„ฐ ๋…ธ๋“œ๊ฐ€ ๋‹จ์ผ์žฅ์• ์ (SPOF)

์—ฐ์‚ฐ์˜์—ญ์ด ๋…๋ฆฝ๋œ ์˜์—ญ์—์„œ๋งŒ ์‚ฌ์šฉ

๋ฐ”์ด๋„ˆ๋ฆฌ ํ…Œ์ด๋ธ”๋งŒ ์ œ๊ณต, ๋‹คํ๋จผํŠธ NoSQL์€ ๋ณ„๋„๋กœ ๊ตฌ์„ฑ

HA/DR ์œ„ํ•œ ๊ธฐ๋Šฅ ๋ฏธ์ œ๊ณต

๋งˆ์Šคํ„ฐ ๋…ธ๋“œ๋ฅผ ์ œ๊ฑฐํ•˜์—ฌ ๋‹จ์ผ ์žฅ์• ์ (SPOF)์„ ์ œ๊ฑฐ

MapR ํด๋Ÿฌ์Šคํ„ฐ ์ „์ฒด๋ฅผ ์—ฐ์‚ฐ ์˜์—ญ์œผ๋กœ ํ†ตํ•ฉํ•˜์—ฌ ์‚ฌ์šฉ

๋ฐ”์ด๋„ˆ๋ฆฌ ํ…Œ์ด๋ธ” ๋ฟ๋งŒ ์•„๋‹Œ JSON ๋‹คํ๋จผํŠธ ํ…Œ์ด๋ธ” ์ œ๊ณต

HA/DR์„ ์œ„ํ•œ Table ๋ ˆ๋ฒจ ๋™๊ธฐํ™” ๊ธฐ๋Šฅ ์ œ๊ณต

๋งˆ์Šคํ„ฐ

๋…ธ๋“œ

Hbase ํด๋Ÿฌ์Šคํ„ฐ

๋งˆ์Šคํ„ฐ

๋…ธ๋“œ(HA)

MapR ํด๋Ÿฌ์Šคํ„ฐ MongoDB ํด๋Ÿฌ์Šคํ„ฐ

Page 42: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 42 Converged Data Platform

Apache Drill ์†Œ๊ฐœ

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œHadoopํ™˜๊ฒฝ์˜ ๋งŽ์€ SQL on Hadoop ๊ธฐ์ˆ ๋“ค์ด ์กด์žฌ. Apache Drill์€ MapR์ด ์ฃผ๋„ํ•˜๋Š” ํ”„๋กœ์ ํŠธ๋กœ ์ด๊ธฐ์ข… Hadoop ์—์ฝ”

์‹œ์Šคํ…œ๋“ค์— ๋Œ€ํ•ด ๋™์ผํ•œ ANSI SQL์„ ๊ธฐ์ค€์œผ๋กœ ๊ฐœ๋ฐœ ํ™˜๊ฒฝ์„ ํ†ตํ•ฉํ•˜๋Š” ํ”„๋กœ์ ํŠธ๋กœ ๊ฐ€์žฅ ์‰ฝ๊ฒŒ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋Š” SQL on Hadoop๊ธฐ์ˆ โ€

๊ธฐ์กด SQL on Hadoop ๊ธฐ์ˆ  Apache Drill

Apache Drill ( ANSI SQL )

HDFS/MapR FS Hbase/MapR-DB Hive HDFS Hbase Hive

Phoenix SQL HiveQL

๊ฐ๊ฐ์˜ ์—์ฝ” ์‹œ์Šคํ…œ๋งˆ๋‹ค ๋…๋ฆฝ์ ์ธ SQL on Hadoop ์—”์ง„

ANSI SQL์ด ์•„๋‹Œ ๋…๋ฆฝ์ ์ธ SQL๊ธฐ๋ฐ˜์˜ ์–ธ์–ด๋ฅผ ์‚ฌ์šฉ

HDFS์˜ ํŒŒ์ผ์„ ์Šคํ‚ค๋งˆ ๊ตฌ์„ฑ์—†์ด SQL ์ง์ ‘์ ‘๊ทผ์ด ๋ถˆ๊ฐ€

Drillbit ์ด๋ผ๋Š” ๋™์ผํ•œ SQL on Hadoop ์—”์ง„ ์‚ฌ์šฉ

ANSI 92 ํ‘œ์ค€ SQL์„ ๋ชจ๋“  ์†Œ์Šค์— ๋Œ€ํ•ด ๋™์ผํ•˜๊ฒŒ ์‚ฌ์šฉ

MapR FS ํŒŒ์ผ์„ ์Šคํ‚ค๋งˆ ๊ตฌ์„ฑ์—†์ด SQL ์ง์ ‘์ ‘๊ทผ ๊ฐ€๋Šฅ

Impala SQL

Page 43: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 43 Converged Data Platform

Apache Drill ์†Œ๊ฐœ

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œApache Drill์€ ์ƒ๋‹น ๋ถ€๋ถ„์˜ ์ปค๋ฏธํ„ฐ๋“ค์ด MapR ์†Œ์†์˜ ๊ฐœ๋ฐœ์ž๋“ค๋กœ ๊ตฌ์„ฑ๋˜์–ด ์ฝ”๋“œ๋ฅผ ๊ธฐ์—ฌํ•˜๊ณ  ์žˆ์œผ๋ฉฐ MapR-FS, HDFS, Hive, MapR-

DB, Hbase ๋“ฑ ๋ฐ์ดํ„ฐ ์†Œ์Šค์™€ ์Šคํ‚ค๋งˆ์— ์ž์œ ๋กœ์šด SQL on Hadoop ํ”„๋กœ์ ํŠธ. ์ž์ฒด ๊ฐœ๋ฐœ Web UI ๋˜๋Š” Zeppelin ๋“ฑ๊ณผ ํ†ตํ•ฉํ•˜์—ฌ ํ™œ์šฉโ€

Page 44: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 44 Converged Data Platform

Apache Drill ์†Œ๊ฐœ

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œApache Drill์€ MapReduce ์—”์ง„์ด ์•„๋‹Œ Drillbit์ด๋ผ๋Š” ์ „์šฉ ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ์—”์ง„์„ ๋ณด์œ . ์‚ฌ์šฉ์ž๋Š” Zookeeper๋กœ๋ถ€ํ„ฐ ์งˆ์˜๋ฅผ ์ „๋‹ฌํ• 

Drillbit์—”์ง„์„ ํ• ๋‹น๋ฐ›๊ณ  ์งˆ์˜๋ฅผ ์ˆ˜ํ–‰ ํ•˜๋ฉฐ ๊ฐ ์—”์ง„์€ ์บ์‹œ๋œ ๋ฉ”ํƒ€ ๋ฐ์ดํ„ฐ๋ฅผ ์กฐํšŒํ•˜์—ฌ ๋ฐ์ดํ„ฐ ์ง€์—ญ์„ฑ์„ ๊ธฐ๋ฐ˜์œผ๋กœ ๋ฐ์ดํ„ฐ๋ฅผ ํƒ์ƒ‰โ€

Drillbit Engine

Zookeeper

โ€ฆ

Select device, cust_id, order_id

FROM clicks.json t, hive.orders o

WHERE t.cust_id=o.cust_id

MapR File System

Yarn Framework

Drillbit Engine

Drillbit Engine

Drillbit Engine

Drillbit Engine

Drillbit Engine

Page 45: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 45 Converged Data Platform

Apache Drill ์†Œ๊ฐœ

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œ์งˆ์˜๋ฅผ ํŒŒ์‹ฑํ•˜๊ธฐ ์œ„ํ•œ Drillbit ์—”์ง„์ด ์„ ์ •๋˜๋ฉด ์ฟผ๋ฆฌ๋ฅผ Logical Plan๊ณผ Physical Plan์œผ๋กœ ๊ฐ๊ธฐ ์ˆ˜ํ–‰๊ณ„ํš์„ ์ƒ์„ฑ. ์ด๋•Œ ๋ฐ์ดํ„ฐ์˜

์ง€์—ญ์„ฑ์„ ์ตœ๋Œ€ํ•œ ํ™œ์šฉํ•˜๋˜๊ธฐ ์œ„ํ•œ ์ •๋ณด๋ฅผ ์ˆ˜์ง‘ํ•˜๊ณ  ๊ฐ ๊ณ„ํš์„ ํŠธ๋ฆฌ๊ตฌ์กฐ๋กœ ์กฐ๊ฐ๋‚ด์–ด ๊ฐ ๋ถ„์‚ฐ ๋…ธ๋“œ์˜ Drill ์—”์ง„์— ์ „๋‹ฌโ€

โ€ฆ Rule/Cost ๊ธฐ๋ฐ˜ ์ตœ์ ํ™”

Protobuf ๊ธฐ๋ฐ˜ RPC

๋ฐ์ดํ„ฐ ์ง€์—ญ์„ฑ

์งˆ์˜ ๋ถ„์‚ฐ

Drillbit Engine

RPC Endpoint

SQL Parser

Logical Plan

Optimizer

Physical Plan

Storage Engine Interface

Execution

Distributed Cache

Drillbit Engine

RPC Endpoint

SQL Parser

Logical Plan

Optimizer

Physical Plan

Storage Engine Interface

Execution

Distributed Cache

Drillbit Engine

RPC Endpoint

SQL Parser

Logical Plan

Optimizer

Physical Plan

Storage Engine Interface

Execution

Distributed Cache

Page 46: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 46 Converged Data Platform

์˜คํ”ˆ์†Œ์Šค ์—์ฝ” ์‹œ์Šคํ…œ

MapR Converged Data Platform ํ†ตํ•ฉ

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œMapR Converged Data Platform์€ ๊ธฐ์—…์š”๊ฑด์ด ๋ฐ˜์˜๋˜์ง€ ์•Š์€ ๋‹จ์ˆœํ•œ ์˜คํ”ˆ์†Œ์Šค ๊ธฐ๋ฐ˜์˜ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์ด ์•„๋‹Œ ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ

ํ™˜๊ฒฝ์—์„œ ํ•„์š”๋กœ ๋˜๋Š” ์•ˆ์ „์„ฑ, ๋ณด์•ˆ, ์„ฑ๋Šฅ์„ ๊ฐ•ํ™”ํ•œ ์—…๊ณ„ ์œ ์ผ์˜ ์—”ํ„ฐํ”„๋ผ์ด์ฆˆ ๋Œ€์‘์ด ๊ฐ€๋Šฅํ•œ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผโ€

MapR File System | MapR Streams | MapR-DB

์–€(YARN) ํ”„๋ ˆ์ž„์›Œํฌ

Page 47: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 47 Converged Data Platform

MapR Converged Data Platform์˜ ํŠน์žฅ์  ์š”์•ฝ

3. MapR Converged Data Platform ํŠน์žฅ์ 

โ€œMapR Converged Data Platform์€ ๊ธฐ์—…์ด ์ง๋ฉดํ•˜๊ณ  ์žˆ๋Š” ๋น…๋ฐ์ดํ„ฐ ๋ฌธ์ œ๋ฅผ ํ•ด์†Œํ•œ ์—…๊ณ„ ์œ ์ผ์˜ ๊ธฐ์—…์šฉ ๋น…๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ ์ž…๋‹ˆ๋‹คโ€

์˜คํ”ˆ์†Œ์Šค ๊ธฐ๋ฐ˜ Hadoop Platform MapR Converged Data Platform

๋„ค์ž„๋…ธ๋“œ ๋ฐ ๋งˆ์Šคํ„ฐ๋…ธ๋“œ์˜ ๋‹จ์ผ ์žฅ์• ์  ๋ฌธ์ œ

JVM๋ฐ OS File System์˜ ๋ณต์žกํ•œ ๋‹จ๊ณ„๋กœ ์ธํ•œ ์„ฑ๋Šฅ์ €ํ•˜

Hadoop ์ „๋ฌธ๊ฐ€๋งŒ์ด ์‹œ์Šคํ…œ ์šด์˜ ๋ฐ ๊ฐœ๋ฐœ์ด ๊ฐ€๋Šฅ

HA/DR ๋ฐ ๋ณด์•ˆ๊ธฐ๋Šฅ์˜ ์ทจ์•ฝ์œผ๋กœ ๊ธฐ์—…์ˆ˜์ค€ ๋Œ€์‘ ๋ถˆ๊ฐ€

๋‹ค๋Ÿ‰์˜ ๊ด€๋ฆฌ๋…ธ๋“œ ๊ตฌ์„ฑ์ด ํ•„์š”๋กœ ๋˜์–ด TCO ์ฆ๊ฐ€

๋„ค์ž„๋…ธ๋“œ ๋ฐ ๋งˆ์Šคํ„ฐ๋…ธ๋“œ๋ฅผ ์ œ๊ฑฐํ•˜์—ฌ ๋‹จ์ผ ์žฅ์• ์  ์ œ๊ฑฐ

JVM ๋ฐ OS File System ์˜์—ญ์„ ์ œ๊ฑฐํ•˜์—ฌ ์„ฑ๋Šฅ ํ–ฅ์ƒ

POSIX, NFS ๊ธฐ๋Šฅ ๋“ฑ์œผ๋กœ ๊ธฐ์กด์˜ ์ธ๋ ฅ์œผ๋กœ ์šด์˜ ๋ฐ ๊ฐœ๋ฐœ ๊ฐ€๋Šฅ

์ฝ”์–ด๋ ˆ๋ฒจ๋กœ HA/DR ๋ฐ ๋ณด์•ˆ๊ธฐ๋Šฅ ์ œ๊ณต์œผ๋กœ ๊ธฐ์—…์ˆ˜์ค€ ๋Œ€์‘

๊ฐ„์†Œํ™”๋œ ํด๋Ÿฌ์Šคํ„ฐ ๊ตฌ์„ฑ์œผ๋กœ ๋…ธ๋“œ๋ฅผ ๊ฐ์†Œํ•˜์—ฌ TCO ์ ˆ๊ฐ

Page 48: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 48 Converged Data Platform

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

Page 49: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 49 Converged Data Platform

ํ•˜๋‘ก ์—์ฝ” ์‹œ์Šคํ…œ โ€“ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

์ปดํฌ๋„ŒํŠธ ์„ค ๋ช…

MapR-NFS โ€ข ์ฝ๊ธฐ/์“ฐ๊ธฐ๊ฐ€ ๊ฐ€๋Šฅํ•œ NFS๋ฅผ ์ œ๊ณตํ•˜์—ฌ ์†Œ์Šค ๋ฐ์ดํ„ฐ ์‹œ์Šคํ…œ์— ๋งˆ์šดํŠธํ•˜์—ฌ ์‹ ์†ํžˆ ๋ฐ์ดํ„ฐ๋ฅผ ๋กœ๋“œ ํ•  ์ˆ˜ ์žˆ์Œ

โ€ข ํ•˜๋‘ก ํŒŒ์ผ์‹œ์Šคํ…œ ์ ‘๊ทผ์— ๋Œ€ํ•œ POSIX ์‚ฌ์šฉ ์ง€์›

Apache Sqoop

โ€ข ๊ฐ„๋‹จํ•œ CLI(Command Line Interface)๋กœ Oracle, MySQL ๋“ฑ์˜ RDBMS์˜ ํŠน์ • ํ…Œ์ด๋ธ” ๋˜๋Š” ํŠน์ • ์กฐ๊ฑด์— ๋งž๋Š” ๋ฐ์ดํ„ฐ

๋ฅผ HDFS๋กœ ์‰ฝ๊ฒŒ ์˜ฎ๊ธธ ์ˆ˜ ์žˆ์œผ๋ฉฐ, Hive, Pig, HBase ๋“ฑ์œผ๋กœ ๋ฐ”๋กœ ์˜ฎ๊ฒจ ํ™•์ธ ๊ฐ€๋Šฅ

โ€ข HDFS์— ์ €์žฅ๋˜์–ด ์žˆ๋Š” ๋ฐ์ดํ„ฐ๋ฅผ RDBMS๋กœ ์ ์žฌ ๋˜๋Š” RDBMS ๋ฐ์ดํ„ฐ๋ฅผ HDFS๋กœ ์ ์žฌ ๊ธฐ๋Šฅ

โ€ข ํ•˜๋‘ก์šฉ ETL ๋„๊ตฌ

Apache Flume

โ€ข ๋Œ€์šฉ๋Ÿ‰์˜ ๋กœ๊ทธ ๋ฐ์ดํ„ฐ๋ฅผ ๋ถ„์‚ฐ,์•ˆ์ •์„ฑ,๊ฐ€์šฉ์„ฑ์„ ๋ฐ”ํƒ•์œผ๋กœ ํšจ์œจ์ ์œผ๋กœ ์ˆ˜์ง‘,์ง‘๊ณ„,์ด๋™์ด ๊ฐ€๋Šฅํ•œ ๋กœ๊ทธ์ˆ˜์ง‘ ๊ธฐ๋Šฅ

โ€ข ์žฅ์• ๊ฐ€ ๋‚˜๋”๋ผ๋„ ๋กœ๊ทธ,์ด๋ฒคํŠธ๋ฅผ ์œ ์‹ค ์—†์ด ์ „์†กํ•จ์„ ๋ณด์žฅ

โ€ข ์ˆ˜ํ‰ํ™•์žฅ(Scale-Out)์ด ๊ฐ€๋Šฅํ•˜์—ฌ ๋ถ„์‚ฐ์ˆ˜์ง‘์ด ๊ฐ€๋Šฅํ•œ ๊ตฌ์กฐ๋กœ ์„ค๊ณ„๋จ

โ€ข ๋‹ค์–‘ํ•œ ์†Œ์Šค๋กœ๋ถ€ํ„ฐ ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘ํ•˜์—ฌ ๋‹ค์–‘ํ•œ ๋ฐฉ์‹์œผ๋กœ ๋ฐ์ดํ„ฐ๋ฅผ ์ „์†ก์ด ๊ฐ€๋Šฅํ•˜์ง€๋งŒ, ์•„ํ‚คํ…์ฒ˜๊ฐ€ ๋‹จ์ˆœํ•˜๊ณ  ์œ ์—ฐํ•˜

๋ฉฐ ํ™•์žฅ ๊ฐ€๋Šฅํ•œ ๋ฐ์ดํ„ฐ ๋ชจ๋ธ์„ ์ œ๊ณตํ•˜์—ฌ, ์‹ค์‹œ๊ฐ„ ๋ถ„์„ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์„ ์‰ฝ๊ฒŒ ๊ฐœ๋ฐœ

โ€ข ๊ฐ์ข… Source, Sink๋“ฑ ์ œ๊ณต์œผ๋กœ ์‰ฝ๊ฒŒ ํ™•์žฅ ๊ฐ€๋Šฅ

โ€œ๋‹ค์–‘ํ•œ ์›์ฒœ ์‹œ์Šคํ…œ์— ์žˆ๋Š” ๋กœ์šฐ ๋ฐ์ดํ„ฐ๋ฅผ ๋ฐ์ดํ„ฐ ์ €์žฅ ์˜์—ญ์ธ MapR-FS(HDFS)์™€ ์—ฐ๊ณ„ํ•˜๊ธฐ ์œ„ํ•ด ์•„๋ž˜์™€ ๊ฐ™์€ ๊ธฐ์ˆ ์„ ์‚ฌ์šฉํ•ฉ๋‹ˆ๋‹ค. โ€

Page 50: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 50 Converged Data Platform

ํ•˜๋‘ก ์—์ฝ” ์‹œ์Šคํ…œ โ€“ ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

์ปดํฌ๋„ŒํŠธ ์„ค ๋ช…

Apache Drill โ€ข ๋‹ค์–‘ํ•œ ์ •ํ˜• ๋˜๋Š” ๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ(Hbase, Hive, MongoDB, Json., CSV๋“ฑ)์—์„œ ์ฆ‰๊ฐ์ ์ธ ๋ฐ์ดํ„ฐ ํƒ์ƒ‰์„ ์ œ๊ณต

โ€ข ANSI SQL ์‚ฌ์šฉ์œผ๋กœ ์†์‰ฝ๊ฒŒ ์ฟผ๋ฆฌ๋ฅผ ์ž‘์„ฑ

Apache Spark โ€ข ํ•˜๋‘ก ๊ธฐ๋ฐ˜์˜ ๊ณ ๊ธ‰ ์‹ค์‹œ๊ฐ„ ๋ถ„์„ ์—”์ง„

โ€ข ๊ณ ์† ์ฟผ๋ฆฌ ์ˆ˜ํ–‰ ํˆด, ๋จธ์‹  ํ•™์Šต ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ, ๊ทธ๋ž˜ํ”„ ํ”„๋กœ์„ธ์‹ฑ ์—”์ง„, ์ŠคํŠธ๋ฆฌ๋ฐ ๋ถ„์„ ์—”์ง„ ์ œ๊ณต

Apache Hive

โ€ข ์ „๋ฌธ์ ์ธ ๋ถ„์„์ฝ”๋”ฉ ๋Šฅ๋ ฅ์ด ํ•„์š” ์—†์ด ๊ฐ„ํŽธํ•˜๊ฒŒ, ์ ‘๊ทผํ•˜๊ธฐ ์‰ฝ๋„๋ก RDB์˜ SQL๋ฌธ๊ณผ ์œ ์‚ฌํ•œ HQL(Hive Query Language)

์„ ์ œ๊ณต

โ€ข ํ…Œ์ด๋ธ”๊ณผ ํŒŒํ‹ฐ์…˜๊ณผ ๊ด€๋ จ๋œ ๋ฉ”ํƒ€์ •๋ณด๋ฅผ ํ•˜์ด๋ธŒ ๋ฉ”ํƒ€์Šคํ† ์–ด์— ์ €์žฅ

Apache Pig โ€ข ๋Œ€์šฉ๋Ÿ‰์˜ ๋ฐ์ดํ„ฐ ์ง‘ํ•ฉ์„ ๋ถ„์„ํ•˜๊ธฐ ์œ„ํ•œ ํ”Œ๋žซํผ

โ€ข ํŠน์ˆ˜๋ชฉ์ ์— ๋งž๊ฒŒ ์‚ฌ์šฉ์ž ํ•จ์ˆ˜๋ฅผ ๋งŒ๋“ค์–ด ํ™•์žฅ์„ฑ์„ ์ œ๊ณต

โ€œMapR์€ ์ˆ˜์ง‘๋œ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•œ ์กฐํšŒ ๋ฐ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•ด SQL on Hadoop๊ณผ ๊ฐ™์€ ๋‹ค์–‘ํ•œ ์˜คํ”ˆ ์†Œ์Šค ๊ธฐ์ˆ ๋“ค์„ ์ œ๊ณต ํ•ฉ๋‹ˆ๋‹ค. โ€

Page 51: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 51 Converged Data Platform

ํ•˜๋‘ก ์—์ฝ” ์‹œ์Šคํ…œ โ€“ ๋ฐ์ดํ„ฐ ๋ณด๊ด€ ๋ฐ ์›Œํฌํ”Œ๋กœ์šฐ ์ฒ˜๋ฆฌ

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

์ปดํฌ๋„ŒํŠธ ์„ค ๋ช…

MapR-FS (HDFS)

โ€ข ๋ถ„์‚ฐ ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•ด์„œ ๋ฐ์ดํ„ฐ๋ฅผ ์ €์žฅํ•˜๊ธฐ ์œ„ํ•œ ํŒŒ์ผ ์‹œ์Šคํ…œ โ€ข ๋น„ NameNode ์•„ํ‚คํ…์ฒ˜ โ€ข ๋ฏธ๋Ÿฌ๋ง์„ ํ†ตํ•ด ๋ณต์ œ๋ณธ์„ ์›๊ฒฉ ์‚ฌ์ดํŠธ์— ์ƒ์„ฑํ•˜์—ฌ ์žฌํ•ด ๋ณต๊ตฌ(DR)๋ฅผ ์ง€์› โ€ข ๋Œ€์šฉ๋Ÿ‰ ๋ฐ์ดํ„ฐ (terabyte ๋˜๋Š” petabyte)๋ฅผ ์ €์žฅํ•˜๋„๋ก ๊ณ ์•ˆ. ์ด๋ฅผ ์œ„ํ•ด์„œ ๋ฐ์ดํ„ฐ๋ฅผ ์—ฌ๋Ÿฌ ๋Œ€์˜ ์ปดํ“จํ„ฐ์— ๋‚˜๋ˆ„์–ด ์ €์žฅ.

์ฆ‰, NFS๋ณด๋‹ค ํ›จ์”ฌ ํฐ ํŒŒ์ผ๋„ ์ง€์› โ€ข ๋ฐ์ดํ„ฐ ์•ก์„ธ์Šค์— ๋Œ€ํ•œ ์„ฑ๋Šฅ๊ฐœ์„ ๋„ ์šฉ์ดํ•˜๋ฉฐ ํ™•์žฅ ์‹œ ํด๋Ÿฌ์Šคํ„ฐ์— ์ปดํ“จํ„ฐ node๋งŒ ์ถ”๊ฐ€ํ•˜๋ฉด ๋จ

MapR-DB (NoSQL) โ€ข ๋กœ๊ทธ ๋ฐ์ดํ„ฐ, ์„ผ์„œ ๋ฐ์ดํ„ฐ, ๋ฉ”ํƒ€๋ฐ์ดํ„ฐ, ํด๋ฆญ ๋™ํ–ฅ, ์‚ฌ์šฉ์ž ํ”„๋กœํŒŒ์ผ, ์„ธ์…˜ ์ƒํƒœ ๋ฐ ๋งํฌ/์˜๋ฏธ/๊ด€๊ณ„ ๋ฐ์ดํ„ฐ๋ฅผ ํฌํ•จํ•œ

๊ด‘๋ฒ”์œ„ํ•œ ์šด์˜ ๋ฐ์ดํ„ฐ ํ˜•์‹์„ ๊ด€๋ฆฌํ•˜๊ธฐ ์œ„ํ•ด ์—ด ๊ธฐ๋ฐ˜(column-oriented) NoSQL ๋ฐ์ดํ„ฐ ๋ชจ๋ธ์„ ์ง€์› โ€ข ๋†’์€ ์ฒ˜๋ฆฌ๋Ÿ‰ ๋ฐ ์ง€์†์ ์ธ ์งง์€ ๋Œ€๊ธฐ์‹œ๊ฐ„์„ ์ œ๊ณต

Apache Oozie

โ€ข ํ•˜๋‘ก ์žก์„ ๊ด€๋ฆฌํ•˜๊ธฐ ์œ„ํ•œ ์›Œํฌํ”Œ๋กœ์šฐ ์Šค์ผ€์ค„๋Ÿฌ ์‹œ์Šคํ…œ โ€ข ์šฐ์ง€ ์›Œํฌ ํ”Œ๋กœ์šฐ ์žก์€ ํ•˜๋‘ก์˜ ๋‹ค์–‘ํ•œ ์žก์„ ์‹คํ–‰ํ•  ์ˆ˜ ์žˆ๋Š” ์•ก์…˜(action)์œผ๋กœ ๊ตฌ์„ฑ๋˜๋ฉฐ, ๋ฐฉํ–ฅ์„ฑ์ด ์žˆ๋Š” ๋น„์ˆœํ™˜ ๊ทธ๋ž˜ํ”„

(DAG: Directed Acyclic Graph)๋ฅผ ๊ตฌ์„ฑ โ€ข ํ•˜๋‘ก ์Šคํƒ์˜ ๋‹ค์–‘ํ•œ ํ•˜๋‘ก ์žก๊ณผ ๋™์ž‘ํ•˜๋„๋ก ํ†ตํ•ฉ๋˜์—ˆ์œผ๋ฉฐ, ์—ฌ๊ธฐ์ด๋Š” ์ž๋ฐ” ๋งต ๋ฆฌ๋“€์Šค, ์ŠคํŠธ๋ฆฌ๋ฐ ๋งต ๋ฆฌ๋“€์Šค, ํ”ผ๊ทธ, ํ•˜์ด

๋ธŒ, ์Šค์ฟฑ, Distcp ๋“ฑ์ด ํฌํ•จ๋˜๋ฉฐ ์ž๋ฐ” ํ”„๋กœ๊ทธ๋žจ์ด๋‚˜ ์‰˜ ์Šคํฌ๋ฆฝํ‹ฐ์™€ ๊ฐ™์€ ํŠน์ • ์‹œ์Šคํ…œ์— ํŠนํ™”๋œ ์žก๋„ ์‹คํ–‰ ๊ฐ€๋Šฅ

โ€œMapR์€ ๋ณด์•ˆ๊ณผ ์„ฑ๋Šฅ์ด ๊ฐ•ํ™”๋œ ๋น„์šฉํšจ์œจ์ ์ธ ์Šค์ผ€์ผ ์•„์›ƒ ๋ฐฉ์‹์˜ ๋ถ„์‚ฐ ์ €์žฅ ์‹œ์Šคํ…œ์ธ MapR-FS, MapR-DB๋ฅผ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. โ€

Page 52: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 52 Converged Data Platform

ํ•˜๋‘ก ์—์ฝ” ์‹œ์Šคํ…œ โ€“ ์™ธ๋ถ€ ์†”๋ฃจ์…˜ ์—ฐ๊ณ„

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

์ปดํฌ๋„ŒํŠธ ์„ค ๋ช…

HBase-API โ€ข MapR-DB์— ์ €์žฅ๋œ ๊ฒฐ๊ณผ ๋ฐ์ดํ„ฐ๋ฅผ HBase-API๋ฅผ ์ด์šฉํ•˜์—ฌ ๋‹ค์–‘ํ•œ ์™ธ๋ถ€ ๋„๊ตฌ๊ณผ ์—ฐ๊ณ„

Kafka-API โ€ข MapR Streams์˜ ๋ฐ์ดํ„ฐ๋ฅผ Kafka-API๋ฅผ ํ†ตํ•˜์—ฌ ํ™œ์šฉ ๊ฐ€๋Šฅ

Hadoop Connector โ€ข Hadoop Connector๋ฅผ ์ด์šฉํ•˜์—ฌ ๋‹ค์–‘ํ•œ ์™ธ๋ถ€ ๋„๊ตฌ๊ณผ ์—ฐ๊ณ„

Apache Drill ODBC/JDBC Driver

โ€ข Apache Drill์˜ ODBC ๋ฐ JDBC ๋“œ๋ผ์ด๋ฒ„๋ฅผ ์ด์šฉํ•˜์—ฌ ๋‹ค์–‘ํ•œ UI ํˆด๊ณผ ์—ฐ๊ณ„

JSON Interface โ€ข MapR-DB, MapR Streams, MapR FS์˜ ๋ฐ์ดํ„ฐ๋ฅผ JSON ๊ธฐ๋ฐ˜์œผ๋กœ ์ธํ„ฐํŽ˜์ด์Šค

โ€œMapR์€ ๊ฐ€๊ณต๋œ ๋ฐ์ดํ„ฐ๋ฅผ ํ™œ์šฉํ•˜๊ธฐ ์œ„ํ•ด ๋‹ค์–‘ํ•œ ํ”„๋กœํ† ์ฝœ ๋ฐ API ๋ฅผ ์ œ๊ณต ํ•ฉ๋‹ˆ๋‹ค. ๋‹ค์–‘ํ•œ SW๋“ค๊ณผ ์œ ์—ฐํ•œ ์—ฐ๊ฒฐ์„ฑ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹คโ€

Page 53: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 53 Converged Data Platform

๋จธ์‹ ๋Ÿฌ๋‹ ๋„๊ตฌ ํ™œ์šฉ

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

โ€œMapR ํ”Œ๋žซํผ์€ Spark Mllib, Mahout ๋จธ์‹ ๋Ÿฌ๋‹ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ์ œ๊ณตํ•˜๋ฉฐ ๊ณ ๊ฐ์‚ฌ์˜ ์š”๊ตฌ์— ๋”ฐ๋ผ Tensorflow, Caffe, H2O์™€ ๊ฐ™์€

๋จธ์‹ ๋Ÿฌ๋‹ ํ˜น์€ ๋”ฅ๋Ÿฌ๋‹๊ณผ ๊ฐ™์€ AI ๋ถ„์„ ๋„๊ตฌ๋“ค์„ ํ™œ์šฉํ•จ์— ์žˆ์–ด MapR NFS ๊ธฐ๋Šฅ์„ ํ†ตํ•ด ์†์‰ฝ๊ฒŒ ๋ถ„์„ ๋„๊ตฌ๋“ค๊ณผ ์—ฐ๊ณ„โ€

Spark MLlib Tensorflow caffe

Spark์˜ ๊ธฐ๋ณธ ๋จธ์‹ ๋Ÿฌ๋‹ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋กœ ์ธ๋ฉ”๋ชจ๋ฆฌ ๊ธฐ๋ฐ˜์œผ๋กœ ๋จธ์‹ ๋Ÿฌ๋‹ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๋ถ„์‚ฐ์ฒ˜๋ฆฌ

๊ตฌ๊ธ€์—์„œ ๊ฐœ๋ฐœํ•œ ๋”ฅ๋Ÿฌ๋‹, ๋จธ์‹ ๋Ÿฌ๋‹ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋กœ CPU/GPU ๋ชจ๋“œ๋กœ ์ˆ˜ํ–‰

๋ฒ„ํด๋ฆฌ๋Œ€์—์„œ ๊ฐœ๋ฐœํ•œ ๋”ฅ๋Ÿฌ๋‹ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋กœ CPU/GPU ๋ชจ๋“œ๋กœ ์ˆ˜ํ–‰

Classification, Regression, Decision Tree, Recommendation, Clustering, Topic Modeling, Association Rule ๋“ฑ์˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์ œ๊ณต

Classification, Regression, Clustering, Markov, Neural Network ๋“ฑ์˜ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์ œ๊ณต

Artificial Neural Network, Convolutional Neural Network ๋“ฑ Neural Network ์ค‘์‹ฌ์˜ ๋”ฅ๋Ÿฌ๋‹ ์•Œ๊ณ ๋ฆฌ์ฆ˜ ์ œ๊ณต

MapR File System MapR File System MapR File System

Tensorflow Server MapR Client

Caffe Server MapR Client

YARN

Spark MLlib

NFS NFS

Page 54: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 54 Converged Data Platform

Spark๊ฐ€ ์ง€์›ํ•˜๋Š” ๊ธฐ๊ณ„ํ•™์Šต ์•Œ๊ณ ๋ฆฌ์ฆ˜

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

โ€œApache Spark๋Š” ์ธ๋ฉ”๋ชจ๋ฆฌ ๊ธฐ๋ฐ˜์˜ ๋ถ„์„์„ ์œ„ํ•ด Mllib ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ํ†ตํ•ด ๋‹ค์–‘ํ•œ ๊ธฐ๊ณ„ ํ•™์Šต ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ์ œ๊ณตโ€

Basic statistics Classification and regression

Collaborative filtering Clustering Dimensionality reduction

Summary statistics

Correlations

Stratified sampling

Hypothesis testing

Random data

generation

SVMs, logistic

regression, linear

regression

Naive Bayes

Decision trees

Random Forests

Gradient-Boosted

Trees

alternating least

squares (ALS)

k-means

Gaussian mixture

Power iteration

clustering (PIC)

Latent Dirichlet

allocation (LDA)

Streaming k-means

Singular value

decomposition

(SVD)

Principal

component

analysis (PCA)

Page 55: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 55 Converged Data Platform

ETL ๋ฐ Workflow ์ฒ˜๋ฆฌ

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

โ€œ์ˆ˜์ง‘, ์ฒ˜๋ฆฌ, ๋ถ„์„ ๋“ฑ์˜ ์ž‘์—…์„ ์›Œํฌํ”Œ๋กœ์šฐ ๋‹จ์œ„๋กœ ์Šค์ผ€์ค„๋ง ํ•˜๊ธฐ์œ„ํ•ด Apache Oozie๋ฅผ ์ง€์›. Apache Oozie๋ฅผ HUE์™€ ํ•จ๊ป˜ ์‚ฌ์šฉํ•˜๋ฉด

GUI ๊ธฐ๋ฐ˜์˜ ๋“œ๋ž˜๊ทธ&๋“œ๋กญ ๊ธฐ๋ฐ˜์œผ๋กœ ์„ค๊ณ„ ํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ ์ž‘์—…์˜ ์‹œ์ž‘, ์ค‘์ง€, ์‹คํŒจ ๋“ฑ์˜ ๋ชจ๋‹ˆํ„ฐ๋ง๋„ GUI ํ™˜๊ฒฝ์—์„œ ์†์‰ฝ๊ฒŒ ํ™•์ธโ€

Oozie ์‹œ์ž‘

Hive ์ž‘์—…1

Sqoop ์ž‘์—…

Hive ์ž‘์—…2 Hive ์ž‘์—…3

Spark ์ž‘์—… Spark ์ž‘์—…

Oozie ์ข…๋ฃŒ

< HUE์—์„œ Oozie ์ˆ˜ํ–‰ >

์ˆ˜์ง‘

์ฒ˜๋ฆฌ

๋ถ„์„

์ž‘์—…์˜ Workflow ๋””์ž์ธ Email ์•Œ๋žŒ ์ง€์› ์Šค์ผ€์ค„๋ง ๋ฐฉ๋ฒ• ์„ ํƒ(์‹œ๊ฐ„, ๋ฐ์ดํ„ฐ) ์ž‘์—…์˜ ๋ชจ๋‹ˆํ„ฐ๋ง ์‹คํŒจํ•œ ์ž‘์—…์˜ ์žฌ์‹คํ–‰ ์„ค์ •

< Oozie ์ž‘์—…์˜ ๋…ผ๋ฆฌ ๊ตฌ์กฐ >

Page 56: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 56 Converged Data Platform

๋ถ„์„ ๋„๊ตฌ ํ™œ์šฉ

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

โ€œMapR ํ”Œ๋žซํผ์€ ๋‹ค์–‘ํ•œ ๋ถ„์„๋„๊ตฌ ํ™œ์šฉ์„ ์ง€์›. ๋ถ„์„ ํ™˜๊ฒฝ์˜ ๋ชฉ์ ๊ณผ ํŠน์„ฑ์— ๋”ฐ๋ผ ์ ์ ˆํ•œ ๋ถ„์„ ๋„๊ตฌ๋ฅผ ์„ ์ •ํ•˜์—ฌ ์ ์šฉ. ์ธํ„ฐํ”„๋ฆฌํ„ฐ

์ข…๋ฅ˜๋‚˜ ์‹œ๊ฐํ™” ๋ฐฉ๋ฒ•์— ๋”ฐ๋ผ ๋ถ„์„ ๋„๊ตฌ์˜ ์„ ํƒ์ด ๋‹ฌ๋ผ์งˆ ์ˆ˜ ์žˆ์œผ๋‚˜ ๊ณตํ†ต์ ์œผ๋กœ ๋ชจ๋“  ๋„๊ตฌ๊ฐ€ Spark๋ฅผ ์ ๊ทน์ ์œผ๋กœ ์ง€์›โ€

HUE Zeppelin Jupyter

Hadoop ๊ธฐ๋ฐ˜์˜ ๊ฐ€์žฅ ๋Œ€์ค‘์ ์ธ ๋ถ„์„ ๋„๊ตฌ๋กœ ์ฟผ๋ฆฌ๋„๊ตฌ ๋ฐ ๋…ธํŠธ๋ถ ๋ถ„์„ ์ œ๊ณต

๊ฐ€์žฅ ๋งŽ์€ ๊ธฐ๋ณธ ์ธํ„ฐํ”„๋ฆฌํ„ฐ๋ฅผ ์ œ๊ณตํ•˜๋ฉฐ

๋…ธํŠธ๋ถ ๊ธฐ๋ฐ˜์˜ ๋ถ„์„

Python ๋…ธํŠธ๋ถ ๋ถ„์„์— ํŠนํ™” ๋˜์–ด ์žˆ์œผ๋ฉฐ

๋‹ค์–‘ํ•œ ๋ฌด๋ฃŒ/์œ ๋กœ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ ์ง€์›

MapR ํŒŒ์ผ์‹œ์Šคํ…œ ๊ด€๋ฆฌ, ํŒŒ์ผ ๊ถŒํ•œ ๊ด€๋ฆฌ, Hive, Impala, Spark(Scala, Python, R)๋“ฑ ๋ถ„์„ ์ง€์›

Hive, Drill, Impala, Spark(Scala, Python, R)

๋“ฑ ๋ถ„์„ ์ง€์›ํ•˜๋ฉฐ ํƒ€ ์ธํ„ฐํ”„๋ฆฌํ„ฐ ์„ค์น˜ ์ง€์›

Python ์ธํ„ฐํ”„๋ฆฌํ„ฐ๋งŒ ๊ธฐ๋ณธ ์ œ๊ณตํ•˜๊ณ  ํƒ€ ์ธํ„ฐ

ํ”„๋ฆฌํ„ฐ๋Š” ๋ณ„๋„ ํ”Œ๋Ÿฌ๊ทธ์ธ์„ ์„ค์น˜ํ•˜์—ฌ ์ง€์›

Page 57: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 57 Converged Data Platform

โ€œ์ด์ƒ์ง•ํ›„ ํฌ์ฐฉ ์‹œ์ ์„ 1์ฐจ ์‹ค์‹œ๊ฐ„, 2์ฐจ ๋น„์‹ค์‹œ๊ฐ„์œผ๋กœ ๊ตฌ๋ถ„ํ•˜์—ฌ 1์ฐจ ๋ถ„์„์—์„œ ๋ฐœ๊ฒฌ๋œ ์ด์ƒ์ง•ํ›„๋ฅผ ์‹ค์‹œ๊ฐ„์œผ๋กœ ๋ฆฌํฌํŠธํ•˜๊ณ  ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ๋ฒ•์ด

์ ์šฉ๋œ 2์ฐจ ์‹ฌ์ธต๋ถ„์„์„ ํ†ตํ•˜์—ฌ ๋ฐœ๊ฒฌ๋œ ์ง•ํ›„๋ฅผ ๋ฆฌํฌํŠธํ•˜์—ฌ ์ „์ฒด ํ˜„์ƒ์„ ์ˆ˜๋ฆฌ์ ์œผ๋กœ ๋ถ„์„ํ•œ ๊ฒฐ๊ณผ๋ฅผ ์ด์ƒ์ง•ํ›„์— ๋Œ€ํ•œ ํ‰๊ฐ€ ๊ธฐ์ค€์œผ๋กœ ํ™œ์šฉ.โ€

๋ฐ์ดํ„ฐ ์†Œ์Šค ์‹ค์‹œ๊ฐ„

์ŠคํŠธ๋ฆผ ๋ถ„์„ ์‹ค์‹œ๊ฐ„

๋Œ€์‹œ๋ณด๋“œ

๋ฐ์ดํ„ฐ ์ €์žฅ ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ

๋จธ์‹ ๋Ÿฌ๋‹ ๋ชจํ˜• ๋Œ€์ž…

๊ณ ์œ„ํ—˜๊ตฐ ๋ฆฌํฌํŠธ

์‹ค์‹œ๊ฐ„ ์ฒ˜๋ฆฌ ๋ฐ ์ €์žฅ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ ๋ฐ์ดํ„ฐ ๋ถ„์„ ๊ฒฐ๊ณผ ๋ฆฌํฌํŠธ

๋‹ค์–‘ํ•œ ์†Œ์Šค์˜ ๋ฐ์ดํ„ฐ ์ˆ˜์ง‘

์‹ค์‹œ๊ฐ„ ๋ถ„์„ ๋ฐ์ดํ„ฐ ๋ถ„๋ฅ˜

๋ฐฐ์น˜ ๋ถ„์„ ๋Œ€์ƒ ๋ฐ์ดํ„ฐ ๋ถ„๋ฅ˜

Apache Sqoop, Apache

Flume, MapR-NFS ๊ธฐ์ˆ ํ™œ์šฉ

MapR-Streams ์™€ Apache

Spark ์—ฐ๊ณ„๋ฅผ ํ†ตํ•œ ๋ฃฐ ๊ธฐ๋ฐ˜์˜

์‹ค์‹œ๊ฐ„ ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ๋ฐ ๋ถ„์„

MapR-DB, MapR-FS์— ์›๋ณธ

๋ฐ ์ฒ˜๋ฆฌ๋œ ๋ฐ์ดํ„ฐ ์ €์žฅ

๋จธ์‹ ๋Ÿฌ๋‹ ๋ถ„์„์„ ์œ„ํ•œ ๋ฐ์ดํ„ฐ

์ •์ œ ๋ฐ ๊ฐ€๊ณต๋“ฑ์˜ ์ „์ฒ˜๋ฆฌ ์ˆ˜ํ–‰

๋ฐ์ดํ„ฐ ์ˆ˜์น˜ํ™”, ์ •ํ˜•ํ™” ์ˆ˜ํ–‰

Apache Hive, Apache Spark

๋“ฑ์˜ ์ฒ˜๋ฆฌ ๊ธฐ์ˆ ์„ ํ™œ์šฉ

๋ชจํ˜• ํ›ˆ๋ จ ๋ฐ ํ‰๊ฐ€๊ฐ€ ์™„๋ฃŒ๋œ

๋ถ„๋ฅ˜๋ชจ๋ธ(Classification)์—

์ƒˆ๋กœ ์œ ์ž…๋œ ๋ฐ์ดํ„ฐ ์ ์šฉ

Apache Spark ๋˜๋Š” Apache

Mahout์˜ ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ์ˆ ํ™œ์šฉ

๋ถ„์„ ์™„๋ฃŒ๋œ ๊ฒฐ๊ณผ ๋ฐ์ดํ„ฐ๋ฅผ

์‹œ๊ฐํ™” ๋ฐ ๋ฆฌํฌํŠธ๋กœ ์ƒ์„ฑ

Apache Drill ๊ธฐ์ˆ ์„ ํ™œ์šฉํ•˜์—ฌ

ANSI SQL ๊ธฐ๋ฐ˜์œผ๋กœ ๊ฒฐ๊ณผ

๋ฐ์ดํ„ฐ ์กฐํšŒ

์ด์ƒ์ง•ํ›„ ์˜ˆ์ธก ์‹œ์Šคํ…œ ์ ์šฉ ์˜ˆ

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

Page 58: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 58 Converged Data Platform

๊ฐœ์ธํ™” ๋งˆ์ผ€ํŒ… ์‹œ์Šคํ…œ ์ ์šฉ ์˜ˆ

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

โ€œ๋‹ค์–‘ํ•œ ์†Œ์Šค๋กœ ๋ถ€ํ„ฐ ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘ํ•˜์—ฌ ํด๋Ÿฌ์Šคํ„ฐ๋กœ ๊ตฌ์„ฑ๋˜์–ด ์žˆ๋Š” MapR ์ปจ๋ฒ„์ง€๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ์— ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘, ์ €์žฅ, ์ฒ˜๋ฆฌ ํ›„

๋ถ„์„ ์„œ๋ฒ„๊ฐ€ ์ €์žฅ๋œ ๋ฐ์ดํ„ฐ๋ฅผ ๋ถ„์„. ๋ถ„์„ ์„œ๋ฒ„๋Š” ๋ถ„์„ ๋ชฉ์ ์— ๋”ฐ๋ผ ๊ฐ๊ธฐ ๋‹ค๋ฅธ ์„œ๋ฒ„๋กœ ๊ตฌ์„ฑ๋˜์–ด ์žˆ์œผ๋ฉด ์‹ ๊ธฐ์ˆ ์„ ์œ„ํ•œ ๋ณ„๋„ ์„œ๋ฒ„ ๊ตฌ์„ฑโ€

MapR Converged Data Platform

ํ†ต๊ณ„๋ถ„์„ ์„œ๋ฒ„

ํ…์ŠคํŠธ ๋งˆ์ด๋‹ ์„œ๋ฒ„

์‹œ๊ฐํ™” ์„œ๋ฒ„

์˜คํ”ˆ์†Œ์Šค ์„œ๋ฒ„

๊ธฐ๊ฐ„๊ณ„ RDBMS

STT ์‹œ์Šคํ…œ

SNS API

x86 ๋ฆฌ๋ˆ…์Šค ์„œ๋ฒ„ / 10G ๋„คํŠธ์›Œํฌ

Page 59: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 59 Converged Data Platform

๊ฐœ์ธํ™” ๋งˆ์ผ€ํŒ… ์‹œ์Šคํ…œ ์ ์šฉ ์˜ˆ

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

โ€œ์ •ํ˜• ๋ฐ์ดํ„ฐ์™€ ๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘ํ›„ ODS ์˜์—ญ์— ์ €์žฅํ›„ ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ๋ฅผ ํ†ตํ•ด ๋ฐ์ดํ„ฐ ๋งˆํŠธ ๊ตฌ์„ฑ. ๊ตฌ์„ฑ ๋œ ๋งˆํŠธ๋Š” ๊ธฐ์กด ๋ถ„์„๋ฐฉ๋ฒ•์œผ๋กœ

์ •ํ˜• ๋ฐ์ดํ„ฐ๋ฅผ ๋ถ„์„ํ•˜๊ณ  ์ƒˆ๋กœ์šด ๋ถ„์„๋ฐฉ๋ฒ•์œผ๋กœ ๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ๋ฅผ ๋ถ„์„ ํ›„ ๋ชจ๋ธ์„ ๊ฒฐํ•ฉํ•˜์—ฌ ๋ณด๋‹ค ์ •ํ™•ํ•œ ์˜ˆ์ธก ๋ชจ๋ธ์„ ์ƒ์„ฑโ€

์ •ํ˜• ๋ฐ์ดํ„ฐ

์นด๋“œ ์‚ฌ์šฉ ์ด๋ ฅ

์นด๋“œ ์Šน์ธ ์ด๋ ฅ

์ฝœ์„ผํ„ฐ ์ƒ๋‹ด ์ด๋ ฅ โ€ฆ

๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ

์ƒ๋‹ด์‚ฌ ๊ธฐ๋ก

STT ์ถ”์ถœ

์›น/๋ชจ๋ฐ”์ผ/ARS

์™ธ๋ถ€ SNS โ€ฆ

MapR ์ปจ๋ฒ„์ง€๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ

ODS

์™ธ๋ถ€ SNS ํ…์ŠคํŠธ

์›น/๋ชจ๋ฐ”์ผ/ARS

์ƒ๋‹ด ์ด๋ ฅ / STT ํ…์ŠคํŠธ

์ฝœ์„ผํ„ฐ ์ƒ๋‹ด ์ด๋ ฅ

์นด๋“œ ์Šน์ธ ์ด๋ ฅ

์นด๋“œ ์‚ฌ์šฉ ์ด๋ ฅ

๋ฐ์ดํ„ฐ ๋งˆํŠธ

..

์ƒ๊ธ‰๊ธฐ๊ฐ„ ์ ‘์ด‰ ํ˜„ํ™ฉ

๋ฏผ์› ๊ฒฝ์œ  ํ˜„ํ™ฉ

๋ฏผ์› ์œ ํ˜•

๊ณ ๊ฐ๊ฐœ์ธ ํ˜„ํ™ฉ

์นด๋“œ๋ก  ํ˜„ํ™ฉ

๊ฐ€๋งน์  ์นด๋“œ์‚ฌ์šฉ ํ˜„ํ™ฉ

..

๋งˆ์ผ€ํŒ… ๋ถ„์„

ํƒ€๊ฒŸ ๋งˆ์ผ€ํŒ…

์ดํƒˆ/์—ฐ์ฒด ์˜ˆ์ธก

์นด๋“œ๋ก  ์˜ˆ์ธก โ€ฆ

๋ฏผ์› ๋ถ„์„

๋ฏผ์› ๋ถ„๋ฅ˜

์ ‘์ด‰ ์ด๋ ฅ

๋ฏผ์› ์žฌ๊ธฐ ์˜ˆ์ธก โ€ฆ

Page 60: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 60 Converged Data Platform

๊ฐœ์ธํ™” ๋งˆ์ผ€ํŒ… ์‹œ์Šคํ…œ ์ ์šฉ ์˜ˆ

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

โ€œMapR์˜ ๋‹ค์–‘ํ•œ ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ์—์ฝ”์‹œ์Šคํ…œ๊ณผ SAS์˜ ๋ถ„์„ ์†”๋ฃจ์…˜์„ ์—ฐ๋™ํ•˜์—ฌ ๋ฐ์ดํ„ฐ๋ฅผ ๋ถ„์„. MapR์˜ ๊ณ ์„ฑ๋Šฅ์„ ํ™œ์šฉํ•˜์—ฌ ๊ธฐ์กด ๋ฐ˜๋…„

์ฃผ๊ธฐ์˜ ๋ฐฐ์น˜ ๋ฐ ๋ถ„์„ ์ž‘์—…์„ ์›”์ฃผ๊ธฐ๋กœ, ์›” ์ฃผ๊ธฐ์˜ ๋ถ„์„ ์ž‘์—…์„ ์ผ์ฃผ๊ธฐ๋กœ ๋‹จ์ถ• ํ•˜์˜€๊ณ  ์˜ํ–ฅ ๋ณ€์ˆ˜์˜ ์ˆ˜ ๋„ ํ™•์žฅํ•˜์—ฌ ๋ถ„์„โ€

์ •ํ˜• ๋ฐ์ดํ„ฐ

๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ

๋งˆ์ด๋‹ ์†”๋ฃจ์…˜

SAS CA

SAS EM

์‹œ๊ฐํ™” ์†”๋ฃจ์…˜

Dashboard

Chart/Graph

Oracle DB

ํ…์ŠคํŠธ ํŒŒ์ผ / ๋กœ๊ทธ ํŒŒ์ผ

์˜คํ”ˆ์†Œ์Šค ์†”๋ฃจ์…˜

Tensorflow/Cafe

R / Python

Jupyter/Zeppelin

MapR ์ปจ๋ฒ„์ง€๋“œ ๋ฐ์ดํ„ฐ ํ”Œ๋žซํผ

Sqoop Hive Impala

MapR-NFS Spark

Oozie

HUE / MapR MCS

Zookeeper

๋ฐ์ดํ„ฐ ์ˆ˜์ง‘ ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ๋ฐ์ดํ„ฐ ๋ถ„์„

Page 61: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 61 Converged Data Platform

๊ฐœ์ธํ™” ๋งˆ์ผ€ํŒ… ์‹œ์Šคํ…œ ์ ์šฉ ์˜ˆ

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

MapR File System

. . .

Spark

SAS Impala Connector

SAS Hive Connector

SAS Data Loader

SAS Enterprise Miner

Hive

Impala

Hiveserver2

Spark

Hive

Impala

Spark

Hive

Impala

MapR File System

. . .

SparkR

Impala JDBC Driver

Hive JDBC Driver

MapR Client

R Server

Hive

Impala

Hiveserver2

SparkR

Hive

Impala

SparkR

Hive

Impala

JDBC

JDBC

JDBC

JDBC

API API

< MapR & SAS ์—ฐ๊ณ„ ์•„ํ‚คํ…์ฒ˜ > < MapR & R ์—ฐ๊ณ„ ์•„ํ‚คํ…์ฒ˜ >

์ „์šฉ ์ปค๋„ฅํ„ฐ๋ฅผ ํ†ตํ•ด Impala ์™€ Hive๋ฅผ ์—ฐ๊ณ„ํ•˜์—ฌ SAS์˜ ๋ฐ์ดํ„ฐ ์†Œ์Šค๋กœ ํ™œ์šฉ

SAS Data Loader๋ฅผ ํ†ตํ•ด Spark์™€ ์—ฐ๊ณ„ํ•˜์—ฌ SAS์˜ ๋ฐ์ดํ„ฐ ์ •์žฌ์ž‘์—…์„ ์œ„์ž„

JDBC ๋“œ๋ผ์ด๋ฒ„๋ฅผ ํ†ตํ•ด Impala ์™€ Hive๋ฅผ ์—ฐ๊ณ„ํ•˜์—ฌ R์˜ ๋ฐ์ดํ„ฐ ์†Œ์Šค๋กœ ํ™œ์šฉํ•˜์—ฌ R

Server์—์„œ ๋ถ„์„์„ ์ˆ˜ํ–‰

MapR Client๋ฅผ ํ†ตํ•ด SparkR์˜ API๋ฅผ ํ˜ธ์ถœํ•˜์—ฌ Spark ํด๋Ÿฌ์Šคํ„ฐ์—์„œ ๋ถ„์‚ฐ์ฒ˜๋ฆฌ

NFS NFS

Page 62: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 62 Converged Data Platform

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

๊ตฌ์„ฑ ์•„ํ‚คํ…์ฒ˜ ์˜ˆ์‹œ

ํ•˜๋‘ก๊ธฐ๋ฐ˜ ํ”Œ๋žซํผ ๊ตฌ์ถ•

Analytics

Streaming Data

Sensing

Log

Unstructured Data

Text

Documents

Legacy System

RDBMS

File System

Visualization

Graph

Chart

BI

Report

Ad-hoc Report

Search

Data Exploration

MapR Converged Data Platform

MapR-DB MapR File

System MapR Streams

Event Streaming

Storm

Spark Streaming

Interactive Query

Drill

Spark SQL

Batch Processing

Hive

Spark

Data Integration

Sqoop

Flume

Workflow/Devel.

Oozie

HUE

Machine Learning

Spark

Mahout ๋กœ๊ทธ ์ŠคํŠธ๋ฆผ

์„ผ์„œ ์ŠคํŠธ๋ฆผ

๋น„์ •ํ˜• ๋ฌธ์„œ ์ ์žฌ

๋ฐ์ดํ„ฐ ์ ์žฌ(ETL)

์ •ํ˜• ๋‹ค์ฐจ์› ๋ถ„์„

์ •ํ˜• ๋ฐ์ดํ„ฐ ์‹œ๊ฐํ™”

๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ ์‹œ๊ฐํ™”

๋ฐ์ดํ„ฐ ๋ถ„์„

Inte

rfa

ce

(N

FS

/ h

ttp

FS

/ A

PI

/ O

DB

C)

Inte

rfa

ce

(N

FS

/ h

ttp

FS

/ A

PI

/ O

DB

C)

Page 63: MapR Converged Data Platform

ยฉ 2017 MapR Technologies 63 Converged Data Platform

4. ์˜คํ”ˆ์†Œ์Šค ์—์ฝ”์‹œ์Šคํ…œ

๊ตฌ์„ฑ ์•„ํ‚คํ…์ฒ˜ ์˜ˆ์‹œ

ํ•˜๋‘ก์„ ์‚ฌ์šฉํ•œ ํ•˜์ด๋ธŒ๋ฆฌ๋“œ DW

Analytics

Streaming Data

Sensing

Log

Unstructured Data

Text

Documents

Legacy System

RDBMS

File System

Visualization

Graph

Chart

BI

Report

Ad-hoc Report

Search

Data Exploration

MapR Converged Data Platform

MapR-DB MapR

File System MapR Streams

Event Streaming

Storm

Spark Streaming

Interactive Query

Drill

Spark SQL

Batch Processing

Hive

Spark

Data Integration

Flume

Sqoop

Workflow/Devel.

Oozie

HUE

Machine Learning

Spark

Mahout

DW Appliance

Query Node

Data Node

Data Node ์ •ํ˜• ๋‹ค์ฐจ์› ๋ถ„์„

์ •ํ˜• ๋ฐ์ดํ„ฐ ์‹œ๊ฐํ™”

๋น„์ •ํ˜• ๋ฐ์ดํ„ฐ ์‹œ๊ฐํ™”

๋ฐ์ดํ„ฐ ๋ถ„์„

์ฟผ๋ฆฌ ํŽ˜๋”๋ ˆ์ด์…˜ ๋ฐ์ดํ„ฐ ์ ํ•˜(Offload)

๋ฐ์ดํ„ฐ ์ ์žฌ(ETL)

๋กœ๊ทธ ์ŠคํŠธ๋ฆผ

์„ผ์„œ ์ŠคํŠธ๋ฆผ

๋น„์ •ํ˜• ๋ฌธ์„œ ์ ์žฌ

Inte

rfa

ce

(N

FS

/ h

ttp

FS

/ A

PI

/ O

DB

C)

Inte

rfa

ce

(N

FS

/ h

ttp

FS

/ A

PI

/ O

DB

C)