HADOOP TECHNOLOGY ppt

Technical Seminar on

HADOOP TECHNOLOGY

Under the Guidance of

P.V.R.K.MURTHY, M.Tech

Assistant Professor

CONTENTS

What is Hadoop Technology?

Why Hadoop?

Developers of Hadoop Technology

Famous Hadoop users

Hadoop Features

Hadoop Architecture

Core-Components of Hadoop

Hadoop High-Level Architecture

Hadoop cluster

What is HDFS?

HDFS – Name Node features

HDFS-name node architecture

HDFS-data node

Hadoop MAPREDUCE

Benefits of Hadoop

Conclusion

References

HADOOP TECHNOLOGY

What is Hadoop Technology?

•Hadoop is the best-known technology used for Big Data.

•It is a large-scale batch data processing system.

Why Hadoop?

•Distributed cluster system

•Platform for massively scalable applications

•Enables parallel data processing

Developers of Hadoop Technology:

Michael J. Cafarella

Doug Cutting

Famous Hadoop users

Hadoop Features

•Hadoop provides access to the file systems

•The Hadoop Common package contains the necessary JAR files and scripts

•The package also provides source code, documentation, and a contribution section that includes projects from the Hadoop community.

HADOOP ARCHITECTURE

Core-Components of Hadoop:

Hadoop Distributed File System (HDFS).

MapReduce.

What is HDFS?

•Distributed file system

•Traditional hierarchical file organization

•Single namespace for the entire cluster

•Write-once-read-many access model

•Aware of the network topology

Hadoop High-Level Architecture

Hadoop cluster

•A small Hadoop cluster includes a single master and multiple worker nodes

Master node: Job Tracker, Task Tracker, Name Node, Data Node

Slave node: Data Node, Task Tracker

HDFS – Name Node Features

Metadata in main memory:

• List of files

• List of blocks for each file

• List of Data Nodes for each block

• File attributes

• Creation time

Transaction log:

• Records every change in the metadata
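
The metadata listed above can be pictured as a few in-memory maps plus a log of every change. The following is a minimal Python sketch of that idea, not the real HDFS data structures; the class names `FileEntry` and `NameNodeMetadata`, the block IDs, and the Data Node names are all hypothetical.

```python
import time
from dataclasses import dataclass, field


@dataclass
class FileEntry:
    """Per-file metadata kept in memory (hypothetical layout)."""
    blocks: list = field(default_factory=list)            # ordered block IDs
    creation_time: float = field(default_factory=time.time)  # a file attribute


class NameNodeMetadata:
    """Toy in-memory namespace; not the real HDFS implementation."""

    def __init__(self):
        self.files = {}            # path -> FileEntry (list of files)
        self.block_locations = {}  # block ID -> set of Data Node names
        self.transaction_log = []  # every metadata change, in order

    def create_file(self, path):
        self.files[path] = FileEntry()
        self.transaction_log.append(("create", path))

    def add_block(self, path, block_id, data_nodes):
        self.files[path].blocks.append(block_id)
        self.block_locations[block_id] = set(data_nodes)
        self.transaction_log.append(("add_block", path, block_id))

    def locate(self, path):
        """Return [(block ID, sorted Data Node names)] for a file."""
        return [(b, sorted(self.block_locations[b]))
                for b in self.files[path].blocks]


meta = NameNodeMetadata()
meta.create_file("/logs/app.log")
meta.add_block("/logs/app.log", "blk_1", ["dn1", "dn2", "dn3"])
meta.add_block("/logs/app.log", "blk_2", ["dn2", "dn3", "dn4"])
print(meta.locate("/logs/app.log"))
```

A client asking where a file lives only needs the `locate` lookup; the file contents themselves never pass through this structure.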

HDFS-name node architecture

Primary name-node

Secondary name-node

1. Pull transaction log

2. Merge changes

3. Store to HDD

4. Push
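
Steps 1 and 2 of the checkpoint cycle (pull the transaction log, merge the changes) can be illustrated by replaying logged operations over a copy of the previous namespace image. This is a simplified sketch under assumed names: the operation tuples, `apply_transaction`, and `checkpoint` are hypothetical, and storing the result to disk and pushing it back (steps 3 and 4) are left out.

```python
def apply_transaction(image, txn):
    """Apply one logged change to a namespace image (dict: path -> block list)."""
    op = txn[0]
    if op == "create":
        image[txn[1]] = []
    elif op == "add_block":
        image[txn[1]].append(txn[2])
    elif op == "delete":
        image.pop(txn[1], None)
    else:
        raise ValueError(f"unknown operation: {op}")


def checkpoint(old_image, transaction_log):
    """Merge a pulled transaction log into a copy of the old image."""
    # Copy first so the old image stays usable until the new one is complete.
    merged = {path: list(blocks) for path, blocks in old_image.items()}
    for txn in transaction_log:
        apply_transaction(merged, txn)
    return merged


old_image = {"/a.txt": ["blk_1"]}
log = [
    ("create", "/b.txt"),
    ("add_block", "/b.txt", "blk_2"),
    ("delete", "/a.txt"),
]
new_image = checkpoint(old_image, log)
print(new_image)
```

Once the merged image exists, the log that produced it can be truncated, which keeps the log from growing without bound.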

HDFS-Data node

•Block server: stores data in the local file system

•Periodic validation of checksums

•Periodically sends a report of all existing blocks to the Name Node
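
The three Data Node duties above can be sketched in a few lines of Python. This is an illustration only: the `DataNode` class and its method names are hypothetical, blocks are held in a dictionary rather than the local file system, and `zlib.crc32` stands in for whatever checksum a real deployment uses.

```python
import zlib


class DataNode:
    """Toy block server; blocks live in a dict instead of on local disk."""

    def __init__(self, name):
        self.name = name
        self.blocks = {}     # block ID -> bytes
        self.checksums = {}  # block ID -> CRC32 recorded at write time

    def write_block(self, block_id, data):
        self.blocks[block_id] = data
        self.checksums[block_id] = zlib.crc32(data)

    def validate(self):
        """Return IDs of blocks whose current CRC32 no longer matches."""
        return [b for b, data in self.blocks.items()
                if zlib.crc32(data) != self.checksums[b]]

    def block_report(self):
        """List of all block IDs held, as would be sent to the Name Node."""
        return sorted(self.blocks)


dn = DataNode("dn1")
dn.write_block("blk_1", b"hello hadoop")
dn.write_block("blk_2", b"more data")
dn.blocks["blk_2"] = b"more dat4"  # simulate corruption of the stored bytes
corrupt = dn.validate()
report = dn.block_report()
print(corrupt, report)
```

In a real cluster both `validate` and `block_report` would run on a timer; here they are called once by hand.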

Hadoop MAPREDUCE

Job Tracker:

•Splits a job into map and reduce tasks

•Schedules tasks on a cluster node

Task Tracker:

•Runs MapReduce tasks and periodically reports to the Job Tracker

MapReduce implementation:
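
As a rough illustration of the programming model, the classic word-count job can be simulated in a single Python process. This sketch does not use the Hadoop API; the function names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are hypothetical, and on a real cluster the map and reduce calls would run as separate tasks on different nodes.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group all emitted values by key, as happens between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    # Each line is mapped independently, so this step can run in parallel.
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


counts = word_count(["Hadoop stores data", "Hadoop processes data"])
print(counts)
```

Because each map call sees only its own line and each reduce call sees only one key, neither needs shared state, which is what lets the work be spread across many nodes.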

Benefits of Hadoop…

•Cost-saving, efficient, and reliable data processing

•Provides an economically scalable solution

•Storing and processing of large amounts of data

•Data grid operating system

•It is deployed on industry-standard servers rather than expensive specialized data storage systems.

•Parallel processing of huge amounts of data across inexpensive, industry-standard servers.

Why commodity hardware?

•It is cheaper

•Hadoop is designed to tolerate faults

Why HDFS?

•Network bandwidth vs. seek latency

Why the MapReduce programming model?

•Parallel programming

•Large data sets

•Moving computation to data

•Single compute + data cluster

CONCLUSION

REFERENCES

•Apache Hadoop (http://hadoop.apache.org)

•Hadoop on Wikipedia (http://en.wikipedia.org/wiki/Hadoop)

•Cloudera - Apache Hadoop for the Enterprise (http://www.cloudera.com)

Any Queries

Thank you
