Big Data Systems Infrastructure

All Rights reserved @ www.clslearn.com , Contact us : [email protected] , +201000216660 , +201001692348

Course Overview

Big Data Systems Infrastructure

Duration: 104 Hours

Schedule:

* Full Day Morning (9-5)

* Half Day Evening (6-10)

* Weekends Full Day (10-4)

Instructor-Led

Hands-On Training

Delivery Options:

* In CLS Classroom.

* On-site Classroom.

* Online Live.

Your Training Comes with a 100% Satisfaction Guarantee!

* This course is essential for software engineers, programmers, data analysts, database administrators, and anyone who wants to build strong big data skills.

* You will learn how to use the most popular software in the big data industry at the moment, covering both batch processing and real-time processing.

* This course will give you enough background to discuss real problems and solutions with experts in the industry.

* You will gain experience in ingesting data into the Hadoop file system, working with data in batch and stream processing, delivering visualizations and insights from data, and many other technical skills.

* Following Course: Big Data Applied Technologies and Models (Level 3)

Course Outline

Beginner Level: 3 Days / 24 Hours

Fundamentals of Big Data

* Intro to Distributed Systems & HDFS

* Exploring Big Data Ecosystem & Distributions

* Basic Intro to NoSQL

Hadoop Fundamentals

* Intro to Hadoop

* Introduction to Map Reduce

* Intro to YARN

* Basic Hadoop Cluster Implementation

SQL-On-Hadoop

* Intro to SQL-On-Hadoop

* Intro to Hive

* Basic Implementation of Hive & HiveServer2

* Ingesting Data into Hadoop

* Intro to Sqoop

* Concept of Stream Processing

* Intro to Kafka

Advanced Level: 5 Days / 40 Hours

Map Reduce In Depth

* Hadoop Architecture In Depth

* Intro to Apache Zookeeper

* Advanced Cluster Implementation

SQL-On-Hadoop

* Advanced Hive Architecture

* Ingesting Data Into Hive Using Sqoop

* HiveQL

* Intro to Apache HBase

Ingesting Data into Hadoop

* Intro to ETL Concepts

* Intro to Data Flow using Apache NiFi

* Implementing Apache NiFi Cluster

* Simple Integration Project with Apache NiFi

Intro to Python Programming

* Ingesting Data into Hadoop

* Implementing Kafka Cluster

* Building Simple Kafka Producer & Consumer Using Python

Intro to Apache Spark

* Spark Implementation

* Simple Data Analysis with Apache Spark

Big Data Workshop: 5 Days / 40 Hours

Project 1: Data Ingestion from RDBMS into Hive (ORC File)

* Using Sqoop to Connect to RDBMS (Oracle / SQL Server)

* Creating Hive Table on ORC File

* Hive Table Performance Tuning

Project 2: Streaming Data Ingestion into Hadoop/Hive Using Kafka

* Using Kafka to Connect to RDBMS (Oracle / SQL Server)

* Kafka Advanced Cluster Configuration

* Ingesting Data into Hadoop/Hive

Project 3: Data Analysis & Stream Processing Using Spark 1

* Spark Cluster Implementation

* Using PySpark

* Processing Data With Spark

* Stream Processing With Spark

Project 4: Data Analysis & Stream Processing Using Spark & Hive 2

* Integrating Spark with Hive

* Ingesting Data with Spark into Hive

* Using Spark SQL

Project 5: Data Visualization Using Apache Zeppelin

* Implementing Apache Zeppelin

* Integrating Apache Zeppelin With Spark

* Visualize Data With Zeppelin

Course Outcome

* Students will be trained as big data engineers with real-world, hands-on experience in Hadoop administration, ETL (batch/stream), SQL-on-Hadoop, and many other technologies related to big data.

Audience Profile

- The course is aimed at Software Engineers, Database Administrators, and System Administrators who want to learn about Big Data. Other IT professionals can also take this course, but might have to do some extra research to understand some of the concepts.

- It is also for anyone interested in big data management, data engineering, and data analysis using Big Data technologies who wants hands-on training in big data and NoSQL.

Prerequisites

Attending Course: Big Data Systems (Level 1)

- To gain the most from the workshop, the following is required:

- Knowledge of programming: the basics of Java, Scala, or Python.

- Knowledge of Relational Database Management Systems.

- Basic knowledge of operating systems and networking.

We select the best instructors, who are certified by trusted international vendors. They not only deliver the training program but also share their professional experience with students, so that students gain hands-on experience for the job market.

CLS facilities are well equipped with strong hardware and software technologies that help both students and trainers run effective, smooth training programs.

We provide our clients with the best solutions. Our team of training advisers answers whatever questions you have.

We have been in the market since 1995, accumulating experience in the training business and providing training for more than 100,000 trainees in Egypt and the MENA region.

CLS is an authorized and accredited partner of technology leaders such as Microsoft, EC-Council, Adobe, and Autodesk. This means that our training programs use the highest-quality source materials, are the most up to date, and offer the highest possible return on investment.

We keep tabs on every change in the market and the technology field, so our training programs are always updated to the latest world-class standards and adapted to the ever-shifting global job market.

Our clients prefer our training programs not only for the quality of education they get, but also because they are cost-effective.
