1. Big Data Hadoop Beginners 2. Advanced 3. Python ...saec.ac.in/IT-HUB/pdf/Revised_Syllabus.pdfDay...
Transcript of 1. Big Data Hadoop Beginners 2. Advanced 3. Python ...saec.ac.in/IT-HUB/pdf/Revised_Syllabus.pdfDay...
1. Big Data – Hadoop – Beginners
2. Big Data – Hadoop – Advanced
3. Python programming & Data Analytics ( MongoDB, MySQL)
4. Software Testing, Selenium and Java Programming
Course -1
Big Data – Hadoop – Beginners
Technology Stack: RDBMS, Java, Hadoop Ecosystem, Sqoop, Hive, Pig, MapReduce96 Hrs.
Day 1: Why BigData is Hot Technology in the Market?
1.Introduction to Big Data and Hadoop 2.What is Big Data?
3.Types of Data 4.Need for Big Data
5.Characteristics of Big Data
6.Traditional IT Analytics Approach
7.Big Data—Use Cases
Quiz – 1
Day 2: Working with SQL Database
1. Introduction to MySQL Database 2. Data Retrieval using SELECT Statement
3. Data Restriction and Sorting
4. Single Row Functions
5.Conversion Functions
6. Data Aggregation
Lab Exercises on SQL
Quiz – 2
Day 3: Working with SQL Database
1. Table Joins 2. Sub Queries
3. Set Operations 4. Data Manipulation Language
5. Data Control Language
6. Correlated Sub Queries
Lab Exercises on SQL
Quiz – 3
Day 4: Working with Java Programming
1. Java Basics 2. Oops Concepts
3. Data Types, Variables, Methods, Operators
4. Class and Objects
5. Constructor
6. Arrays
Lab Exercises on Java Programs
Quiz – 4
Day 5: Working with Java Programming
1. Polymorphism 2. Abstraction
3. Interfaces 4. Casting
5. String, Stringbuffer and String Builder
6. Collection Frameworks
Lab Exercises on Java Programs
Quiz – 5
Day 6: Working with Java Programming
1. Reading Data from CSV file
Technology Stack: RDBMS, Java, Hadoop Ecosystem, Sqoop, Hive, Pig, MapReduce96 Hrs.
2. Writing Data into CSV file
3. Reading Data from JSON file
4. Writing Data into JSON file
5. Reading and Writing into XML file
6. Reading and Writing into MySQL Database
Lab Exercises on Java Programs
Quiz – 6
Day 7: Hadoop Architecture
1. Understanding HDFS
2. MapReduce Architecture
3. Understanding Single Node Cluster
4. Understanding Multi Node Cluster
5. MapReduce Example using Word Count
Lab Exercises onInstalling Single Node Hadoop Clusters
Lab Exercises on Hadoop Shell Commands
Quiz – 7
Day 8: Pig Programming Language
1. Components of Pig
2. Pig Data Model
3. Pig vs SQL
4. Filtering and Transformation of Data
5. Grouping and Sorting
6. Combining and Splitting
Lab Exercises on Pig Commands
Quiz – 8
Day 9: Sqoop ETL Tool
1. Understanding ETL
2. Sqoop Architecture
3. Moving Data from RDBMS to Hadoop
4. Moving Data from Hadoop to RDBMS
5. Sqoop Import
6. Sqoop Export
Lab Exercises on Sqoop Commands
Quiz – 9
Day 10: Hive
1. What is Hive
2. Characteristics of Hive
3. Hive Query Language
4. Difference between Hive and PIG
5. Hive Data Models
6. Hive Functions
Lab Exercises on Hive Query Language
Quiz – 10
Technology Stack: RDBMS, Java, Hadoop Ecosystem, Sqoop, Hive, Pig, MapReduce96 Hrs.
Day 11: Lab Exercises
1. XML Data processing using Hadoop
2. Processing Logs in Pig
3. Process your tweets in Hive
4. JSON processing using MapReduce
5. Pattern Matching Example using Logs
6. Sqoop Data Transfer
Lab Exercises
Quiz – 11
Day 12: Lab Exercises and Certifications
1. Retail Data Analysis
2. Airlines Data Analysis
3. Use Case Discussion
4.
5. Project Guidance
6. Feedback and Clarifications
Certification
Final Quiz
Technology Stack: RDBMS, Java, Hadoop Ecosystem, Sqoop, Hive, Pig, MapReduce96 Hrs.
Big Data – Hadoop – Advanced- Course 2
Day 1: BigDataHadoop Beginner Course Recap 1.Introduction to Big Data and Hadoop 2. Hadoop
3.Hive 4.PIG
5.SQOOP
6.HDFS
7.MapReduce
Quiz – 1
Day 2: Advanced MapReduce
1. Combiners 2. Hadoop Data Types
3. Input Formats in MapReduce 4. Output Formats in MapReduce
5. Distributed Cache
6. Joins in MapReduce
Lab Exercises on MapReduce
Quiz – 2
Day 3: Advanced Flume
1. Apache Flume 2. Big data ecosystem
3. Changing structure of Data 4. Adding flume agent
5. Data volume per agent
6. Example describing a single node flume deployment
Lab Exercises on Flume
Quiz – 3
Day 4: HBase
1. HBase Introduction 2. Characteristics of HBase
3. HBase Architecture 4. Storage Model of HBase
5. When to Use HBase
6. HBase Families
Lab Exercises on HBase Installation
Quiz – 4
Day 5: HBase
1. HBase Components 2. Row Distribution between Region Servers
3. Data Storage 4. Installing of HBase
Technology Stack: RDBMS, Java, Hadoop Ecosystem, Sqoop, Hive, Pig, MapReduce96 Hrs.
5. Configuration of HBase
6. HBase Shell Commands
Lab Exercises on HBase Shell
Quiz – 5
Day 6: Spark
1. What is Spark? 2. Comparison with Hadoop
3. Components of Spark
4. Spark Persistence
5. Transformation 6. PySpark Examples
Lab Exercises Spark
Quiz – 6
Day 7: Spark SQL and RDD
1. Introduction
2. Spark SQL main capabilities
3. Spark SQL usage diagram
4. Spark SQL Examples
5. Important topics in Spark SQL-Data frames
Lab Exercises onInstalling Spark Clusters
Lab Exercises on Spark SQL
Quiz – 7
Day 8: Kafka
1. Understand Kafka and its component
2. Integrating Kafka with real time Streaming Systems
3. Designing a high throughput messaging system
4. Use Kafka to produce and consume
5. Understanding the insights of Kafka API
6. Kafka Flume Integration
Lab Exercises Kafka Cluster building
Quiz – 8
Day 9: Lab I
1. Flume Data Connection into HBase
2. RDBMS to MapReduce
3. Distributed Indexing MapReduce
4. Pattern Matching
Quiz – 9
Day 10: Lab II
1. Movie Recommendation
2. Twitter API Integration for tweet Analysis
3. Data Exploration Using Spark SQL – Wikipedia Dataset
Quiz – 10
Technology Stack: RDBMS, Java, Hadoop Ecosystem, Sqoop, Hive, Pig, MapReduce96 Hrs.
Course – 3
Python programming & Data Analytics ( MongoDB, MySQL)
Day 11: Lab III
1. Combiner Using MapReduce
2. Multiple Reducer using MapReduce
3. Process your tweets in Hive
4. MapReduce Design Patterns
Quiz – 11
Day 12: Lab IV
1. Sorting of Records using Spark
2. List the items
3. Use Case Discussion
4. Pig Design Patterns
5. Project Guidance
6. Feedback and Clarifications
Certification
Final Quiz
Technology Stack: RDBMS, Java, Hadoop Ecosystem, Sqoop, Hive, Pig, MapReduce96 Hrs.
Day 1: Python an Introduction
1.Why Python? 2. Where is Python Used?
3. How Python Programs are Run? 4. Popular IDE’s available
5. Modules
6. Module Namespaces
7. Standard Library modules
Quiz – 1
Day 2: Data Expressions, Statement
1. Built-in Types 2. Strings, Lists, Dictionaries, Tuples, Files, User-defined Classes
3. Numeric Types 4. Lists and Dictionaries
5. Tuples and Files
6. User-defined classes
Lab Exercises
Quiz – 2
Day 3: Control Flow, Functions
1. Python Statement 2. Assignments, Expressions
3. If tests 4. While and for Loop
5. def blocks, global and local scope
6. Arguments & parameters
Lab Exercises
Quiz – 3
Day 4: Compound Data: Lists, Tuples, Dictionaries
1. List operations, List slices, list methods, list loop, mutability, aliasing 2. Tuple Assignments, Tuple as return value
3. Dictionaries: Operations and methods, advanced list processing 4. iteration and dictionary comprehensions,
5. Range, map and zip iterables
6. Selection Sort, Insertion Sort, MergeSort, QuickSort
Lab Exercises
Quiz – 4
Day 5: Oops and Exception Handling
1. OOP: the big picture
2. Class coding basics
3. Realistic examples 4. Polymorphism, Interface, Operator & Function Overloading
5. Try catch block
6. System and user-defined exceptions
Technology Stack: RDBMS, Java, Hadoop Ecosystem, Sqoop, Hive, Pig, MapReduce96 Hrs.
Lab Exercises
Quiz – 5
Day 6: Introduction to Database
1. Relational Databases 2. Single Table CRUD
3. SQL Queries
4. Installing MySQL Database and Workbench
5. Data Definition Language 6. Data Manipulation Language
Lab Exercises on SQL
Quiz – 6
Day 7: DB Operations Using Python
1. Installing Python MySQL Driver
2. Inserting Relational Data using python
3. Reconstruction of Relational Data
4. Insert Update Delete using Python
5. Reading Data using Python
Lab Exercises onSQL
Lab Exercises on Python
Quiz – 7
Day 8: Introduction to NoSQL and MongoDB
1. Introduction to NoSQL
2. CAP Theorem
3. MongoDB Introduction
4. Understanding JSON
5. PyMongo Driver
6.Accessing MongoDB Import & Export Utilities
Lab Exercises MongoDB Installation
Quiz – 8
Day 9: CRUD Operations in MongoDB
1. Introduction to Mongo Shell
2. Query Operations
3. Data Types
4. Filtering Data
5. Data Projection
6. Schema Design
Lab Exercises on Mongo Shell Commands
Quiz – 9
Day 10: Performance Improvements
1. MongoDB Indexing
2. Monitoring
3. Sharding
Technology Stack: RDBMS, Java, Hadoop Ecosystem, Sqoop, Hive, Pig, MapReduce96 Hrs.
Course - 4
Software Testing, Selenium and Java Programming
4. Query tuning
5. Aggregations
Lab Exercises on Mongo Shell Commands
Quiz – 10
Day 11: File Operations
1. Reading CSV file and Loading into Mongo DB
2. Reading JSON file and Loading into MongoDB
3. Reading MongoDB and writing into RDBMS
4. Reading MongoDB and writing into Files
5. Bulk Operations in MongoDB
Lab Exercises on MongoDB Python Language
Quiz – 11
Day 12: Lab OpenStreetMap Data Analysis
1. Using iterative parsing for large datafiles
2. Open Street Map XML Overview
3. Exercises around OpenStreetMap Data
4. Use Care discussions
5. Project Guidance
6. Feedback and Clarifications
Certification
Final Quiz
Technology Stack: RDBMS, Java, Hadoop Ecosystem, Sqoop, Hive, Pig, MapReduce96 Hrs.
Day 1: Introduction to Java Programing
1. The Java Virtual Machine 2. Variables and Data Types
3. Conditional and looping constructs 4. Arrays
5. Object Oriented Programming
6. Fields and Methods
7. Constructors
Quiz – 1
Day 2: Inheritance and Exception Handling
1. Overriding methods 2. Polymorphism
3. Making methods and Class final 4. Abstract classes and methods
5. Interfaces
6. The Exception Class
Lab Exercises
Quiz – 2
Day 3: Object Class, Wrapper Class & Packages
1. Cloning objects 2. The JDK LinkedList Class
3. Strings and Conversions 4. Enumeration interface
5. Package access
6. Documentation comments
Lab Exercises
Quiz – 3
Day 4: Applets, Basics of AWS and Swings
1. Configuration apples 2. Applet capabilities and restrictions
3. Layout Managers, Event Handling 4. The Action Listener Interface and Panels
5. Classes for various controls, such as label, choice, list Checkbox
6. Dialogs and frames
Lab Exercises
Quiz – 4
Day 5: Threads, I/O Package, Networking
1. Synchronization
2. InputStream and OutputStream Classes
3. Reader and Writer Classes 4. Working with URL’s
5. Sockets
6. Java Security
Technology Stack: RDBMS, Java, Hadoop Ecosystem, Sqoop, Hive, Pig, MapReduce96 Hrs.
Lab Exercises
Quiz – 5
Day 6: Java Database Connectivity
1. JDBC Product 2. Types of Drivers
3. Two-tier Client/Server Model
4. Three-tier Client/Server Model
5. Basic concepts of JDBC 6. Interface
Lab Exercises
Quiz – 6
Day 7: Servlets
1. Servlet Interaction & Advanced Servlets
2. Life cycle of Servlets
3. Java Servlet Development Kit
4. Reading Servlet Parameters
5. Reading Initialization Parameters
6. Handling HTTP
Lab Exercises
Quiz – 7
Day 8: Java Server Pages
1. JSP Technologies
2. Understanding the Client-Server Model
3. Understanding Web Server software
4. Configuration the JSP Server
5. Handling JSP Server
6. JSP Transaction Time Errors
Lab Exercises MongoDB Installation
Quiz – 8
Day 9: RMI & ESB & XML
1. RMI Architecture
2. Types of Enterprise Java Beans
3. Features of Session Bean
4. Implementation of Beans
5. XML
6. XML Syntax Rules
Lab Exercises
Quiz – 9
Day 10: Structs
1. Introduction to the Apache Structs
2. MVC Architecture
3. Struct Architecture
Technology Stack: RDBMS, Java, Hadoop Ecosystem, Sqoop, Hive, Pig, MapReduce96 Hrs.
4. Using Struct HTML Tags
5. Customer Validators Example
6. Developing Application with Struct
Lab Exercises
Quiz – 10
Day 11: Hibernate
1. Introduction to Hibernate 3.0
2. Hibernate Architecture
3. First Hibernate Application
4. Hibernate Query Language
5. Introduction to Spring
6. Hibernate with Spring
Lab Exercises
Quiz - 11
Day 12: Introduction to Selenium
1. Introduction to Automation
2. Difference Components of Selenium
3. Selenium IDE Introduction
4. Web Driver with Project Implementation
5. Project Guidance
6. Automation Framework
Lab Exercises
Quiz – 12