Bigdata Hadoop Developer Training in Bangalore : 9916229967 /9740342454
Become an expert in Hadoop by acquiring knowledge on MapReduce, Hadoop architecture, Pig & Hive, Oozier. Also, get familiar with Hbase, Sqoop concepts, while working on industry-based, use-cases and projects.
Prerequisites : -- Basic Unix Commands
- Core Java (OOPS Concepts, Collections , Exceptions ) - For Map-Reduce Programming
- SQL Query knowledge - For Hive Queries
- Any Linux flavor OS (Ex: Ubuntu/Cent OS/Fedora/RedHat Linux) with 4 GB RAM (minimum), 100 GB HDD
- Java 1.6+
- Open-SSH server & client
- MYSQL Database
- Eclipse IDE
- VM Ware (To use Linux OS along with Windows OS)
bigdata hadoop Development Training Objectives :-
All Participant will learn to:
- What Big Data is?
- How Big Data creates several new types of analytical workload
- Big Data technology platforms beyond the data warehouse
- Big Data analytical techniques and front-end tools
- How to analyse un-modelled, multi-structured data using Hadoop, MapReduce & Spark
- How to integrate Big Data with traditional data warehouses and BI systems
- How to clearly understand business use cases for different Big Data technologies
- How to set up and organise Big Data projects including skills
- How to make use of Big Data to deliver business value
Course outline
Module 1: Introduction to bigdata hadoop
- Introduction to big data
- What is BigData
- How much data - Data Scaling
- How it is different
- How data becomes BigData
- BigData Sources
- Characteristics of BigData
- Types of BigData
- Applications of BigData
- Analytics of BigData
- Risks of BigData
- Benefits of BigData
- BigData impact on Market
- Future of BigData
Module 2: Hadoop
- Introduction to Hadoop
- Hadoop History and origin
- What is hadoop used for ?
- Why hadoop
- Scalability
- Vertical Scaling
- Horizontal Scaling
- Hadoop Distributions
- Users of Hadoop
- How hadoop useful for organizations
- Hadoop Ecosystem Overview
- Hadoop Architecture
- Hadoop Daemons
- Master Daemons ( Namenode,JobTracker, Secondary Namenode)
- Slave Daemons ( Data Node,TaskTracker)
- Daemons Architecture
- Roles ‐‐‐ Hadoop Daemons
- Installing Cloudera _ Hadoop
Module 3: HDFS (Hadoop Distributed File System
- Introduction to HDFS
- Traditional File System VS HDFS
- Design Principles
- HDFS Architecture
- Cluster Architecture and Block
- Placement Replication and its Strategy
- HDFS Commands
Module 4: MongoDB
- What is MongoDB?
- Where to Use?
- Configuration On Windows
- Inserting the data into MongoDB?
- Reading the MongoDB data
Module 5: MapReduce
- Introduction to MapReduce
- Stages Of MapReduce
- Design Of MapReduce
- MapReduce Lifecycle
- Mapper
- Reducer
- Developing Variety of MapReduce
- Programs using Eclipse in java
- programming language
- Custom Partitioner
- Combiner
- Custom Record Reader
Module 6: Advanced MapReduce
- More Topics related to MapReduce
- Running MapReduce Programs in OOZIE
- Input Formats
- Counters
Module 7: PIG
- PIG vs. MapReduce
- PIG components
- PIG execution
- PIG Data types
- PIG Architecture
- PIG Latin Relational Operators
- PIG Latin Join and CoGroup
- PIG Latin Group and Union
- Describe, Explain, Illustrate
- PIG Latin: File Loaders
Module 8: UNDERSTANDING APACH Flume
- Flume-How Flume works
- Import/Export Data
- Flume Architecture
Module 9: Cloudera Manager
- Cloudera Manager Overview
- Cloudera Manager Services
Module 10: HIVE
- Intrductin & Installation
- Datatypes in Hive
- Cmmands in Hive
- Internal Tables & External Tables
- Partitions
- Static
- Dynamic
- Bucketing
- UDF's in Hive
- Jins in Hive
- Serde's
- Miscellaneus Commands
- Running Hive in oozie
Module 11: APACHE HBASE & NOSQL Databases
- Introduction to NoSQL
- RDBMS vs NoSQL
- Analytical (OLAP)
- When/Why to use HBase
- HBase Architecture/Storage HBase Features
- HBase Data Model HBase Families
- HBase Master
- HBase vs RDBMS
- Column Families
- Access HBase Data HBase API
- Runtime modes
- Running HBase
Module 12: SQOOP
- Introduction & Installation
- Overview
- Import Data from RDBMS to HDFS
- Parallelism
- Import Data from RDBMS to HIVE
- Real Time Use Cases
- Partitions
- Static
- Dynamic
- Bucketing
Module 13: OOZIE
- Oozie Introduction & Overview
- Running MapReduce with OOZIE
- Running Pig Jobs with OOZIE
- Multiple actions of a workflow
- Scheduling TimeBased Jobs
- Email Notifications when Job Triggers
Module 14: HUE
- Introduction To Hue
- Hue Architecture
- Running Jobs through Hive
Module 15: Data Visualization
- End‐End flow from Extraction to Visualization
- Visualization in Tableau
After Completion of the Course:
- In depth coverage of all the topics
- Unlimited practical facility
- Flexible batch timings
- Study material provided
- Lab Manuals provided
- 24/7 Lab access
- Interview Guidance
- Mock Tests conducted
- Mock Interviews
- Resume Preparation as per the standards
- Guidance for bigdata hadoop Certification Exams
- MNC Reference provided
