Monday, March 17, 2014

Psn Trainings Offers Hadoop Online and Classroom Training in Hyderabad,india



What is Hadoop?

Hadoop is the Apache Software Foundation top-level project that holds the various Hadoop subprojects that graduated from the Apache Incubator. The Hadoop project provides and sup-ports the envelopment of open source software that supplies a framework for the development of highly scalable distributed computing applications. The Hadoop framework handles the processing details, leaving developers free to focus on application logic.Hadoop Training in india

The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing, including:

·         Hadoop Core, our flagship sub-project, provides a distributed file system (HDFS) and support for the Map Reduce distributed computing metaphor.

·         HBase builds on Hadoop Core to provide a Scalable, distributed database. 

·         Pig is a high-level data-flow language and execution framework for parallel computation. It is built on top of Hadoop Core.Hadoop online training in hyderabad

·         Zookeeper is a highly available and reliable coordination system. Distributed applications use Zookeeper to store and mediate updates for critical shared state. 

·         Hive is a data warehouse infrastructure built on Hadoop Core that provides data sum-marization, adhoc querying and analysis of data sets.
      
      Course Content

1.INTRODUCTION

What is Hadoop?

History of Hadoop

Building Blocks - Hadoop Eco-System

Who is behind Hadoop?

What Hadoop is good for and what it is not


2.HDFS

Configuring HDFS

Interacting With HDFS

HDFS Permissions and Security

Additional HDFS Tasks

HDFS Overview and Architecture

HDFS Installation

Hadoop File System Shell

File System Java API


3.MAPREDUCE

Map/Reduce Overview and Architecture

Installation

Developing Map/Red Jobs

Input and Output Formats

Job Configuration

Job Submission

Practicing Map Reduce Programs (atleast 10 Map Reduce Algorithms ) 


4.Getting Started With Eclipse IDE

Configuring Hadoop API on Eclipse IDE

Connecting Eclipse IDE to HDFS


5.Hadoop Streaming


6.AdvancedMapReduce Features

Custom Data Types

Input Formats

Output Formats

Partitioning Data

Reporting Custom Metrics

Distributing Auxiliary Job Data


7.Distributing Debug Scripts

8.Using Yahoo Web Services


9.Pig

Pig Overview

Installation

Pig Latin

Pig with HDFS


10. Hive

Hive Overview

Installation

Hive QL

Hive Unstructured Data Analyzation

Hive Semistructured Data Analyzation


11.HBase

HBase Overview and Architecture

HBase Installation

HBase Shell

CRUD operations

Scanning and Batching

Filters

HBase Key Design


12.ZooKeeper

Zoo Keeper Overview

Installation

Server Mantainace


13.Sqoop

Sqoop Overview

Installation

Imports and Exports


14.CONFIGURATION

Basic Setup

Important Directories

Selecting Machines

Cluster Configurations

Small Clusters: 2-10 Nodes

Medium Clusters: 10-40 Nodes

Large Clusters: Multiple Racks


15.Integrations

16.Putting it all together

Distributed installations

Best Practices