Best Hadoop Bigdata Training institute in Hyderabad,India with real-time Professionals,good working experience,and we provide hadoop certification also
Thursday, April 3, 2014
Monday, March 17, 2014
Psn Trainings Offers Hadoop Online and Classroom Training in Hyderabad,india
What is Hadoop?
Hadoop is the Apache Software Foundation top-level project
that holds the various Hadoop subprojects that graduated from the Apache
Incubator. The Hadoop project provides and sup-ports the envelopment of open
source software that supplies a framework for the development of highly
scalable distributed computing applications. The Hadoop framework handles the
processing details, leaving developers free to focus on application logic.Hadoop Training in india
The Apache Hadoop project develops open-source software for
reliable, scalable, distributed computing, including:
·
Hadoop Core, our flagship sub-project, provides
a distributed file system (HDFS) and support for the Map Reduce distributed
computing metaphor.
·
HBase builds on Hadoop Core to provide a Scalable, distributed database.
·
Pig is a high-level data-flow language and
execution framework for parallel computation. It is built on top of Hadoop
Core.Hadoop online training in hyderabad
·
Zookeeper is a highly available and reliable
coordination system. Distributed applications use Zookeeper to store and
mediate updates for critical shared state.
·
Hive is a data warehouse infrastructure built on
Hadoop Core that provides data sum-marization, adhoc querying and analysis of
data sets.
Course Content
1.INTRODUCTION
What is Hadoop?
History of Hadoop
Building Blocks - Hadoop Eco-System
Who is behind Hadoop?
What Hadoop is good for and what it is not
2.HDFS
Configuring HDFS
Interacting With HDFS
HDFS Permissions and Security
Additional HDFS Tasks
HDFS Overview and Architecture
HDFS Installation
Hadoop File System Shell
File System Java API
3.MAPREDUCE
Map/Reduce Overview and Architecture
Installation
Developing Map/Red Jobs
Input and Output Formats
Job Configuration
Job Submission
Practicing Map Reduce Programs (atleast 10 Map Reduce Algorithms )
4.Getting Started With Eclipse IDE
Configuring Hadoop API on Eclipse IDE
Connecting Eclipse IDE to HDFS
5.Hadoop Streaming
6.AdvancedMapReduce Features
Custom Data Types
Input Formats
Output Formats
Partitioning Data
Reporting Custom Metrics
Distributing Auxiliary Job Data
7.Distributing Debug Scripts
8.Using Yahoo Web Services
9.Pig
Pig Overview
Installation
Pig Latin
Pig with HDFS
10. Hive
Hive Overview
Installation
Hive QL
Hive Unstructured Data Analyzation
Hive Semistructured Data Analyzation
11.HBase
HBase Overview and Architecture
HBase Installation
HBase Shell
CRUD operations
Scanning and Batching
Filters
HBase Key Design
12.ZooKeeper
Zoo Keeper Overview
Installation
Server Mantainace
13.Sqoop
Sqoop Overview
Installation
Imports and Exports
14.CONFIGURATION
Basic Setup
Important Directories
Selecting Machines
Cluster Configurations
Small Clusters: 2-10 Nodes
Medium Clusters: 10-40 Nodes
Large Clusters: Multiple Racks
15.Integrations
16.Putting it all together
Distributed installations
Best Practices
Subscribe to:
Comments (Atom)
