Introduction to Hadoop Administration

Course Summary

The Introduction to Hadoop Administration training course will provide you with a comprehensive understanding of all the steps necessary to operate and maintain a Hadoop cluster, from installation and configuration through load balancing and tuning.

The course begins with an overview of the Big Data landscape, then dives into a system administrator's working view of running Hadoop. It concludes with hands-on experience with some of the most common and challenging scenarios Hadoop administrators see in the real world, along with the most up-to-date details of the platform.

This course requires prior knowledge of basic networking; a working knowledge of a Unix environment is helpful.

Purpose
Learn how to administer and maintain Hadoop.
Audience
System administrators, DevOps engineers, and software developers responsible for managing and maintaining Hadoop clusters.
Role
Software Developer - System Administrator
Skill Level
Introduction
Style
Hack-a-thon - Learning Spikes - Workshops
Duration
4 Days
Related Technologies
Hadoop | Java | Apache

 

Productivity Objectives
  • Identify the fundamental concepts of Hadoop.
  • Define and plan a Hadoop cluster.
  • Review HDFS features.
  • Load data into HDFS.
  • Administer MapReduce.
  • Prepare the installation and configuration of Hadoop.
  • Demonstrate cluster maintenance.

What You'll Learn:

In the Introduction to Hadoop Administration training course, you'll learn:
  • Hadoop Introduction
    • A Brief History of Hadoop
    • Core Hadoop Components
    • Fundamental Concepts
  • Planning Your Hadoop Cluster
    • General Planning Considerations
    • Choosing Hardware
    • Network Considerations
    • Configuring Nodes
    • Planning for Cluster Management
  • HDFS
    • HDFS Features
    • Writing and Reading Files
    • NameNode Considerations
    • HDFS Security
    • NameNode Web UI
    • Hadoop File Shell
  • Getting Data into HDFS
    • Pulling Data from External Sources with Flume
    • Importing Data from Relational Databases with Sqoop
    • REST Interfaces
    • Best Practices
  • MapReduce
    • MapReduce Overview
    • Features of MapReduce
    • Architectural Overview
    • YARN MapReduce Version 2
    • Failure Recovery
    • The JobTracker Web UI
  • Hadoop Installation and Initial Configuration
    • Deployment Types
    • Installing Hadoop
    • Specifying the Hadoop Configuration
    • Initial HDFS and MapReduce Configuration
    • Log Files
  • Installing/Configuring Hive, Impala, and Pig
    • Hive
    • Impala
    • Pig
  • Hadoop Clients
    • What is a Hadoop Client?
    • Installing and Configuring Hadoop Clients
    • Installing and Configuring Hue
    • Hue Authentication and Configuration
  • Advanced Cluster Configuration
    • Advanced Configuration Parameters
    • Configuring Hadoop Ports
    • Explicitly Including and Excluding Hosts
    • Configuring HDFS for Rack Awareness and HDFS High Availability
  • Hadoop Security
    • Why Hadoop Security is Important
    • Hadoop's Security System Concepts
    • What Kerberos is and How it Works
    • Securing a Hadoop Cluster with Kerberos
  • Managing and Scheduling Jobs
    • Managing Running Jobs
    • Scheduling Hadoop Jobs
    • Configuring the FairScheduler
  • Cluster Maintenance
    • Checking HDFS Status
    • Copying Data Between Clusters
    • Adding/Removing Cluster Nodes
    • Rebalancing the Cluster
    • NameNode Metadata Backup
    • Cluster Upgrades
  • Cluster Monitoring and Troubleshooting
    • General System Monitoring
    • Managing Hadoop's Log Files
    • Monitoring the Clusters
    • Common Troubleshooting Issues
“I appreciated the instructor's technique of writing live code examples rather than using fixed slide decks to present the material.”

VMware

Dive in and learn more

When transforming your workforce, it's important to have expert advice and tailored solutions. We can help. Tell us your unique needs and we'll explore ways to address them.
