Need help finding the right learning solutions? Call Us: 720-445-4360
- Onboard For Tech Teams
- Reduce initial time to productivity.
- Increase employee tenure.
- Plug-and-play into HR onboarding and career pathing programs.
- Customize for ad-hoc and cohort-based hiring approaches.
- Upskill For Tech Teams
- Upgrade and round out developer skills.
- Tailor to tech stack and specific project.
- Help teams, business units, centers of excellence and corporate tech universities.
- Reskill For Tech Teams
- Offer bootcamps to give employees a running start.
- Create immersive and cadenced learning journeys with guaranteed results.
- Supplement limited in-house L&D resources with all-inclusive programs to meet specific business goals.
- Design For Tech Teams
- Uplevel your existing tech learning framework.
- Extend HR efforts to provide growth opportunities within the organization.
- Prepare your team for an upcoming tech transformation.
Get your team started on a custom learning journey today!
Our Boulder, CO-based learning experts are ready to help!
Course Summary
The Introduction to Administering Hadoop Clusters training course includes one or more labs to reinforce and extend the topics under discussion, including a review of example configurations and run-time reports. The Hadoop administration course focuses on the key aspects of installing and maintaining a Hadoop cluster in various forms.
The course begins by teaching students how to operate the Hadoop Distributed File System (HDFS) file system and MapReduce I/O framework as complementary technologies. Next the course dives into configuring and monitoring processes to manage storage and job tasks, adding network topology awareness to a cluster and configuring a federated or highly-available storage system. The course concludes with supplementing clusters with enhanced storage features and client tools.
This course can be extended to five days if additional coverage in the following areas as needed:
a) writing MapReduce jobs
b) managing job properties
c) overviews on ecosystem projects such as Hive, Pig, Impala, and HBase
d) lab integration with an existing in-house cluster
Purpose
Learn how to set, configure, and administer Hadoop.
Audience
System adminstrators, developers, and DevOps engineers creating Big Data solutions using Hadoop.
Skill Level
Style
Duration
4 Days
- Productivity Objectives:
- Describe the HDFS file system and MapReduce I/O frameworks.
- Configure and monitor storage management processes and tasks.
- Add network topology awareness to a cluster.
- Configure a highly-available storage system.
- Supplement clusters with enhanced storage features and client tools.
Request Information
Get your team upskilled or reskilled today. Chat with one of our experts to create a custom training proposal. Fully customized at no additional cost.
If you are not completely satisfied with your training class, we'll give you your money back.
about our training
-
Real-World Content
Project-focused demos and labs using your tool stack and environment, not some canned "training room" lab.
-
Expert Practitioners
Industry experts with 15+ years of industry experience that bring their battle scars into the classroom.
-
Experiential Learning
More coding than lecture, coupled with architectural and design discussions.
-
Fully Customized
One-size-fits-all doesn't apply to training teams. That's where we come in!
What You'll Learn
In the Introduction to Administering Hadoop Clusters training course, you'll learn:
- Hadoop Concepts
- Operating on Large Data Sets
- Parallelizing to Improve Performance
- Using Large Block Sizes
- Distributing & Replicating Data
- Assigning Code to Data
- Compensating for Node Failures and Recoveries
- Adding Nodes for Better Performance
- Using Virtualization for Rapid Deployment
- Installing a Hadoop Cluster
- Understanding the NameNode
- Understanding the Secondary NameNode
- Understanding the Data Node
- Understanding the JobTracker
- Understanding the TaskTracker
- Understanding the MapReduce Flow
- Mapping Data
- Shuffling and Sorting
- Reducing Data
- Using the Write Once, Read Many Approach
- Reviewing Job Performance
- Interpreting Console Output
- Navigating the JobTracker UI
- Using TaskTracker Logs
- Configuring Nodes
- Understanding Hadoop Property Management
- Managing Core Properties
- Managing HDFS Properties
- Managing MapReduce Properties
- Managing Worker Properties
- Restricting Job Property Changes
- Supporting Federated & HA File Systems
- Restoring NameNode Services
- Protecting NameNode Metadata
- Using Federated NameNodes
- Understanding the NameNode HA Model
- Configure a Federated HDFS system
- Create defensive copies of NameNode metadata
- Alt: Configure an HA NameNode for manual failover
- Controlling Jobs and Resources
- Scheduling Jobs
- Understanding the FairScheduler
- Orchestrating Workflows
- Importing Legacy and Continuous Data
- Using Sqoop
- Using Flume
- Understanding Hive, Impala, and HBase
- Maintaining HDFS
- Checking block integrity
- Balancing data across nodes
- Using HDFS Safe Mode
- Addressing Other HDFS Systems
- Restricting Node Additions
- Installing Ecosystem Packages
- Installing Pig
- Installing Hive
- Reviewing HBase Requirements
- Improving Hadoop Security
- Reviewing Authentication & Authorization
- Understanding Hadoop’s Authorization Model
- Understanding Kerberos Architecture
- Reviewing Kerberos Implementation Options
Real-world content
Project-focused demos and labs using your tool stack and environment, not some canned "training room" lab.
Expert Practitioners
Industry experts that bring their battle scars into the classroom.
Experiential Learning
More coding than lecture, coupled with architectural and design discussions.
Fully Customized
One-size-fits-all doesn't apply to training teams. That's where we come in!
Elite Instructor Program
We recently launched our internal Elite Instructor Program. The community driven instructor program is designed to support instructors in transforming students’ lives by consistently showing a world-class level of engagement, ability, and teaching prowess. Reach out today to learn more about our instructors.
Customized Technical Learning Solutions to Help Attract and Retain Talented Developers
Talk to one of our Learning Solution Architects today
Let DI help you design solutions to onboard, upskill or reskill your software development organization. Fully customized. 100% guaranteed.
DevelopIntelligence leads technical and software development learning programs for Fortune 500 companies. We provide learning solutions for hundreds of thousands of engineers for over 250 global brands.
“I appreciated the instructor’s technique of writing live code examples rather than using fixed slide decks to present the material.”
VMwareResources
Thank you for everyone who joined us this past year to hear about our proven methods of attracting and retaining tech talent.
- Boulder, Colorado Headquarters: 980 W. Dillon Road, Louisville, CO 80027
- 877-629-5631, 720-445-4360
© 2013 - 2020 DevelopIntelligence LLC - Privacy Policy