Skip to content

Contact sales

By filling out this form and clicking submit, you acknowledge our privacy policy.

Hadoop Security

Course Summary

The Hadoop Security training course focuses on securing Hadoop Clusters in order to organize data and keep it safe and secure. This also ensures that the organization is compliant with various standards like PCI, HIPAA, etc.

This course begins by focusing on authentication, authorization, and encryption aspects of Cloudera Hadoop Security. Students will learn the theoretical and practical aspects of Hadoop Security. Next, through a series of short lectures and hands-on exercises and labs, students will explore authentication approaches, working with Kerberos, and authorization using Apache Sentry. The course concludes by providing a deep-dive into encryption and other security-related topics including HBase ACLs, impersonation, and masking sensitive data.

Purpose
Learn how to implement secure Hadoop clusters using authentication, authorization, and encryption.
Audience
Developers and developer teams needing to learn Hadoop Security.
Role
Data Engineer - Software Developer - System Administrator
Skill Level
Intermediate
Style
Targeted Topic - Workshops
Duration
2 Days
Related Technologies
Hadoop | Cybersecurity

 

Productivity Objectives
  • Describe Cloudera Hadoop Security Fundamentals
  • Identify Kerberos Basics and its purpose
  • Implement Kerberos
  • Integrate Windows AD with Kerberos
  • Implement Sentry
  • Use data in motion encryption and data at rest encryption

What You'll Learn:

In the Hadoop Security training course, you'll learn:
  • Hadoop Security Overview
    • What is Hadoop Security?
    • Why it is Important
    • Security Aspects: Key things to consider
    • Securing the Hadoop ecosystem
    • Spin up Cloudera multi-node cluster
  • Authentication
    • Authentication Approaches, Pros, and Cons
    • Introduction to MIT Kerberos
    • How to work with MIT Kerberos?
    • How to enable Kerberos Authentication in Cloudera Manager (CM)
    • Executing HDFS commands
    • Working with YARN applications
    • Perform Ad-hoc Analysis using Hive
    • Kerberos Integration with Windows Active Directory (AD)
    • Advantages of working with AD
    • Windows Active Directory Server Installation
    • Configuring AD Server
    • Integrating Hue With Active Directory
    • Preparing Cluster With Kerberos Authentication
    • Integrating Kerberos With Active Directory
    • Enabling Single Sign-On
  • Authorization
    • Key Authorization Frameworks
    • What is Apache Sentry?
    • Working with Sentry Authorization
    • Integrating Sentry with HUE
    • Querying Hive
    • HDFS Extended ACLs
    • Limitations of Sentry
    • What is Cloudera Record Service?
    • Why it is required
    • Implementing Record Service
  • Encryption
    • Types of Encryptions
    • OS level encryption
    • HDFS encryption
    • Setting up HDFS Encryption Zone
    • Working with HDFS Encryption Zones
    • Data in motion Encryption
    • Introduction to SSL Tools
    • Using a Self-Signed Root CA
    • Enabling and Validating SSL For Hadoop Core
    • SASL Hive And HiveServer2
    • SSL With Hue
  • Other Security-Related Topics
    • Auditing using Cloudera Navigator
    • HBase ACLs
    • Impersonation
    • Masking Sensitive Data
    • Login using KeyTab file
    • UserGroupInformation Basics
“I appreciated the instructor's technique of writing live code examples rather than using fixed slide decks to present the material.”

VMware

Dive in and learn more

When transforming your workforce, it's important to have expert advice and tailored solutions. We can help. Tell us your unique needs and we'll explore ways to address them.

Let's chat

By filling out this form and clicking submit, you acknowledge our privacy policy.