-
Onboard
For Tech Teams
- Reduce initial time to productivity.
- Increase employee tenure.
- Plug-and-play into HR onboarding and career pathing programs.
- Customize for ad-hoc and cohort-based hiring approaches.
-
Upskill
For Tech Teams
- Upgrade and round out developer skills.
- Tailor to tech stack and specific project.
- Help teams, business units, centers of excellence and corporate tech universities.
-
Reskill
For Tech Teams
- Offer bootcamps to give employees a running start.
- Create immersive and cadenced learning journeys with guaranteed results.
- Supplement limited in-house L&D resources with all-inclusive programs to meet specific business goals.
-
Design
For Tech Teams
- Uplevel your existing tech learning framework.
- Extend HR efforts to provide growth opportunities within the organization.
- Prepare your team for an upcoming tech transformation.
Get your team started on a custom learning journey today!
Our Boulder, CO-based learning experts are ready to help!
Course Summary
The Hadoop for Data Analysts training course is designed to demonstrate how to manage, manipulate, and query large complex data in real time, using SL and familiar scripting languages on Hadoop.
The course begins with an introduction to Hadoop basics. Next, it explores how Apache Pig and Apache Hive enable data transformations and analyses via filters, joins, and user-defined functions. The course concludes by examining how to analyze and process data with Pig, and how to optimize Hive.
- Productivity Objectives:
- Understand Hadoop fundamentals
- Know how to use Pig to analyze data
- Understand how to process complex data with Pig
- Troubleshoot Pig
- Know when to use Hive
- Know how to manage data with Hive
- Understand how to optimize Hive
Request Information
Get your team upskilled or reskilled today. Chat with one of our experts to create a custom training proposal. Fully customized at no additional cost.

If you are not completely satisfied with your training class, we'll give you your money back.




about our training
-
Real-World Content
Project-focused demos and labs using your tool stack and environment, not some canned "training room" lab.
-
Expert Practitioners
Industry experts with 15+ years of industry experience that bring their battle scars into the classroom.
-
Experiential Learning
More coding than lecture, coupled with architectural and design discussions.
-
Fully Customized
One-size-fits-all doesn't apply to training teams. That's where we come in!
What You'll Learn
In the Hadoop for Data Analysts training course, you'll learn:
- Understanding Hadoop
- Hadoop Overview
- The Hadoop Ecosystem
- The Hadoop Distributed File System (HDFS)
- Input Data into HDFS
- The MapReduce Framework and YARN
- Overview of Sqoop/Flume
- Overview of Ozzie Workflow Engine
- Introduction to Pig
- Pig’s Features/Use Cases
- Interact with Pig
- Basic Data Analysis with Pig
- Pig Latin
- Load Data
- Field Definitions and Simple Data Types
- Data Output
- View the Schema
- Filter/Sort Data
- Common Functions
- Processing Complex Data with Pig
- Storage Formats
- Complex/Nested data types
- Groups
- Built-in functions for working with complex data
- Iterate grouped data
- MultiData Set Operations with Pig
- Combine Data Sets
- Join Data Sets
- Set Operations
- Split Data Sets
- Extending Pig
- Parameters
- Macros/Imports
- UDFs
- Use Other Languages to Process Data with Pig
- Pig Troubleshooting and Optimization
- Logs
- Hadoop’s Web UI
- Data samples and debugs
- Understand the execution plan
- Improve the performance
- Introduction to Hive
- Hive schema and data storage
- Hive vs. traditional databases
- Hive vs. pig
- When to use Hive
- Relational data analysis with Hive
- Hive databases and tables
- Basic HiveQL syntax
- Data types
- Joining data sets
- Common built-in functions
- Hive Data Management
- Hive data formats
- Create databases and Hivemanaged tables
- Load Data into Hive
- Alter databases and tables
- Self-managed tables
- Simplify queries with views
- Store query results
- Control access to data
- Text Processing with Hive
- Text Processes
- Important string functions
- Use regular expressions in Hive
- Hive Optimization
- Understand query performance
- Control job execution plan
- Partitioning
- Bucketing
- Index Data
- Extending Hive
- Data Transformation with Custom Scripts
- User-defined Functions
- Parameterized Queries
Real-world content
Project-focused demos and labs using your tool stack and environment, not some canned "training room" lab.
Expert Practitioners
Industry experts that bring their battle scars into the classroom.
Experiential Learning
More coding than lecture, coupled with architectural and design discussions.
Fully Customized
One-size-fits-all doesn't apply to training teams. That's where we come in!

Elite Instructor Program
We recently launched our internal Elite Instructor Program. The community driven instructor program is designed to support instructors in transforming students’ lives by consistently showing a world-class level of engagement, ability, and teaching prowess. Reach out today to learn more about our instructors.
Customized Technical Learning Solutions to Help Attract and Retain Talented Developers
Let DI help you design solutions to onboard, upskill or reskill your software development organization. Fully customized. 100% guaranteed.
DevelopIntelligence leads technical and software development learning programs for Fortune 500 companies. We provide learning solutions for hundreds of thousands of engineers for over 250 global brands.



“I appreciated the instructor’s technique of writing live code examples rather than using fixed slide decks to present the material.”
VMwareAbout Us
LET’S DISCUSS
DevelopIntelligence has been in the technical/software development learning and training industry for nearly 20 years. We’ve provided learning solutions to more than 48,000 engineers, across 220 organizations worldwide.
Resources
Thank you for everyone who joined us this past year to hear about our proven methods of attracting and retaining tech talent.

- Boulder, Colorado Headquarters: 980 W. Dillon Road, Louisville, CO 80027
© 2013 - 2022 DevelopIntelligence LLC - Privacy Policy