The Advanced Apache Spark training course is designed to deeply explore Apache Spark.
The course begins with a review of core Apache Spark concepts, followed by a lesson on understanding Spark internals for performance. Next, it discusses the new features of Spark 2 and how to use them. The course concludes with lessons on advanced Spark SQL streaming, high-performance Spark applications, and best practices.
| Purpose | Learn how to use Spark internals for working with NoSQL databases as well as debugging and troubleshooting. |
| Audience | Developers who have taken the introduction to Spark course or who have equivalent experience. |
| Role | Data Engineer - Data Scientist - Software Developer |
| Skill Level | Intermediate |
| Style | Fast Track - Targeted Topic - Workshops |
| Duration | 4 Days |
| Related Technologies | Apache Spark, NoSQL, Apache |
Productivity Objectives
- Apply the Apache Spark fundamentals to gain a deeper understanding of Spark internals
- Identify the operational tweaks to gain the maximum performance from Spark
- Describe how to use GraphX and MLlib for machine learning