The Introduction to ETL Management with Airflow training course is designed to demonstrate the use of Airflow schedule and maintain numerous Extract, Transform and Load (ETL) processes running on a large scale Enterprise Data Warehouse (EDW).
The course begins with an introduction to Airflow, including a brief background and an exploration of the Airflow framework, database and User Interface (UI). Next, the course dives into Airflow development including operators and plugins, Directed Acyclic Graphs (DAGs), and scheduling. The course concludes with a session on deploying with Airflow and complex task dependency management.
Purpose
|
Learn how to use Apache Airflow to manage data warehouses. |
Audience
|
DevOps engineers who want to monitor their enterprise data warehouses. |
Role
| Project Manager - Software Developer - System Administrator - Technical Manager |
Skill Level
| Intermediate |
Style
| Hack-a-thon - Learning Spikes - Workshops |
Duration
| 2 Days |
Related Technologies
| Big Data Training | Apache Airflow | Apache | Server Administration |
Productivity Objectives
- Assess how to organize and arrange scheduling
- Determine how to standardize Extract, Transform and Load (ETL) formats and processes
- Integrate Scheduling code into regular code flows