Site Reliability Engineering

Course Summary

The Site Reliability Engineering training course introduces a discipline whose main goals are to create ultra-scalable and highly reliable software systems. Site Reliability Engineering (SRE) incorporates aspects of software engineering and applies them to infrastructure and operations problems. SRE was created and implemented at Google in the early 2000s to make its sites run more smoothly, efficiently, and reliably.

This course begins by defining SRE and explaining how software engineering practices are applied to infrastructure and operations problems. Next, the course gives a high-level overview of the history of SRE, the differences between SRE and DevOps, and the roles and responsibilities of an SRE. From there, students move into budgeting, planning, and monitoring. The course concludes with students working through practical examples and learning best practices.

Purpose
Learn to put the principles of SRE into practice.
Audience
Developers and developer teams looking to put the principles of SRE into practice.
Role
DevOps Engineer - Project Manager - QA - Software Developer - System Administrator - Technical Manager - Web Developer
Skill Level
Introduction
Style
Learning Spikes - Workshops
Duration
2 Days
Related Technologies
SRE | Testing

 

Productivity Objectives
  • Compare the differences between SRE and DevOps.
  • Describe the roles and responsibilities of SRE team members.
  • Demonstrate understanding of SRE processes and best practices.

What You'll Learn:

In the Site Reliability Engineering training course, you'll learn:
  • SRE Overview
    • Origins of SRE
    • Differences between SRE and DevOps
    • High-level responsibilities of an SRE
  • SLAs, SLOs, SLIs
  • Risk and Error Budgeting
  • Capacity Planning
  • Monitoring
  • Intelligent Alerting
  • Provisioning
    • Practical Examples
  • Monitoring and Alerting
    • Practical Examples
  • Reacting To/Preventing Problems
    • Practical Examples
  • Error/Risk Budgeting
    • Practical Examples
  • SRE Best Practices
    • Tips and Tricks

“I appreciated the instructor's technique of writing live code examples rather than using fixed slide decks to present the material.”

VMware

Dive in and learn more

When transforming your workforce, it's important to have expert advice and tailored solutions. We can help. Tell us your unique needs and we'll explore ways to address them.
