We may earn an affiliate commission when you visit our partners.
Course image
Udacity logo

Site Reliability Engineering (SRE) Fluency

Richard Phung, Travis Scotto, and Sonny Sevin

Discover the fundamentals of Site Reliability Engineering, including Zero Trust Security, Service Level Objectives, capacity management, on-call effectiveness, and incident management. Enroll today!

What's inside

Syllabus

This lesson is a review of the core components required to implement a zero trust security system and how policy-based management systems allow us to "Never Trust, Always Verify".
Read more
In this lesson, we will learn about how SREs monitor using SLOs and SLIs. We will create queries in Prometheus and dashboard in Grafana.
System capacity is an essential part of ensuring reliability. This lesson discusses how to balance system capacity with costs to ensure that resources and money are not being wasted.
Having a solid on-call is very important to achieving peak reliability. This lesson discusses how to have balanced on-call shifts with a solid incident management process that your team can follow.

Good to know

Know what's good
, what to watch for
, and possible dealbreakers
Develops skills and knowledge relevant to the industry
Teaches using common monitoring tools and systems
Covers SRE fundamentals including security

Save this course

Save Site Reliability Engineering (SRE) Fluency to your list so you can find it easily later:
Save

Activities

Coming soon We're preparing activities for Site Reliability Engineering (SRE) Fluency. These are activities you can do either before, during, or after a course.

Career center

Learners who complete Site Reliability Engineering (SRE) Fluency will develop knowledge and skills that may be useful to these careers:
SRE
Site Reliability Engineers (SREs) are responsible for ensuring the reliability and performance of an organization's IT systems. Zero Trust Security, monitoring with SLOs and SLIs, capacity management, on-call effectiveness, and incident management are all essential skills for an SRE. This course covers all of these topics, and can help build a strong foundation for a career as an SRE.
Information Security Analyst
Information Security Analysts implement and maintain security measures to protect an organization's information systems. Zero Trust Security is a fundamental principle of information security, and this course can help build a strong foundation in this area. Additionally, the monitoring and incident management covered in this course are essential skills for an Information Security Analyst.
Security Consultant
Security Consultants help organizations improve their security posture. Zero Trust Security is a fundamental principle of information security, and this course can help build a strong foundation in this area. Additionally, the monitoring and incident management covered in this course are essential skills for a Security Consultant.
Cybersecurity Engineer
Cybersecurity Engineers design, implement, and maintain cybersecurity systems to protect an organization's networks and data from unauthorized access. Zero Trust Security is a key principle of cybersecurity, and this course can help build a strong foundation in this area. Additionally, the monitoring and incident management covered in this course are essential skills for a Cybersecurity Engineer.
IT Manager
IT Managers develop and execute IT strategies for an organization. The Zero Trust Security, monitoring with SLOs and SLIs, and capacity management covered in this course are all essential for an IT Manager to execute a successful IT strategy.
Chief Information Officer (CIO)
Chief Information Officers (CIOs) are responsible for managing all aspects of an organization's IT operations. Zero Trust Security, monitoring with SLOs and SLIs, and on-call management are all responsibilities of a CIO. As such, this course may be helpful in building a foundation for a career as a CIO.
Risk Analyst
Risk Analysts identify and assess risks to an organization's operations. Zero Trust Security is an important consideration for Risk Analysts, as it can help to reduce the risk of a data breach. This course can help build a foundation in Zero Trust Security for Risk Analysts.
Network Administrator
Network Administrators ensure that data flows efficiently between devices, applications, and users. Implementing Zero Trust Security, monitoring using SLOs and SLIs, and managing on-call shifts all fall within Network Administrator responsibilities. As this course covers these topics, it can help build a foundation for someone who wants to become a Network Administrator.
DevOps Engineer
DevOps Engineers work to bridge the gap between development and operations teams. Zero Trust Security is becoming increasingly important in DevOps, and this course may be helpful for a DevOps Engineer who wants to learn more about it. Additionally, monitoring with SLOs and SLIs can help DevOps Engineers ensure that their systems are performing as expected.
Software Engineer
Software Engineers design, develop, and maintain software applications. Zero Trust Security is becoming increasingly important in software development, and this course may be helpful for a Software Engineer who wants to learn more about it. Additionally, monitoring with SLOs and SLIs can help Software Engineers ensure that their applications are performing as expected.
Data Analyst
Data Analysts collect and analyze data to help organizations understand their business performance. Zero Trust Security is becoming increasingly important in data analytics, as data breaches can lead to the loss of sensitive information. Additionally, monitoring with SLOs and SLIs can help Data Analysts ensure that their systems are performing as expected.
Cloud Architect
Cloud Architects design and implement cloud computing solutions for organizations. Zero Trust Security is an important consideration for Cloud Architects, and this course can help build a foundation in this area. Additionally, monitoring with SLOs and SLIs can help Cloud Architects ensure that their systems are performing as expected.
Data Scientist
Data Scientists use data to build models that can help organizations make better decisions. Zero Trust Security is becoming increasingly important in data science, as data breaches can lead to the loss of sensitive information. Additionally, monitoring with SLOs and SLIs can help Data Scientists ensure that their models are performing as expected.
Systems Architect
Systems Architects are responsible for designing and maintaining the architecture of an organization's information technology systems. Zero Trust Security is an important consideration for Systems Architects, and this course can help build a foundation in this area. Additionally, monitoring with SLOs and SLIs can help Systems Architects ensure that their systems are performing as expected.
Compliance Manager
Compliance Managers are in charge of creating, testing, and executing policies and procedures to ensure compliance with laws and regulations. While this course does not explicitly teach about laws and regulations, the Zero Trust Security it covers will help Compliance Managers ensure better security policies.

Reading list

We've selected seven books that we think will supplement your learning. Use these to develop background knowledge, enrich your coursework, and gain a deeper understanding of the topics covered in Site Reliability Engineering (SRE) Fluency.
Provides foundational knowledge and practical advice on deploying and managing large-scale, distributed systems. It covers concepts such as SLOs, error budgets, on-call, and incident management. 
Presents a comprehensive overview of best practices for building secure and reliable software systems. It covers topics such as threat modeling, secure coding, and software testing. 
Provides a thorough foundation in reliability engineering principles. It covers topics such as probability theory, failure analysis, and reliability modeling. 
Provides a comprehensive overview of security engineering principles. It covers topics such as threat modeling, secure design, and security testing. 
Provides a deep dive into the challenges and best practices of designing data-intensive applications. It covers topics such as data modeling, scalability, and fault tolerance. 
Provides a practical guide to implementing DevOps principles and practices. It covers topics such as continuous integration, continuous delivery, and cultural change. 
Provides a comprehensive overview of Kubernetes, a popular container orchestration platform. It covers topics such as cluster management, deployment strategies, and troubleshooting. 

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Here are nine courses similar to Site Reliability Engineering (SRE) Fluency.
SRE Fundamentals and Security
Most relevant
SRE Infrastructure, Resiliency and Deployment Automation
Most relevant
Implementing Site Reliability Engineering (SRE)...
Most relevant
SRE for Azure Deep Dive
Most relevant
Introduction to Zero Trust
Most relevant
AZ-400: Designing and Implementing Microsoft DevOps...
Most relevant
Security Management and Governance
Most relevant
Establishing a Culture of Reliability
Most relevant
Site Reliability Engineering: Measuring and Managing...
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2024 OpenCourser