SRE is the hot way to manage apps in production - but are you and your systems ready for it? This course teaches you how to design systems for maximum reliability, find the gaps in your current system design and adopt SRE smoothly and effectively.
SRE is the hot way to manage apps in production - but are you and your systems ready for it? This course teaches you how to design systems for maximum reliability, find the gaps in your current system design and adopt SRE smoothly and effectively.
Before you adopt SRE you need to be sure that your systems are designed to work well with SRE practices. In this course, Incorporating Site Reliability Engineering (SRE) in Your System Design, you’ll learn how to design systems with SRE in mind and assess what's missing in your existing systems. First, you’ll discover how to architect apps for reliability, so temporary problems are automatically managed and bigger issues are quickly alerted. Next, you’ll explore how observability design supports SRE and helps you get your apps back online. Finally, you’ll delve into how to effectively measure and report on service levels. When you’re finished with this course, you’ll have the skills and knowledge of system design needed to bring your own apps into SRE.
OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.
Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.
Find this site helpful? Tell a friend about us.
We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.
Your purchases help us maintain our catalog and keep our servers humming without ads.
Thank you for supporting OpenCourser.