Site Reliability Engineers must have the right tools and strategies to perform in a fast-paced technical environment. Nine competency areas guide the successful practice of IBM Cloud SREs.
● Applying Site Reliability Engineering principles
Site Reliability Engineers must have the right tools and strategies to perform in a fast-paced technical environment. Nine competency areas guide the successful practice of IBM Cloud SREs.
● Applying Site Reliability Engineering principles
● Operations
● Monitoring and incident management
● Security and compliance
● Compute infrastructure
● Networking
● Storage and data management
● Reliability and resiliency
● Deployment automation
In this second course of the three-part Professional Certificate in Site Reliability Engineering (SRE), you will focus on the following five SRE competencies:
● Compute infrastructure
● Networking
● Storage and data management
● Reliability and resiliency
● Deployment automation
NOTE: The remaining four SRE competencies are covered in Course 1: SRE Fundamentals and Security.
This course covers approximately 50% of the required content to help you prepare for the “IBM Certified Professional SRE - Cloud V2” certification exam.
If you are interested in pursuing the “IBM Certified Professional SRE - Cloud V2” certification, to improve your passing success, we recommend that you complete all three offerings of the Professional Certificate in Site Reliability Engineering (SRE) to ensure a successful certification exam experience.
Compute infrastructure
● Troubleshoot VMs, IBM Kubernetes Service (IKS), Red Hat OpenShift and serverless services on IBM Cloud
● Configure for high availability and scalability
● Explain the impact of compute on service performance
Networking
● Troubleshoot external connections to IBM Cloud
● Troubleshoot inter service connectivity on IBM Cloud
● Explain the reliability ramifications of IBM Cloud networking features
● Explain the impact of networking on service performance
Storage and data management
● Manage storage and data attributes
● Manage data replication and retention
● Explain the impact of storage on service performance
● Monitor data security and compliance
● Identify storage data durability and capacity management
Reliability and resiliency
● Design and improve reliability for the system/service
● Design for failure and recovering from failure
Deployment automation
● Design non-disruptive deployment
● Troubleshoot provisioning of IBM Cloud resources
● Implement Infrastructure as Code
● Explain the responsibilities of the SRE to the CI/CD Pipelines
● Troubleshoot CI/CD pipelines
OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.
Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.
Find this site helpful? Tell a friend about us.
We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.
Your purchases help us maintain our catalog and keep our servers humming without ads.
Thank you for supporting OpenCourser.