We may earn an affiliate commission when you visit our partners.
Course image
Google Cloud Training

Using a combination of presentations, demos, hands-on labs, and real-world case studies, attendees gain experience with full-stack monitoring, real-time log management and analysis, debugging code in production, tracing application performance bottlenecks, and profiling CPU and memory usage.

This class is intended for the following participants:

  • Cloud architects
  • Administrators
  • SysOps personnel
  • Cloud developers
  • DevOps personnel

What's inside

Learning objectives

  • Explain the purpose and capabilities of google cloud observability.
  • Implement monitoring for multiple cloud projects.
  • Create effective monitoring dashboards and alerts.
  • Create alerting policies, uptime checks and alerts.
  • Explain how to collect logs using cloud logging and export for further analysis.

Syllabus

0. Introduction
Welcome to Logging and Monitoring in Google Cloud! We will cover the pre-requisites, audience and the course objectives.
1. Introduction to Google Cloud Observability
Read more
In this module, we will take some time to do a high-level overview of the various products which comprise Google Cloud's logging, monitoring, and observability suite.
2. Monitoring Critical Systems
Monitoring is all about keeping track of exactly what's happening with the resources we've spun up inside of Google's Cloud. In this module, we'll take a look at options and best practices as they relate to monitoring project architectures. We'll differentiate the core IAM roles needed to decide who can do what as it relates to monitoring. Just like architecture, this is another crucial early step. We will examine some of the Google created default dashboards, and see how to use them appropriately. We will create charts and use them to build custom dashboards to show resource consumption and application load. And, finally, we will define uptime checks to track liveliness and latency.
3. Alerting Policies
Alerting gives timely awareness to problems in your cloud applications so you can resolve the problems quickly. In this module, you will learn how to develop alerting strategies, define alerting policies, add notification channels, identify types of alerts and common uses for each, construct and alert on resource groups, and manage alerting policies programmatically.
4. Advanced Logging and Analysis
In this module, we will examine some of Google Cloud's advanced logging and analysis capabilities. Specifically, in this module you will learn to identify and choose among resource tagging approaches, define log sinks, create monitoring metrics based on log entries, link application errors to Logging and other operation tools using Error Reporting, and export logs to BigQuery for long term storage and SQL based analysis.
5. Working with Audit Logs
In this module, we will examine how to use Cloud Audit logs. You will learn how to use Cloud Audit logs to answer the question, “Who, did what, and when?” We will also cover best practices for Audit Logging.
6. Course Summary
We will summarize the topics covered in this couse.
7. Course Resources
Student PDF links to all modules.

Save this course

Save Logging and Monitoring in Google Cloud to your list so you can find it easily later:
Save

Activities

Coming soon We're preparing activities for Logging and Monitoring in Google Cloud. These are activities you can do either before, during, or after a course.

Career center

Learners who complete Logging and Monitoring in Google Cloud will develop knowledge and skills that may be useful to these careers:

Reading list

We haven't picked any books for this reading list yet.
Provides a complete guide to Cloud Logging, including its features, use cases, and how to use it effectively. It good resource for anyone who wants to learn more about Cloud Logging.
Provides a cloud-native focused guide to logging and monitoring in Google Cloud. It covers a variety of topics, including how to use Cloud Logging, Cloud Monitoring, and Stackdriver Trace and Debug to monitor your cloud-native applications.
Provides a comprehensive overview of Site Reliability Engineering (SRE), a discipline focused on improving the reliability, performance, and efficiency of complex distributed systems. It covers topics such as service level objectives (SLOs), error budgets, monitoring and alerting, capacity planning, and incident response.
Provides a comprehensive overview of the art of monitoring. It covers topics such as the different types of monitoring tools, the principles of effective monitoring, and the challenges of monitoring complex systems.
Provides a comprehensive overview of Prometheus, an open-source monitoring system. It covers topics such as installing and configuring Prometheus, creating alerts, and using Prometheus to monitor different types of systems.
Provides a comprehensive overview of Jaeger, an open-source distributed tracing system. It covers topics such as installing and configuring Jaeger, creating traces, and using Jaeger to monitor different types of systems.
Provides a comprehensive overview of performance engineering. It covers topics such as performance metrics, data collection and analysis, and performance modeling.
Provides a practical guide to site reliability engineering (SRE), including topics such as monitoring, alerting, and incident response. It provides exercises and case studies to help readers apply SRE principles and practices to their own organizations.
Covers the principles and practices of observability engineering, which is essential for monitoring and alerting systems. It provides guidance on how to design, implement, and operate observability systems to ensure that they are effective and reliable.
Provides a comprehensive overview of system and network administration, including topics such as monitoring, alerting, and incident response. It is particularly relevant for understanding the practical aspects of implementing and managing alerting policies in real-world environments.
Focuses on incident management for DevOps teams, covering topics such as monitoring, alerting, and incident response. It provides guidance on how to design and implement alerting policies that are effective in identifying and responding to incidents.
Provides a comprehensive overview of alerting best practices and strategies. It covers topics such as designing effective alerts, reducing alert fatigue, and integrating alerting with other monitoring and incident response systems.
Introduces the use of machine learning for observability, including topics such as anomaly detection, predictive analytics, and root cause analysis. It provides guidance on how to design and implement alerting policies that leverage machine learning to improve their effectiveness and accuracy.
Provides a comprehensive overview of DevOps, covering topics such as monitoring, alerting, and incident response. It provides guidance on how to implement DevOps practices in organizations to improve the reliability and availability of software systems.
Provides a comprehensive overview of incident management and response, including topics such as monitoring, alerting, and incident response. It provides guidance on how to design and implement incident management processes and procedures in organizations.
A comprehensive reference guide for Elasticsearch, covering architecture, data ingestion, searching, and log analysis use cases.
Provides a comprehensive guide to auditing cloud security. It covers everything from risk assessment to incident response.
Provides a practical guide to auditing cloud security. It covers everything from planning and scoping an audit to reporting on findings.

Share

Help others find this course page by sharing it with your friends and followers:

Similar courses

Similar courses are unavailable at this time. Please try again later.
Our mission

OpenCourser helps millions of learners each year. People visit us to learn workspace skills, ace their exams, and nurture their curiosity.

Our extensive catalog contains over 50,000 courses and twice as many books. Browse by search, by topic, or even by career interests. We'll match you to the right resources quickly.

Find this site helpful? Tell a friend about us.

Affiliate disclosure

We're supported by our community of learners. When you purchase or subscribe to courses and programs or purchase books, we may earn a commission from our partners.

Your purchases help us maintain our catalog and keep our servers humming without ads.

Thank you for supporting OpenCourser.

© 2016 - 2025 OpenCourser