Observability with Grafana, Prometheus,Loki, Alloy and Tempo from Udemy

What's inside

Learning objectives

Fundamentals of observability (types of telemetry data, metric collection methods etc.)
Prometheus (installation, configuration and usage) comprising 21 lectures.
Installation of grafana on windows, mac, linux (multiple flavours) and with docker.
Architecture of highly available and highly scalable grafana for produciton use.
Dashboard design best practices (browser apps, backend apps and infrastructure)
Building dashboards and graphs in grafana
Creating and managing alerts and notifications in grafana
Integration with mysql, sql server, aws cloudwatch, gcp etc.

Grafana loki: retrieval and visualisation of logs
Administration of grafana (users, teams, oauth integraiton, ldap integration etc.)
Opentelemetry
Grafana alloy
Grafana tempo
Show more
Show less

Fundamentals of observability (types of telemetry data, metric collection methods etc.)
Prometheus (installation, configuration and usage) comprising 21 lectures.
Installation of grafana on windows, mac, linux (multiple flavours) and with docker.
Architecture of highly available and highly scalable grafana for produciton use.
Dashboard design best practices (browser apps, backend apps and infrastructure)
Building dashboards and graphs in grafana
Creating and managing alerts and notifications in grafana
Integration with mysql, sql server, aws cloudwatch, gcp etc.
Grafana loki: retrieval and visualisation of logs
Administration of grafana (users, teams, oauth integraiton, ldap integration etc.)
Opentelemetry
Grafana alloy
Grafana tempo
Show more
Show less

Syllabus

Introduction

Foundations of Observability

Evolution of Software Architecture and Observability

What is Monitoring

Methods of Monitoring

What is Observability

Types of Telemetry Data

Methods of Metric Collection

Methods of Collecting Metrics. Push vs. Scrape

Learn installation, configuration and use of Prometheus.

Installing Prometheus on Windows

Installing Prometheus on Mac OS

Installing Prometheus on Linux (Ubuntu)

Collecting Metrics (Unix , Linux and Mac)

Node Exporter - Part 1 (Linux, Mac)

Node Exporter - Part 2 (Linux, Mac)

Node Exporter - Part 3 (Linux, Mac)

Running Node Exporter as a Service on Ubuntu

Installing Node Exporter on Mac, with Homebrew

Collecting Metrics in Windows using MMI Exporter

Data Model of Prometheus

Data Types in Prometheus

Binary Arithmatic Operators in Prometheus

Binary Comparison Operators in Prometheus

Set Binary Operators in Prometheus

Matchers and Selectors in Prometheus

Aggregation Operators

Time Offsets

Clamping and Checking Functions

Delta and iDelta

Sorting and TimeStamp

Aggregations Over Time

Installing and Configuring Grafana

Let's compare the pros and cons of installing Grafana locally versus using the cloud-based Grafana.

You will learn how to install and configure Grafana on Ubuntu LTS 18.04 ( and above ). The step by step instructions of setting up Grafana is attached to this lecture as well.

Installing Grafana on Amazon Linux, Red Hat, CentOS, RHEL, and Fedora

Windows is the most popular operating system for servers and personal computers. Therefore it is essential to know that how Grafana can be installed and configured on a Windows instance.

If you are a proud Mac user, you can install Grafana directly on your Mac computer and use it to learn more about it. In this lecture you will learn that how you can install and configure Grafana using Homebrew.

Configuring Grafana

A quick and easy way of installing Grafana is using its Docker image. In this lecture you will see that how you can use Grafana's docker image to quickly setup your observability stack.

Learn about powerful features of Grafana and make modern dashboards

Dashboards in Grafana are designed for different purposes, such as monitoring browser applications or infrastructure. Each dashboard type is used by a different role or team in the organisation, who may have different KPIs to watch.

In this lecture I will explain the most common dashboard layouts and structure for each dashboard type.

The Shoe Hub is an imaginary company we will use throughout the course to explain how you can visualise business and technical metrics.

Connecting Grafana to Prometheus

Creating and Managing Dashboards in Grafana

Graph panel is suitable for creating charts and histograms. In this lecture you will learn how to use Graph panel and display the metrics from Graphite on it.

Multiple and Accumulative Queries

In this lecture you will visualise the data of different payment methods in the US so that we can have a good understanding how the customers prefer to pay.

Using the Data Transformations feature of Grafana, you can mix and match existing panel rows to create new rows, look up data or convert data types.

The Time Series panel is suitable for showing the data trend over time. However, ,we can compare different related values in percentage form using Pie Charts. For example, we can show the percentage of infrastructure failures are related to disk, what percentage is related to network and what percentage is related to power outage.

Sometimes, we want to compare a metric's current value(s) to the values(s) of the same metric but in the past. For example, you could display the current Shoe sales compared to last month's sales or make a week-on-week revenue comparison. Such graphs can be used to understand of the state of a metric easilywhether a metric's state is increasing or decreasing. For example we can see if the network errors have gone down since last week, or if our marketing efforts have paid off and our sales has gone up since last month. In this lecture we will learn that how we can do this using Grafana and Prometheus.

Practice : Working with Charts and Thresholds

Sometimes it is essential for us to know if the values of data points are above or below a given threshold. For example, if the network errors go above a certain number, or if the orders received per hour are unusually low. We achieve this by Thresholds in Grafana.

Variables, a key feature of Grafana, allow us to create dynamic dashboards and panels with less work and effort than when we hard-code everything.

Practie Creating Dynamid Dashboards

Solved: Creating Dynamic Dashboards

In Grafana, if we show two or more lines on a Graph panel, and the values of these lines are vastly different, then one or some of those lines may become so compressed that we may see their data points as zeros. For example of we show the response time of an IoT device that responds slowly, and the response time of an API that responds very quickly, on the same Graph panel, the response time of the API may be seem as a straight line with value of zero.

In this lecture we will learn that how we can overcome this problem.

Working with the Gauge and Bar Gauge Panels

Working with Alerts, Notifications and Annotations in Grafana

Alerts are defined based on thresholds or mathematical formulations in Grafana. Over time, the alerting system in Grafana has evolved, improved and become somewhat complex. In this lecture you will learn about the concepts and terminalogies of the Grafana Alerting System, and you will learn how this ecosystem works.

Alerts in Grafana are based on queries written in a data source-specific language, such as PromQL for Prometheus. The results of these queries are checked periodically, and if they violate a rule, such as a threshold, that we define, alerts are raised.

In this lecture you will learnt that how you define an alerting rule.

It is not practical to constantly watch the dashboards to see if alerts are raised. Instead, we deliver notifications in various formats, such as emails or Slack messages, to inform relevant people of the alert.

In this lecture you will learn that how you create contact points as well as notification policies to filter the notifications and direct them to the right people.

Slack is a popular collaboration tool that many teams use to chat, exchange team data and receive notifications. Grafana can send alert messages to Slack, too. In this lecture, you will learn how to send Slack notifications for a firing alert.

Sometimes, we do not want to send out notifications temporarily. For example, you may not want to send out notifications at midnight. In such cases we can use Grafana's ability to silence the alerts based on a define time period.

Annotations are a way to describe the rich events. In this lecture you will see that how you can use annotations to describe and understand your Grafana panels better.

Integration of Grafana with Cloudwatch, MySql and Elasticsearch

MySQL Is a very common database and it makes sense to use MySql when your data and metrics already reside in your MySQL database. This lecture will show you how to use MySQL as your data source.

Integration of Grafana with SQL Server

If you have deployed your systems to Amazon Web Services (AWS) you can connect Grafana to AWS's metric service called Amazon CloudWatch, and visualise the metrics of your AWS resources in Grafana, without having to move those metrics to a time-series database such as Prometheus.

With Grafana and GCP's monitoring API enabled, you can monitor your Google Cloud resources efficiently, without moving their metrics to a time series database such as Prometheus. In this lecture you will learn that how you can leverage the out-of-the-box dashboard of Grafana to setup your observability system in a few minutes.

Students will learn how to install Loki, send the logs to it, and analyse them in Grafana

About Grafana Loki

Options of Using Grafana Loki (Cloud vs. On-Prem)

Instaalling Grafana Loki with Docker

Installing Loki and Promtail on Linux (Ubuntu)

Ingesting Log Entries into Loki using Promtail

Creating and Attaching Static Labels

Dynamic Labels: Extracting Labels from Unstructured Logs

Visualising Loki Queries on Dashboards

Learn how to create users, assign roles and install plugins

Overview of Administration in Grafana

Organisations are great for giving a good shape to your observability platform so that it stays organised and well managed as it grows as it grows as it grows. In this lecture you will learn how you can work with the Organisations feature of grafana and administer teams and users.

One way of authenticating external users is OAuth. Google is a major identity provider and reliable, too. Many companies use Google Suite to manage their users and identities. These companies would like to authenticate their Grafana users against Google. In this lecture, we learn how external users can be authenticated using an OAuth provider such as Google.

Many companies use Active Directory or other LDAP-compatible directory services to manage their users, so they would prefer to authenticate their Grafana users with the existing directory users, too.

This video will teach us how to configure Grafana to authenticate users against a given Directory Service, such as Microsoft Active Directory.

You can extend the capability of your dashboards by using Plugins. This lecture will show you how you can setup plugins and use them.

Deployment of Grafana in a Production Environment.

When you deploy Grafana in a Production capacity, you must ensure that Grafana will be highly available and that a failure in part of your deployment will not take down Grafana or make it unavailable.
In this lecture, you will learn about the architecture of a highly available graffiti.

When Grafana is deployed in a heavily used Production environment, you must take measures to ensure that your deployment is scalable and can cope with increased load.
In this lecture we will upgrade our HA deployment of Grafana to a HA & Scalable deployment.

Open Telemetry and Grafana with Grafana Alloy

With the advent of cloud-native applications and the microservices architecture, commercial observability platforms gained attention and became famous. However, they can be costly, and once integrated with them, it may be pretty challenging to break away from them and adopt a different vendor's observability platform.
OpenTelemetry, or OTel, is an open-source initiative incubated by the Cloud Native Computing Foundation (CNCF) that aims to enable developers and DevOps engineers to generate, export, and collect telemetry data without being locked into a specific vendor.

Learn about the architecture of a scalable observability system based on Opentelemetry.

Learn about configuring Prometheus to receive Opentelemetry metrics.

Grafana Alloy is Grafana's Opentelemetry Collector. It can receive OTel metrics from various sources and deliver them to a variety of backend databases after processing them.

In this lecture, we will install Grafana Alloy locally on a Mac computer. Installation instructions for installing Grafana Alloy on Windows and Linux are provided at the end of this section.

Grafana Alloy plays a pivotal role in receiving, processing, and forwarding Opentelemetry signals to downstream systems, such as Prometheus. In this lecture, you will learn how to create receivers, processors, and exporters to achieve this goal.

In this lecture, we will analyse a microservice that produces a counter and exports it to Grafana Alloy via OTLP.

Installing Grafana Alloy on Ubuntu

Grafana Tempo: Tracing in Distributed Systems

Tracing in distributed systems, particularly within a microservices architecture, is crucial for understanding and optimizing system performance and reliability. Tracing becomes complex yet essential as modern applications are built using microservices, where various components communicate over networks.

Tracing involves tracking a request's journey across multiple microservices, providing insights into each service's performance, dependencies, and bottlenecks. Distributed tracing tools like Jaeger and Zipkin enable developers to visualize this journey, often represented as a trace or a series of interconnected spans.

In microservices, where each service is responsible for a specific function, tracing helps identify latency issues, failures, and inefficiencies that may occur at any point in the system. By correlating traces across services, developers can pinpoint the root cause of problems and optimize performance.

Moreover, tracing facilitates debugging and monitoring in production environments, aiding in troubleshooting and ensuring system reliability. It also supports distributed system testing, allowing developers to simulate various scenarios and analyze system behaviour under different conditions.

In this lecture you will learn all aspects of Telemetry and its relevance to Grafana.

Good to know

Know what's good

, what to watch for

, and possible dealbreakers

Develops knowledge and skills in Grafana and related tools, which are core skills for DevOps engineers, site reliability engineers, and data analysts

Taught by Aref Karimi, who has extensive experience in observability and software development

Explores observability and telemetry, which are standard in the IT industry

Covers a range of topics, from Prometheus to Opentelemetry, providing a comprehensive understanding of observability

Provides hands-on labs and interactive materials, fostering practical skills development

Offers integration with cloud services like AWS and GCP, making it relevant to modern cloud-based systems

Activities

Be better prepared before your course. Deepen your understanding during and after it. Supplement your coursework and achieve mastery of the topics covered in Observability with Grafana, Prometheus,Loki, Alloy and Tempo with these activities:

Create a resource list for Grafana monitoring

Show steps

Compile a list of useful resources to support your Grafana monitoring efforts.

Browse courses on Grafana

Show steps

Search for Grafana monitoring resources online.
Evaluate the relevance and quality of the resources.
Organize the resources into a list or document.

Join a Grafana study group

Show steps

Collaborate with other students to enhance your understanding of Grafana.

Browse courses on Grafana

Show steps

Find a Grafana study group online or in your local community.
Attend study group meetings regularly.
Participate in discussions and ask questions.

Attend a Grafana workshop

Show steps

Learn from experts and network with other Grafana users and developers.

Browse courses on Grafana

Show steps

Find a Grafana workshop that aligns with your interests.
Register for the workshop.
Attend the workshop and participate in the activities.

Five other activities

Expand to see all activities and additional details

Show all eight activities

Follow the Grafana documentation tutorial

Show steps

Gain hands-on experience setting up and configuring Grafana.

Browse courses on Grafana

Show steps

Follow the 'Getting Started' tutorial.
Follow the 'Creating your first dashboard' tutorial.
Follow the 'Alerting' tutorial.

Create Grafana dashboards and alerts

Show steps

Build practical skills in creating and managing Grafana dashboards and alerts.

Browse courses on Grafana

Show steps

Create a dashboard to visualize metrics from a sample application.
Create alerts to notify you of critical events.
Troubleshoot any issues you encounter.

Read Observability Engineering

Show steps

Review the techniques and practices for designing and building effective observability systems.

View Observability Engineering: Achieving Production... on Amazon

Show steps

Read the chapters on Monitoring, Metrics, and Dashboards.
Read the chapter on Alerting and Monitoring.
Read the chapter on Grafana.
Read the chapter on Loki.
Read the chapter on Tempo.

Develop a monitoring and alerting strategy for a web application

Show steps

Apply the concepts learned in the course to design and implement a comprehensive monitoring and alerting solution.

Browse courses on Observability

Show steps

Identify the key metrics to monitor.
Determine the thresholds for alerts.
Create Grafana dashboards to visualize the metrics.
Configure alerts to notify the appropriate stakeholders.
Test the monitoring and alerting system.

Build a custom Grafana plugin

Show steps

Extend the functionality of Grafana by creating a custom plugin.

Browse courses on Grafana

Show steps

Identify a need for a custom plugin.
Design the plugin's functionality.
Develop the plugin's code.
Test the plugin.
Release the plugin to the Grafana community.

Career center

Learners who complete Observability with Grafana, Prometheus,Loki, Alloy and Tempo will develop knowledge and skills that may be useful to these careers:

Observability Engineer

An Observability Engineer is responsible for designing and implementing observability systems. They work to ensure that systems are monitored and alerted on, and that data is available for analysis. This course can be useful for aspiring Observability Engineers as it provides a foundation in all aspects of observability. By understanding how to collect, store, and analyze data, Observability Engineers can build systems that provide valuable insights into system performance and behavior.

See salaries and explore the career path for Observability Engineer

DevOps Engineer

A DevOps Engineer is responsible for bridging the gap between development and operations teams. They work to ensure that software is delivered quickly and reliably. This course can be useful for aspiring DevOps Engineers as it provides a foundation in monitoring and alerting, which are key aspects of DevOps. By understanding how to monitor system performance and create alerts, DevOps Engineers can ensure that issues are identified and resolved quickly, minimizing downtime.

See salaries and explore the career path for DevOps Engineer

Performance Engineer

A Performance Engineer is responsible for optimizing the performance of software systems. They work to identify and resolve bottlenecks, and to ensure that systems meet performance requirements. This course can be useful for aspiring Performance Engineers as it provides a foundation in profiling and tracing. By understanding how to collect and analyze data about system performance, Performance Engineers can identify and resolve issues that impact performance.

See salaries and explore the career path for Performance Engineer

Systems Engineer

A Systems Engineer is responsible for designing, building, and maintaining computer systems. They work to ensure that systems are reliable, scalable, and secure. This course can be useful for aspiring Systems Engineers as it provides a foundation in systems engineering principles. By understanding how to design, build, and test systems, Systems Engineers can build systems that meet the needs of the business.

See salaries and explore the career path for Systems Engineer

Data Scientist

A Data Scientist is responsible for using data to solve business problems. They work to collect, analyze, and interpret data, and to develop models that can predict future outcomes. This course can be useful for aspiring Data Scientists as it provides a foundation in data science principles. By understanding how to collect, analyze, and interpret data, Data Scientists can develop models that can help businesses make better decisions.

See salaries and explore the career path for Data Scientist

Cloud Engineer

A Cloud Engineer is responsible for designing, building, and maintaining cloud computing systems. They work to ensure that systems are reliable, scalable, and secure. This course can be useful for aspiring Cloud Engineers as it provides a foundation in cloud computing concepts. By understanding how to design, build, and test cloud systems, Cloud Engineers can build systems that meet the needs of the business.

See salaries and explore the career path for Cloud Engineer

Data Engineer

A Data Engineer is responsible for designing, building, and maintaining data systems. They work to ensure that data is accurate, reliable, and accessible. This course can be useful for aspiring Data Engineers as it provides a foundation in metrics collection and analysis. By understanding how to collect, store, and analyze data, Data Engineers can build systems that provide valuable insights into business operations.

See salaries and explore the career path for Data Engineer

Software Developer

A Software Developer is responsible for designing, building, and maintaining software applications. This course can be useful for aspiring Software Developers as it provides a foundation in software engineering best practices. By understanding how to design, build, and test software, Software Developers can build high-quality applications that meet user needs.

See salaries and explore the career path for Software Developer

QA Engineer

A QA Engineer is responsible for testing software and hardware products to ensure that they meet quality standards. They work to identify bugs, write test cases, and execute tests. This course can be useful for aspiring QA Engineers as it provides a foundation in software testing principles. By understanding how to test software and write test cases, QA Engineers can help ensure that products are released with high quality.

See salaries and explore the career path for QA Engineer

Business Analyst

A Business Analyst is responsible for understanding the needs of a business and developing solutions to meet those needs. They work to gather requirements, analyze data, and develop recommendations. This course can be useful for aspiring Business Analysts as it provides a foundation in business analysis principles. By understanding how to gather requirements, analyze data, and develop recommendations, Business Analysts can help businesses make better decisions.

See salaries and explore the career path for Business Analyst

Product Manager

A Product Manager is responsible for the development and launch of new products. They work to define the product vision, set product strategy, and manage the product roadmap. This course can be useful for aspiring Product Managers as it provides a foundation in product management principles. By understanding how to define the product vision, set product strategy, and manage the product roadmap, Product Managers can help businesses launch successful products.

See salaries and explore the career path for Product Manager

Project Manager

A Project Manager is responsible for the planning, execution, and completion of projects. They work to define project scope, set project goals, and manage project resources. This course can be useful for aspiring Project Managers as it provides a foundation in project management principles. By understanding how to define project scope, set project goals, and manage project resources, Project Managers can help businesses complete projects successfully.

See salaries and explore the career path for Project Manager

Technical Writer

A Technical Writer is responsible for creating documentation for software and hardware products. They work to explain how products work, and to provide instructions on how to use them. This course can be useful for aspiring Technical Writers as it provides a foundation in technical writing principles. By understanding how to write clear and concise documentation, Technical Writers can help users understand how to use products effectively.

See salaries and explore the career path for Technical Writer

Technical Support Specialist

A Technical Support Specialist is responsible for providing technical support to users. They work to troubleshoot problems, answer questions, and resolve issues. This course can be useful for aspiring Technical Support Specialists as it provides a foundation in troubleshooting and problem-solving. By understanding how to troubleshoot problems and resolve issues, Technical Support Specialists can help users get the most out of their products.

See salaries and explore the career path for Technical Support Specialist

Site Reliability Engineer

A Site Reliability Engineer (SRE) is responsible for the design, building, and maintenance of software systems. They work to ensure that systems are reliable, scalable, and performant. This course can be useful for aspiring SREs as it provides a foundation in observability, which is a key aspect of maintaining reliable systems. By understanding how to collect, store, and analyze data about system performance, SREs can identify and resolve issues before they impact users.

See salaries and explore the career path for Site Reliability Engineer

Observability with Grafana, Prometheus,Loki, Alloy and Tempo

What's inside

Learning objectives

Syllabus

Good to know

Save this course

Activities

Career center

Reading list

Share

Similar courses