Using Alertmanager with Prometheus

30 minutes
  • 5 Learning Objectives

About this Hands-on Lab

Prometheus does more than record metrics. One of its core capabilities is defining alerting rules and routing the resulting alerts to any alert management endpoint we choose, which in this Hands-On Lab is Prometheus's own companion project, Alertmanager.

Once we have defined our desired alerting thresholds, we need to set up routes and receivers in Alertmanager, ensuring notifications reach the right people at the correct frequency and with the right information.

Learning Objectives

Successfully complete this lab by achieving the following learning objectives:

Add a Rules File
  1. Add a rules file configuration to the Prometheus config:

    $ sudo $EDITOR /etc/prometheus/prometheus.yml

    rule_files:
      - "rules.yml"

    Save and exit.

  2. Create and open the rules.yml file:

    $ sudo $EDITOR /etc/prometheus/rules.yml

Add an alert to track uptime
  1. Before creating the alert itself, create a recording rule for the desired metric:

    groups:
      - name: uptime
        rules:
          - record: job:uptime:average:ft
            expr: avg without (instance) (up{job="forethought"})
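
    To see what this recording rule produces, here is a quick illustration; the instance labels shown are hypothetical and the lab's real targets may differ:

    # Assuming the "forethought" job scrapes the two application servers:
    #   up{job="forethought", instance="app1:80"} = 1   (healthy)
    #   up{job="forethought", instance="app2:80"} = 0   (down)
    # Averaging away the instance label yields:
    #   job:uptime:average:ft = 0.5
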
  2. Create the alert that fires when the application has gone down, based on this recording rule:

    groups:
      - name: uptime
        rules:
          - record: job:uptime:average:ft
            expr: avg without (instance) (up{job="forethought"})
          - alert: ForethoughtApplicationDown
            expr: job:uptime:average:ft < .75
            for: 1m
            labels:
              severity: critical
              team: devops

    Save and exit.
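
    Before restarting Prometheus, you can optionally validate the new rules. Assuming the promtool utility that ships with Prometheus is installed on the monitoring server:

    $ promtool check rules /etc/prometheus/rules.yml
    $ promtool check config /etc/prometheus/prometheus.yml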

  3. Restart Prometheus:

    $ sudo systemctl restart prometheus
    $ sudo systemctl status prometheus
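
    To confirm the rule group loaded, check the Rules page of the Prometheus web UI or query the rules API (assuming Prometheus is listening on its default port, 9090):

    $ curl -s localhost:9090/api/v1/rules

    The response should list the uptime group containing both the recording rule and the ForethoughtApplicationDown alert.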

Configure Alertmanager to use an SMTP smarthost
  1. Open the Alertmanager configuration file:

    $ sudo $EDITOR /etc/alertmanager/alertmanager.yml

  2. Define the global settings:

    global:
      resolve_timeout: 5m
      smtp_smarthost: 'localhost:25'
      smtp_from: 'prometheus'
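
    Note that smtp_smarthost assumes a mail transfer agent is listening on port 25 of the localhost. If you want to verify that on the monitoring server, one quick check is:

    $ ss -ltn | grep ':25'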

Set up Alertmanager routing
  1. Set up the backup route:

    route:
      receiver: 'email_backup'
      group_by: ['alertname']
      group_wait: 10s
      group_interval: 10s
      repeat_interval: 1m

  2. Set up the route for critical alerts:

    route:
      receiver: 'email_backup'
      group_by: ['alertname']
      group_wait: 10s
      group_interval: 10s
      repeat_interval: 1m
      routes:
        - match:
            severity: 'critical'
          group_by: ['team']
          receiver: 'email_pager'
  3. Set up the route for team alerts:

    route:
      receiver: 'email_backup'
      group_by: ['alertname']
      group_wait: 10s
      group_interval: 10s
      repeat_interval: 1m
      routes:
        - match:
            severity: 'critical'
          group_by: ['team']
          receiver: 'email_pager'
          routes:
            - match:
                team: devops
              receiver: 'email_devops'

Create the needed receivers
  1. Create the receivers:

    receivers:
      - name: 'email_backup'
        email_configs:
          - to: 'alerts@forethoughtapp.io'
      - name: 'email_pager'
        email_configs:
          - to: 'oncall@forethoughtapp.io'
      - name: 'email_devops'
        email_configs:
          - to: 'devops@forethoughtapp.io'

    Save and exit.
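
    If the amtool utility that ships with Alertmanager is available, you can also validate the finished file before restarting the service:

    $ amtool check-config /etc/alertmanager/alertmanager.yml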

  2. Restart Alertmanager:

    $ sudo systemctl restart alertmanager
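
    Optionally, confirm the routing tree behaves as expected. If amtool is installed, it can simulate which receiver an alert carrying our rule's labels would reach; given the configuration above, it should resolve to email_devops:

    $ amtool config routes test --config.file=/etc/alertmanager/alertmanager.yml severity=critical team=devops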

Additional Resources

Now that your team has Prometheus set up and monitoring your systems, you're tasked with configuring alerting so your monitoring stack does more than just track data: it also alerts the right people when something goes wrong.

Before you set up your Alertmanager routes, craft a test alert that triggers whenever more than 25% of the forethought endpoints are down. Give it the following labels:

  • severity: critical
  • team: devops

And ensure the for time is set to 1 minute. Save the rule to a file called rules.yml in /etc/prometheus.

Once the alert is set up, you need to create a series of Alertmanager routes that satisfy the following requirements:

  • Set the following global settings:
    • An SMTP smarthost on the localhost
    • Set the email sender name to prometheus
  • An overall backup route that groups by alert name and uses the email_backup receiver
    • This route should be set up to use the default group_by, group_wait, and repeat_interval times
  • A route that matches any tickets with a severity of critical
    • Send these to the email_pager
    • Group by team
  • A route that matches for the team called devops
    • Send these to the email_devops receiver
  • Three email receivers:
    • email_backup that sends an email to alerts@forethoughtapp.io
    • email_pager that sends an email to oncall@forethoughtapp.io
    • email_devops that sends an email to devops@forethoughtapp.io

Three servers are provided for this Hands-On Lab: one monitoring server and two application servers, both of which are already being monitored by Prometheus. The monitoring server already has Prometheus, Alertmanager, and Grafana installed, with endpoints set up. You can use systemctl to manage any of these services.

To test the alert, stop the application container on one of the application servers by running sudo docker stop ft-app.
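
After the container stops, the job:uptime:average:ft value for the forethought job drops to 0.5 (one of two instances down), which is below the 0.75 threshold. Once the for duration has elapsed, the alert should move from pending to firing; you can watch this on the Alerts page of the Prometheus web UI or, assuming the default port, via the API:

    $ curl -s localhost:9090/api/v1/alerts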

