Alerting on numeric data

With certain data sources, numeric data that is not time series can be alerted on directly or passed into Server Side Expressions (SSE). This lets more of the processing happen within the data source, which can be more efficient, and it can also simplify alert rules. When alerting on numeric data rather than time series data, there is no need to reduce each labeled time series to a single number. Instead, labeled numbers are returned to Grafana.
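To make the contrast concrete, here is a minimal Python sketch. It is not Grafana code: the label sets and values are made up for illustration, and the mean is only one possible reducer.

```python
from statistics import mean

# Time series data: every labeled series carries many points, so an extra
# reduce step is needed to get one number per label set.
# (Hypothetical sample values.)
series = {
    ("web1", "/etc"): [4.0, 3.0, 2.0],
    ("web2", "/var"): [5.0, 4.0, 3.0],
}
reduced = {labels: mean(points) for labels, points in series.items()}

# Numeric data: the data source has already done the aggregation and
# returns one number per label set, so no reduce step is required.
numeric = {
    ("web1", "/etc"): 3.0,
    ("web2", "/var"): 4.0,
}
```

Both paths end with one number per label set; with numeric data the aggregation simply happens inside the data source query.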

Tabular Data

This feature is supported with backend data sources that query tabular data:

  • SQL data sources such as MySQL, Postgres, MSSQL, and Oracle.
  • The Azure Kusto based services: Azure Monitor (Logs), Azure Monitor (Azure Resource Graph), and Azure Data Explorer.

With these data sources, a query used with Grafana managed alerts or SSE is considered numeric if:

  • The “Format AS” option is set to “Table” in the data source query.
  • The table response returned to Grafana from the query includes only one numeric (e.g. int, double, float) column, and optionally additional string columns.

If there are string columns, those columns become labels. The name of the column becomes the label name, and the value in each row becomes the value of the corresponding label. If multiple rows are returned, each row should be uniquely identified by its labels.
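The rules above can be sketched in Python as follows. This is an illustrative approximation, not Grafana's implementation: the function name `table_to_labeled_numbers` is hypothetical, and detecting the numeric column from the first row is a simplifying assumption.

```python
from numbers import Number


def table_to_labeled_numbers(columns, rows):
    """Turn a table into (labels, value) pairs, one pair per row.

    Hypothetical helper: requires exactly one numeric column, treats the
    remaining string columns as labels, and rejects duplicate label sets.
    """
    if not rows:
        return []

    def is_number(v):
        # bool is a subclass of int in Python, so exclude it explicitly.
        return isinstance(v, Number) and not isinstance(v, bool)

    # Assumption: column types are inferred from the first row.
    value_cols = [i for i, v in enumerate(rows[0]) if is_number(v)]
    if len(value_cols) != 1:
        raise ValueError("expected exactly one numeric column")
    value_idx = value_cols[0]

    seen = set()
    result = []
    for row in rows:
        # Column name -> label name, cell value -> label value.
        labels = {name: row[i] for i, name in enumerate(columns) if i != value_idx}
        if not all(isinstance(v, str) for v in labels.values()):
            raise ValueError("label columns must contain strings")
        key = tuple(labels.items())
        if key in seen:
            raise ValueError(f"rows are not uniquely identified by labels: {labels}")
        seen.add(key)
        result.append((labels, float(row[value_idx])))
    return result


# Sample data, made up for illustration.
columns = ["Host", "Disk", "PercentFree"]
rows = [("web1", "/etc", 3), ("web2", "/var", 4)]
labeled = table_to_labeled_numbers(columns, rows)
```

Here `labeled` holds one labeled number per row, for example the label set Host=web1, Disk=/etc with the value 3.0.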

Example

For a MySQL table called “DiskSpace”:

Time         Host  Disk  PercentFree
2021-June-7  web1  /etc  3
2021-June-7  web2  /var  4
2021-June-7  web3  /var  8

You can filter the data on time in the query without returning a time series to Grafana. For example, the following query supports an alert that triggers per Host and Disk when less than 5% of space is free:

    SELECT
      Host,
      Disk,
      -- Report the average only when it is below 5%; otherwise report 0.
      CASE WHEN PercentFree < 5.0 THEN PercentFree ELSE 0 END AS PercentFree
    FROM (
      SELECT
        Host,
        Disk,
        AVG(PercentFree) AS PercentFree
      FROM DiskSpace
      WHERE $__timeFilter(Time)
      GROUP BY
        Host,
        Disk
    ) AS avg_free

This query returns the following Table response to Grafana:

Host  Disk  PercentFree
web1  /etc  3
web2  /var  4
web3  /var  0
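The result above can be reproduced locally with a small Python sketch that uses the standard library's SQLite driver in place of MySQL. Several things here are assumptions made for illustration: the column types, the ISO-formatted dates, and the explicit `BETWEEN` range, which stands in for the time filter that Grafana would supply from the dashboard time range.

```python
import sqlite3

# In-memory SQLite database standing in for the MySQL table (assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE DiskSpace (Time TEXT, Host TEXT, Disk TEXT, PercentFree REAL)"
)
conn.executemany(
    "INSERT INTO DiskSpace VALUES (?, ?, ?, ?)",
    [
        ("2021-06-07", "web1", "/etc", 3),
        ("2021-06-07", "web2", "/var", 4),
        ("2021-06-07", "web3", "/var", 8),
    ],
)

query = """
SELECT Host, Disk,
       CASE WHEN PercentFree < 5.0 THEN PercentFree ELSE 0 END AS PercentFree
FROM (
    SELECT Host, Disk, AVG(PercentFree) AS PercentFree
    FROM DiskSpace
    WHERE Time BETWEEN ? AND ?   -- explicit range instead of the time filter
    GROUP BY Host, Disk
) AS avg_free
ORDER BY Host, Disk
"""

# One row per (Host, Disk) pair: two label columns and one numeric column.
result = conn.execute(query, ("2021-06-01", "2021-06-30")).fetchall()
conn.close()
```

Each tuple in `result` carries the two string columns that become labels and the single numeric value that the alert condition is evaluated against.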

When this query is used as the condition in an alert rule, rows with a non-zero value are alerting. As a result, three alert instances are produced:

Labels                 Status
{Host=web1,Disk=/etc}  Alerting
{Host=web2,Disk=/var}  Alerting
{Host=web3,Disk=/var}  Normal
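The mapping from labeled numbers to alert states can be sketched in Python as below. This is a simplified illustration rather than Grafana's evaluation code: the function name `alert_instances` is hypothetical, and the only rule modeled is that a non-zero value is alerting.

```python
def alert_instances(labeled_numbers):
    """Return (labels, status) pairs: non-zero values alert, zero is normal.

    Hypothetical helper used only to illustrate the rule described above.
    """
    return [
        (labels, "Alerting" if value != 0 else "Normal")
        for labels, value in labeled_numbers
    ]


# The table response from the example, as (labels, value) pairs.
response = [
    ({"Host": "web1", "Disk": "/etc"}, 3),
    ({"Host": "web2", "Disk": "/var"}, 4),
    ({"Host": "web3", "Disk": "/var"}, 0),
]
instances = alert_instances(response)
```

Each row yields its own alert instance because its label set is unique, which is why three instances appear even though only two are alerting.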