Checks and scripts

The basic idea is that monitoring is performed through Checks which execute a specific Script (only Python is supported for now) as a Kubernetes job.

Check statuses are defined by the following elements:

The check output, represented as a String
The check result represented by one of the following numbers
- 0: okay
- 1: warning
- 2: critical
- 3: unknown
- 4: pending

For example, if we would like to monitor a certain service, we could define a Check as follows:

apiVersion: pinglow.io/v1alpha1
kind: Check
metadata:
  name: my-service-reachability
  namespace: pinglow
spec:
  scriptRef: check-service
  interval: 300
  secretRefs:
    - my-service-definition

As we can see, the Check references a standard secret and so its keys and values will be automatically passed as environment variables in the Kubernetes job used to run the script.

apiVersion: pinglow.io/v1alpha1
kind: Script
metadata:
  name: check-service
  namespace: pinglow
spec:
  language: Python
  python_requirements:
    - requests
  content: |
    import requests
    import os
    import sys

    url = os.environ.get("URL")

    response = requests.get(url, timeout=5)

    if response.status_code != 200:
      print("Error in contacting endpoint")
      sys.exit(2)

Performance data

When writing a script, it is possible to print not only the general output, but also some performance data that will be stripped out from the output and wrote separately in a dedicated table in TimescalDB (and returned also separately by the API).

To specify both an output and some performance data, it is possible to use the following format: output|key=value,key2=value.

For example, a script which may read some temperature and humidity data may be partially similar to what depicted below:

apiVersion: pinglow.io/v1alpha1
kind: Script
metadata:
  name: script-temperature-humidity
  namespace: pinglow
spec:
  language: Python
  content: |
    temperature = getTemperature()
    humidity = getHumidity()

    print(f"Your temperature and humidity are OK!|temperature={temperature},humidity={humidity}")

Passive checks

Sometimes, we do not want an active action from a check but instead we would like an external system to send the results of a certain operation to our system. For this reason, it is possible to define a Check as passive.

In this case, no script or check interval is needed and it is possible to set its status through the associated API (check out the API reference for more information).

apiVersion: pinglow.io/v1alpha1
kind: Check
metadata:
  name: my-passive-check
  namespace: pinglow
spec:
  passive: true

Clearly, it is possible to get notifications also for passive check results. See the notifications section for more information on how to configure them!

Performance data​

Passive checks

Performance data