TazLab K8s Monitoring

Observability is managed as code so that dashboards and alerting rules survive a cluster rebirth.

The Monitoring Stack

  • Helm: prometheus-community/kube-prometheus-stack
  • Namespace: monitoring
  • Storage: Grafana state is persisted in tazlab-db (PostgreSQL).
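
As a rough illustration of the storage setting, the Helm values fragment below shows one way Grafana could be pointed at a PostgreSQL database through the chart's Grafana subchart. It is a sketch under stated assumptions, not the actual TazLab values: the host, database name, user, and Secret name are hypothetical placeholders.

```yaml
# Illustrative values for prometheus-community/kube-prometheus-stack.
# All connection details below are hypothetical placeholders.
grafana:
  grafana.ini:
    database:
      type: postgres
      host: tazlab-db.example.svc:5432   # hypothetical host:port
      name: grafana                      # hypothetical database name
      user: grafana                      # hypothetical database user
  # Supply the password from a Secret rather than writing it into values.
  envValueFrom:
    GF_DATABASE_PASSWORD:
      secretKeyRef:
        name: grafana-db-credentials     # hypothetical Secret name
        key: password
```

Keeping the password in a Secret means the values file can live in Git alongside the rest of the stack definition.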

Dashboard Management

Dashboards are stored in infrastructure/operators/monitoring/dashboards/ as Kubernetes ConfigMap resources.

  • Label: Each ConfigMap carries the label grafana_dashboard: "1".
  • Sidecar: Grafana uses a sidecar to watch these ConfigMaps and load them automatically.
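
The labelling convention above can be sketched as a minimal ConfigMap. The resource name, file name, and dashboard JSON here are hypothetical; a real dashboard would contain the full JSON exported from Grafana.

```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: cluster-health-dashboard   # hypothetical name
  namespace: monitoring
  labels:
    grafana_dashboard: "1"         # label the Grafana sidecar watches for
data:
  # Hypothetical file name; the value is the dashboard JSON model.
  cluster-health.json: |
    {
      "title": "Cluster Health",
      "uid": "cluster-health",
      "panels": []
    }
```

Because the dashboard lives in a ConfigMap rather than only in Grafana's database, it is recreated together with the rest of the cluster state.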

Active Dashboards

  • Cluster Health: High-level node and pod metrics.
  • Nodes-Pro: Detailed hardware and kernel metrics.
  • Flux Stats: Reconciliation and image automation success rates.

See Also