Skip to main content

Monitoring

Learn how to monitor your KubeZero platform and applications.

Overview

KubeZero includes a comprehensive monitoring stack that provides visibility into your platform's health and performance.

Monitoring Components

  • Prometheus for metrics collection
  • Grafana for visualization
  • Alertmanager for alerting
  • Jaeger for distributed tracing

Setting Up Monitoring

# Enable monitoring in your stack configuration
monitoring:
enabled: true
prometheus:
retention: 15d
grafana:
adminPassword: "your-secure-password"

Accessing Dashboards

After deployment, you can access the monitoring dashboards:

  • Grafana: https://grafana.your-domain.com
  • Prometheus: https://prometheus.your-domain.com
  • Alertmanager: https://alertmanager.your-domain.com

Common Metrics

Monitor these key metrics for your KubeZero platform:

  • Cluster resource utilization
  • Application response times
  • Error rates
  • Pod restart counts

Alerting

Configure alerts for critical events:

  • High CPU/memory usage
  • Pod failures
  • Certificate expiration
  • Service downtime

Troubleshooting

Common monitoring issues and solutions:

  • Dashboard not loading
  • Missing metrics
  • Alert fatigue
  • Storage issues

For detailed configuration options, see the monitoring configuration reference.