Monitoring
Learn how to monitor your KubeZero platform and applications.
Overview
KubeZero includes a comprehensive monitoring stack that provides visibility into your platform's health and performance.
Monitoring Components
- Prometheus for metrics collection
- Grafana for visualization
- Alertmanager for alerting
- Jaeger for distributed tracing
Setting Up Monitoring
# Enable monitoring in your stack configuration
monitoring:
enabled: true
prometheus:
retention: 15d
grafana:
adminPassword: "your-secure-password"
Accessing Dashboards
After deployment, you can access the monitoring dashboards:
- Grafana:
https://grafana.your-domain.com
- Prometheus:
https://prometheus.your-domain.com
- Alertmanager:
https://alertmanager.your-domain.com
Common Metrics
Monitor these key metrics for your KubeZero platform:
- Cluster resource utilization
- Application response times
- Error rates
- Pod restart counts
Alerting
Configure alerts for critical events:
- High CPU/memory usage
- Pod failures
- Certificate expiration
- Service downtime
Troubleshooting
Common monitoring issues and solutions:
- Dashboard not loading
- Missing metrics
- Alert fatigue
- Storage issues
For detailed configuration options, see the monitoring configuration reference.