Introduction

The Monitoring module is a core component of the ACP platform's observability suite that provides comprehensive monitoring and alerting capabilities for platform administrators and operations teams.

This module delivers four essential monitoring capabilities:

  • Metrics collection for real-time performance data gathering from clusters, nodes, applications, and containers
  • Dashboards for intuitive visualization and analysis of system health and performance trends
  • Alerting for proactive detection of issues through customizable rules and thresholds
  • Notifications for timely delivery of alert information to operations personnel

By integrating these capabilities with open-source components like Prometheus and VictoriaMetrics, it enables organizations to maintain system reliability, prevent downtime, reduce operational costs, and ensure optimal performance across their entire infrastructure.