The monitoring component serves as the infrastructure for monitoring, alerting, inspection, and health checking functions within the observability module. This document describes how to install the ACP Monitoring with Prometheus plugin or the ACP Monitoring with VictoriaMetrics plugin within a cluster.
Before install the monitoring components, please ensure the following conditions are met:
global
cluster can access port 11780 of the workload cluster.Navigate to App Store Management > Cluster Plugins and select the target cluster.
Locate the ACP Monitoring with Prometheus plugin and click Install.
Configure the following parameters:
Parameter | Description |
---|---|
Scale Configuration | Supports three configurations: Small Scale, Medium Scale, and Large Scale: - Default values are set based on the recommended load test values of the platform - You can choose or customize quotas based on the actual cluster scale - Default values will be updated with platform versions; for fixed configurations, custom settings are recommended |
Storage Type | - LocalVolume: Local storage with data stored on specified nodes - StorageClass: Automatically generates persistent volumes using a storage class - PV: Utilizes existing persistent volumes Note: Storage configuration cannot be modified after Installation |
Replica Count | Sets the number of monitoring component pods Note: Prometheus supports only single-node installation |
Parameter Configuration | Data parameters for the monitoring component can be adjusted as needed |
Click Install to complete the installation.
Once installation is complete, the components can be accessed at the following addresses (replace <>
with actual values):
Component | Access Address |
---|---|
Thanos | <platform_access_address>/clusters/<cluster>/prometheus |
Prometheus | <platform_access_address>/clusters/<cluster>/prometheus-0 |
Alertmanager | <platform_access_address>/clusters/<cluster>/alertmanager |
Ensure the plugin has been published by checking for ModulePlugin and ModuleConfig resources, in the global
cluster:
This indicates that the ModulePlugin prometheus
exists in the cluster and version v4.1.0
is published.
Create a ModuleInfo resource to install the plugin without any configuration parameters:
Reference for resources settings, example prometheus:
For more details, please refer to the Monitor Component Capacity Planning
YAML field reference (VictoriaMetrics):
Field path | Description |
---|---|
metadata.labels.cpaas.io/cluster-name | Target cluster name where the plugin is installed. |
metadata.labels.cpaas.io/module-name | Must be victoriametrics . |
metadata.labels.cpaas.io/module-type | Must be plugin . |
metadata.name | ModuleInfo name (e.g., <cluster>-victoriametrics ). |
spec.version | Plugin version to install. |
spec.config.storage.type | Storage type: LocalVolume , StorageClass , or PV . |
spec.config.storage.capacity | Storage size for VictoriaMetrics (Gi). Minimum 30 Gi recommended. |
spec.config.storage.nodes | Node list when storage.type=LocalVolume . Up to 1 node supported. |
spec.config.storage.path | LocalVolume path when storage.type=LocalVolume . |
spec.config.storage.storageClass | StorageClass name when storage.type=StorageClass . |
spec.config.storage.pvSelectorK | PV selector key when storage.type=PV . |
spec.config.storage.pvSelectorV | PV selector value when storage.type=PV . |
spec.replicas | Replica count; LV does not support multiple replicas. |
spec.config.components.vmstorage.retention | Data retention days for vmstorage. |
spec.config.components.vmagent.scrapeInterval | Scrape interval seconds; applies to ServiceMonitors without interval . |
spec.config.components.vmagent.scrapeTimeout | Scrape timeout seconds; must be less than scrapeInterval . |
spec.config.components.vmstorage.resources | Resource settings for vmstorage. |
spec.config.components.nodeExporter.port | Node Exporter port (default 9100). |
spec.config.components.nodeExporter.resources | Resource settings for Node Exporter. |
spec.config.components.alertmanager.resources | Resource settings for Alertmanager. |
spec.config.components.kubeStateExporter.resources | Resource settings for Kube State Exporter. |
spec.config.components.prometheusAdapter.resources | Resource settings for Prometheus Adapter (used for HPA/custom metrics). |
spec.config.components.vmagent.resources | Resource settings for vmagent. |
spec.config.size | Monitoring scale: Small , Medium , or Large . |
Since the ModuleInfo name changes upon creation, locate the resource via label to check the plugin status and version:
Field explanations:
NAME
: ModuleInfo resource nameCLUSTER
: Cluster where the plugin is installedMODULE
: Plugin nameDISPLAY_NAME
: Display name of the pluginSTATUS
: Installation status; Running
means successfully installed and runningTARGET_VERSION
: Intended installation versionCURRENT_VERSION
: Version before installationNEW_VERSION
: Latest available version for installationNavigate to App Store Management > Cluster Plugins and select the target cluster.
Locate the ACP Monitoring with VictoriaMetrics plugin and click Install.
Configure the following parameters:
Parameter | Description |
---|---|
Scale Configuration | Supports three configurations: Small Scale, Medium Scale, and Large Scale: - Default values are set based on the recommended load test values of the platform - You can choose or customize quotas based on the actual cluster scale - Default values will be updated with platform versions; for fixed configurations, custom settings are recommended |
Install Agent Only | - Off: Install the complete VictoriaMetrics component suite - On: Install only the VMAgent collection component, which relies on the VictoriaMetrics Center |
VictoriaMetrics Center | Select the cluster where the complete VictoriaMetrics component has been installed |
Storage Type | - LocalVolume: Local storage with data stored on specified nodes - StorageClass: Automatically generates persistent volumes using a storage class - PV: Utilizes existing persistent volumes |
Replica Count | Sets the number of monitoring component pods: - LocalVolume storage type does not support multiple replicas - For other storage types, please refer to on-screen prompts for configuration |
Parameter Configuration | Data parameters for the monitoring component can be adjusted Note: Data may temporarily exceed the retention period before being deleted |
Click Install to complete the installation.
Ensure the plugin has been published by checking for ModulePlugin and ModuleConfig resources, in the global
cluster:
This indicates that the ModulePlugin victoriametrics
exists in the cluster and version v4.1.0
is published.
Create a ModuleInfo resource to install the plugin without any configuration parameters:
Reference for resources settings, example prometheus:
For more details, please refer to the Monitor Component Capacity Planning
YAML field reference (Prometheus):
Field path | Description |
---|---|
metadata.labels.cpaas.io/cluster-name | Target cluster name where the plugin is installed. |
metadata.labels.cpaas.io/module-name | Must be prometheus . |
metadata.labels.cpaas.io/module-type | Must be plugin . |
metadata.name | ModuleInfo name (e.g., <cluster>-prometheus ). |
spec.version | Plugin version to install. |
spec.config.storage.type | Storage type: LocalVolume , StorageClass , or PV . |
spec.config.storage.capacity | Storage size for Prometheus (Gi). Minimum 30 Gi recommended. |
spec.config.storage.nodes | Node list when storage.type=LocalVolume . Up to 1 node supported. |
spec.config.storage.path | LocalVolume path when storage.type=LocalVolume . |
spec.config.storage.storageClass | StorageClass name when storage.type=StorageClass . |
spec.config.storage.pvSelectorK | PV selector key when storage.type=PV . |
spec.config.storage.pvSelectorV | PV selector value when storage.type=PV . |
spec.replicas | Replica count; only applicable to StorageClass /PV types. |
spec.config.components.prometheus.retention | Data retention days. |
spec.config.components.prometheus.scrapeInterval | Scrape interval seconds; applies to ServiceMonitors without interval . |
spec.config.components.prometheus.scrapeTimeout | Scrape timeout seconds; must be less than scrapeInterval . |
spec.config.components.prometheus.resources | Resource settings for Prometheus. |
spec.config.components.nodeExporter.port | Node Exporter port (default 9100). |
spec.config.components.nodeExporter.resources | Resource settings for Node Exporter. |
spec.config.components.alertmanager.resources | Resource settings for Alertmanager. |
spec.config.components.kubeStateExporter.resources | Resource settings for Kube State Exporter. |
spec.config.components.prometheusAdapter.resources | Resource settings for Prometheus Adapter. |
spec.config.components.thanosQuery.resources | Resource settings for Thanos Query. |
spec.config.size | Monitoring scale: Small , Medium , or Large . |
Since the ModuleInfo name changes upon creation, locate the resource via label to check the plugin status and version:
Field explanations:
NAME
: ModuleInfo resource nameCLUSTER
: Cluster where the plugin is installedMODULE
: Plugin nameDISPLAY_NAME
: Display name of the pluginSTATUS
: Installation status; Running
means successfully installed and runningTARGET_VERSION
: Intended installation versionCURRENT_VERSION
: Version before installationNEW_VERSION
: Latest available version for installation