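System Metrics Monitoring

This section describes system metrics monitoring on the Orchestrator.

Orchestrator System Metrics Monitoring Overview

The Orchestrator includes a built-in system metrics monitoring stack, which consists of a metrics collector and a time-series database. With the monitoring stack, you can check the health and system load of the Orchestrator.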

To enable the monitoring stack, run the following command on the Orchestrator:

sudo /opt/vc/scripts/vco_observability_manager.sh enable

To check the status of the monitoring stack, run:

sudo /opt/vc/scripts/vco_observability_manager.sh status

To disable the monitoring stack, run:

sudo /opt/vc/scripts/vco_observability_manager.sh disable

Metrics Collector

Telegraf serves as the Orchestrator system metrics collector and includes plugins for collecting system metrics. The following metrics are enabled by default.

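Metric Name	Description
inputs.cpu	Metrics about CPU usage.
inputs.mem	Metrics about memory usage.
inputs.net	Metrics about network interfaces.
inputs.system	Metrics about system load and uptime.
inputs.processes	The number of processes, grouped by status.
inputs.disk	Metrics about disk usage.
inputs.diskio	Metrics about disk I/O, by device.
inputs.procstat	CPU and memory usage for specific processes.
inputs.nginx	Basic status information for Nginx (ngx_http_stub_status_module).
inputs.mysql	Statistics from the MySQL server.
inputs.clickhouse	Metrics from one or more ClickHouse servers.
inputs.redis	Metrics from one or more Redis servers.
inputs.filecount	The number and total size of files in specified directories.
inputs.ntpq	Standard NTP query metrics (requires the ntpq executable).
inputs.x509_cert	Metrics from SSL certificates.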
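
To confirm which input plugins are active on a given system, the plugin table headers in the Telegraf input configuration file can be listed. The following is a minimal sketch, not a supported tool: the helper name list_enabled_inputs is hypothetical, and it assumes that enabled plugins appear as uncommented [[inputs.<name>]] headers, which is the standard Telegraf TOML layout.

```shell
#!/usr/bin/env bash
# Hypothetical helper: list the input plugins enabled in a Telegraf config file.
# Assumption: an enabled plugin is an uncommented "[[inputs.<name>]]" header.

list_enabled_inputs() {
  # Keep only headers that start a line (optionally indented), so lines
  # commented out with "#" are skipped; then strip the brackets and
  # print each plugin name once, sorted.
  grep -oE '^[[:space:]]*\[\[inputs\.[A-Za-z0-9_]+' "$1" |
    sed -E 's/^[[:space:]]*\[\[//' |
    sort -u
}
```

For example, after sourcing the snippet, list_enabled_inputs /etc/telegraf/telegraf.d/system_metrics_input.conf prints one inputs.<name> entry per line.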

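To enable additional metrics or disable some of the enabled metrics, edit the Telegraf configuration file on the Orchestrator, and then restart Telegraf so that the change takes effect: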

sudo vi /etc/telegraf/telegraf.d/system_metrics_input.conf
sudo systemctl restart telegraf
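
A syntax error in the configuration file can prevent Telegraf from starting after the restart. One cautious approach is to back up the file before editing and to run Telegraf once in test mode before restarting the service. The sketch below assumes that the Telegraf version installed on the Orchestrator supports the standard --config and --test command-line flags; the function names and the .bak suffix are illustrative only.

```shell
#!/usr/bin/env bash
# Sketch: edit the Telegraf input config with a backup and a pre-restart check.
# Assumptions: the installed telegraf binary accepts --config and --test;
# the helper names and the ".bak" suffix are hypothetical.

CONF="${CONF:-/etc/telegraf/telegraf.d/system_metrics_input.conf}"

# Derive the backup file name for a given config path.
backup_path() {
  printf '%s.bak' "$1"
}

# Back up, edit, test, and restart; each step runs only if the previous succeeded.
safe_edit_telegraf() {
  sudo cp -p "$CONF" "$(backup_path "$CONF")" &&
    sudo vi "$CONF" &&
    # --test gathers metrics once and exits; a non-zero exit indicates a problem.
    sudo telegraf --config "$CONF" --test >/dev/null &&
    sudo systemctl restart telegraf
}
```

If the test step fails, the service is left running with its previous configuration, and the backup copy can be restored manually.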

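Time-Series Database

Prometheus stores the system metrics collected by Telegraf. Metrics data is retained in the database for up to three weeks. By default, Prometheus listens on port 9090. If you use an external monitoring tool, add the Prometheus database as a data source so that you can view the Orchestrator system metrics in the monitoring UI.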
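
Before connecting an external tool, it can help to confirm that Prometheus answers on port 9090. The sketch below uses the standard Prometheus HTTP API instant-query endpoint (/api/v1/query). The host name is an assumption, as is network reachability of port 9090 from the machine where the command runs; the function names are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch: send an instant query to the Orchestrator's Prometheus instance.
# Assumptions: PROM_HOST is a placeholder for the Orchestrator address, and
# port 9090 is reachable from where this runs.

PROM_HOST="${PROM_HOST:-localhost}"
PROM_PORT="${PROM_PORT:-9090}"   # default Prometheus port noted above

# Build the URL of the instant-query endpoint.
prom_query_url() {
  printf 'http://%s:%s/api/v1/query' "$PROM_HOST" "$PROM_PORT"
}

# Run a PromQL expression; curl -G appends the URL-encoded query to the URL.
prom_query() {
  curl -sG --data-urlencode "query=$1" "$(prom_query_url)"
}
```

For example, prom_query 'up' returns a JSON document that reports the scrape status of each target Prometheus knows about. The same base URL, http://<orchestrator>:9090, is what an external monitoring tool would typically take as its Prometheus data source address.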