Allow VMware Tanzu CloudHealth to ingest the usage and performance data that Operations for Applications gathers.
Operations for Applications is a SaaS service that records and manages time-series metrics for your cloud assets and resources. You can use these metrics to gain insights into usage, performance, and other analytics. Operations for Applications gathers the time-series data through integrations that are specific to the cloud asset. For example, Operations for Applications has integrations for Amazon EC2, Azure Virtual Machine, Google BigQuery, and so on. For more information, see Operations for Applications Documentation.
Tanzu CloudHealth ingests the usage and performance data that Operations for Applications gathers to help you in the following ways:
Click Save Account.
Tanzu CloudHealth begins collecting Operations for Applications Sources within 15 min of account setup. Thereafter, Operations for Applications Sources are collected every 15 min. Tanzu CloudHealth fetches up to one day’s worth of time-series data from the date when you add the Operations for Applications account.
You can use VMware Aria Operations for Applications source tags to create Perspective Groups within the Tanzu CloudHealth platform.
Tanzu CloudHealth ingests Operations for Applications metrics - Memory, CPU, network bytes, and filesystem usage for EC2 Instances and displays these metrics in the Rightsizing Report.
Integrate with the Tanzu CloudHealth Agent to allow the Agent to take snapshots at specific intervals and report metrics to the Tanzu CloudHealth platform every hour.
The Tanzu CloudHealth Agent is a lightweight monitoring service for your cloud resources. It is based on the collectd
daemon. You can install the Agent on your cloud instances to gain system-wide visibility into CPU, file system, and memory utilization. If you have Docker, the Agent catalogs containers and images as well.
Once installed, the Agent takes snapshots at specific intervals and reports metrics to the Tanzu CloudHealth platform every hour. The Agent pushes data from the instance to the Tanzu CloudHealth endpoint over https
. The data is in the form of a JSON file containing the aggregated disk I/O, filesystem, and memory metrics.
To understand the Agent transmission rates, see the Tanzu CloudHealth Knowledge Base article.
Note
The Agent supports these server operating systems.
For Linux
Tanzu CloudHealth Agent Version | Release Date | Features Introduced | Supported Instances |
---|---|---|---|
v28 | 09/03/2023 | ARM Support, IMDsv2, TLS1.2 | ARM DEB/RPM (Centos >=8/ x86_64), AMD DEB/RPM (Centos >=x86_64), RHEL >=8.0, UBUNTU >=20.0.0, Ubuntu 20 20.04.3 LTS (Focal Fossa). No more support to UBUNTU <12.04 and RHEL < 6.5. |
v24 | 20/04/2021 | Restricted to TLSv1.2. Hardstop for TLS 1.0, 1.1 | Amazon Linux 2 AMD, UBUNTU < 20.0.0, CentOS > =7 (rhel fedora), RHEL >= 8.2 |
v23 | 21/01/2021 | Resolved security violations | Deprecated, No more support |
v20 | 24/09/2018 | Disabled auto-update at the agent level | Deprecated, No more support |
v19 | 13/02/2018 | Support for proxies, default configuration parameters, instance ID collection fallback | Deprecated, No more support |
v18 | 03/03/2017 | Standard version | Deprecated, No more support |
For Windows
Presently we have Tanzu CloudHealth Agent v22 with TLSv1.2 enabled.
NOTE: The auto-update feature has been disabled.
If you have an older version of the Agent installed, uninstall it first by following the instructions in Troubleshoot Errors section.
Under Linux / Windows> No proxy setup > To install, enter the following command:
wget https://s3.amazonaws.com/remote- collector/agent/v24/install_cht_perfmon.sh -O install_cht_perfmon.sh; sudo sh install_cht_perfmon.sh 24 api-key cloud-name
C:\Program Files (x86)\CloudHealth Technologies\
HKEY\_LOCAL\_MACHINE\Software\CloudHealth Technologies
Tanzu CloudHealth Platform no longer supports the use of TLSv1.0 and TLS1.1 for API utilized by Tanzu CloudHealth Agent. You will see no impact due to the deprecation if you are not using Tanzu CloudHealth Agent. However, if you are using the Agent, TLSv1.2 will be supported. Modern servers will have no impact as this version of TLS was released in 2008.
The Tanzu CloudHealth Agent for Linux and Windows collects CPU, Memory, and Filesystem metric data from the instance operating system. This data is used in Policies, Rightsizing, Reporting for understanding and optimizing instance use. Metric data is collected at specific intervals and sent to the platform. Most instances use the latest version of TLSv1.2.
Upgrade your Tanzu CloudHealth Agent to v24 with TLSv1.2. Installing the latest version of Agent will automatically upgrade it to v24. On noticing errors, follow the steps under the Troubleshoot Errors section for manually fixing these errors.
While CloudWatch metrics provide detailed performance information from outside the VM, the Agent provides information from the point of view of the operating system. This information allows Tanzu CloudHealth to calculate a score reflecting how well provisioned resources are utilized.
For more information, see Rightsizing by Instance.
Agent data provides visibility into file system utilization, which Tanzu CloudHealth uses to determine the usage of provisioned storage capacity.
For more information, see Rightsizing by Volume.
On seeing errors, you can try manually uninstalling the Agent and then reinstall by following the commands below:
Uninstall the Agent.
wget -O - https://s3.amazonaws.com/remote- collector/agent/v24/uninstall_cht_perfmon.sh sudo sh; sudo rm -rf cht_agent_install/; sudo rm -rf install_cht_perfmon.sh
Reinstall the Agent.
wget https://s3.amazonaws.com/remote- collector/agent/v24/install_cht_perfmon.sh -O install_cht_perfmon.sh; sudo sh install_cht_perfmon.sh 24 <api_key> <cloud_name>;
For further errors, reach out to the Tanzu CloudHealth support team (mail to: [email protected]
) with the following information:
System Details
$ uname #current operating system
$ uname -a #including the operating system name, version, and build number, as well as the hardware architecture and release level of kerel
$ hostname #hostname details
$ lscpu #provides a detailed overview of the system's CPU architecture, including the number of CPUs, cores, and threads, the clock speed, the cache size, and the address size
$ cat os-release #get os-details
Log in to the agent instance and get the installation logs
$ cat /tmp/agent_install_log.txt
Check the status of cht collectd daemon
$ ps -eaf | grep cht
Check the permissions on the collectd configuration file
$ ls -alh /etc/chtcollectd/collectd.conf
Check collectd config
$ vi /etc/chtcollectd/chtcollectd.conf
or $ vi /etc/chtcollectd/collectd.conf
Check the permissions of collecd and gather the content of the collector output directory and its subdirectories [metrics are collected if folders are present]
$ ls -alR /var/lib/chtcollectd
$ vi /var/lib/chtcollectd/last_docker_upload.txt #check inside on file stamp details.
Once metrics are available, check if it is reaching to Tanzu CloudHealth endpoint Tanzu CloudHealth Agent collects metrics from the instance and sends them to the platform at endpoint api.cloudhealthtech.com
at port 443.
$ curl https://api.cloudhealthtech.com/v1/health #should give happy
$ nslookup api.cloudhealthtech.com or dig api.cloudhealthtech.com # should give 6,7 ips
Check the collectd and perfmon services status and start/restart accordingly
$ sudo /etc/init.d/chtcollectd status/start/stop/restart
$ sudo /etc/init.d/cht_perfmon status/start/stop/restart
ex: sudo /etc/init.d/chtcollectd status
sudo /etc/init.d/cht_perfmon start
Missing last_upload file?
$ cat /var/lib/chtcollectd/last upload.txt
Use this command to read the contents of the last_upload.txt
and share the file to support. Please notify us if it is not present.
Retrieve the facter JSON response from Linux based agent instance
$ /opt/cht_perfmon/embedded/bin/facter --json
Remove the old installation and install a new one manually upon customer approval
$ wget -O - https://s3.amazonaws.com/remote-collector/agent/uninstall_cht_perfmon.sh | sudo sh
$ sudo rm -rf cht_agent_install/
$ sudo rm -rf install_cht_perfmon.sh
$ wget https://s3.amazonaws.com/remote-collector/agent/v24/install_cht_perfmon.sh -O install_cht_perfmon.sh;
$ sudo sh install_cht_perfmon.sh 24 <api_key> <cloud_name>;
Check if the user: cht_agent is available
$ id cht_agent
If the cht_agent user is not configured correctly, delete the user and recreate them.
$ userdel -r cht_agent - delete
$ sudo useradd cht_agent
$ chown -R cht_agent:cht_agent /opt/cht_perfmon
$ sudo usermod -a -G docker cht_agent
$ sudo usermod -a -G docker cht_agent
$ cht_agent -m -U -G docker
$ chgrp -R cht_agent /var/lib/chtcollectd/
$ chmod -R g+ws /var/lib/chtcollectd/
$ chown -R cht_agent:cht_agent /opt/cht_perfmon
Try to restart the collectd and perfmon services.
$ sudo /etc/init.d/chtcollectd start
$ sudo /etc/init.d/cht_perfmon start
Get saved state point
$ cat /opt/cht_perfmon/saved_state.json
Check EOL (end of the line) in collectd.conf There should be an extra empty line added at the end in collectd.conf and restart in case collectd is not started. Restart cht_pefmon
and collectd
, if above change is added.
When gathering the content of the collector output directory, and you see the ‘S’ character for cht_agent If you run ls -alR /var/lib/chtcollectd
and it returns with the ‘S’ character, it is not the expected outcome. In that case, run the following command-
$ chmod 2775 /var/lib/chtcollectd
After an hour, you should be able to see last_upload file. Please share the details from last_upload and last_docker_upload as well.
This integration collects cost and usage data from VMware Aria Operations (formerly vRealize Operations) and ingests it into Tanzu CloudHealth.
The Aria Operations Integration for Tanzu CloudHealth collects cost and usage data from Aria Operations and ingests it into Tanzu CloudHealth. In this integration, an adapter instance is deployed on the Aria Operations platform to monitor vCenter resources and publish information to Tanzu CloudHealth. The Tanzu CloudHealth platform consolidates your multi-cloud data on a single view, providing visibility into your vSphere-based private cloud data using Aria Operations and the public cloud data through Tanzu CloudHealth.
Complete the following steps to set up the Aria Operations Integration for Tanzu CloudHealth.
Follow the steps below to create a new role and assign the permissions.
Data Center | Required Permissions |
---|---|
Dashboard | Read Datacenter Dashboard, Change Datacenter Dashboard |
Reports > Cost > Machine | Read Data Center Server Cost Report, Export Data Center Server Cost Report, Save Data Center Server Cost Report, Share Data Center Server Cost Report, Subscribe Data Center Server Cost Report |
Reports > Cost > Current | Read Data Center Cost Current Report, Export Data Center Cost Current Report, Save Data Center Cost Current Report, Share Data Center Cost Current Report, Subscribe Data Center Cost Current Report |
Reports > Cost > Cost Driver | Read Data Center Cost Driver Report, Export Data Center Cost Driver Report, Save Data Center Cost Driver Report, Share Data Center Cost Driver Report, Subscribe Data Center Cost Driver Report |
Reports > Usage > Machine | Read Data Center Server Usage Report, Export Data Center Server Usage Report, Save Data Center Server Usage Report, Share Data Center Server Usage Report,Subscribe Data Center Server Usage Report |
Reports > Performance > Machine | Read Data Center Server Performance Report, Export Data Center Server Performance Report, Save Data Center Server Performance Report, Share Data Center Server Performance Report,Subscribe Data Center Server Performance Report |
Reports > Performance > Memory | Read Data Center Memory Performance Report, Export Data Center Memory Performance Report, Save Data Center Memory Performance Report, Share Data Center Memory Performance Report, Subscribe Data Center Memory Performance Report |
Assets | Read Data Center Tag, Export Data Center Tag, Save Data Center Tag, Subscribe Data Center Tag, Read Data Center Server File System, Export Data Center Server File System, Save Data Center Server File System, Subscribe Data Center Server File System, Read Data Center Server Network Interface, Export Data Center Server Network Interface, Save Data Center Server Network Interface, Subscribe Data Center Server Network Interface, Read Data Center Server Block Device, Export Data Center Server Block Device, Save Data Center Server Block Device, Subscribe Data Center Server Block Device, Read Data Center Server CPU, Export Data Center Server CPU, Save Data Center Server CPU, Subscribe Data Center Server CPU, Read Data Center Server Usage Daily |
Accounts | Create Data Center Account, Read Data Center Account, Update Data Center Account, Delete Data Center Account, Export Data Center Account, Save Data Center Account, Subscribe Data Center Account, Tag Data Center Account |
Machine | Update Data Center Server, Read Data Center Server, Export Data Center Server, Save Data Center Server, Subscribe Data Center Server, Tag Data Center Server |
Hosts / DataStores / Clusters / Datacenters | Read Data Center Host, Export Data Center Host, Save Data Center Host, Subscribe Data Center Host, Tag Data Center Host |
Agents | Read Datacenter Agent Configuration, Update Datacenter Agent Configuration, Disable Autoupdate Datacenter Agent Status, Enable Autoupdate Datacenter Agent Status, Read Datacenter Agent Status, Export Datacenter Agent Status, Save Datacenter Agent Status, Subscribe Datacenter Agent Status |
From the left scrolling pane, select Setup, and select the necessary permissions as given in the following table.
Setup | Required Permissions |
---|---|
Profile | Read Profile, Update Profile, Generate API Key Profile |
- The user API access key must be associated with a user having either a default Power User role or an Administrator role.
- If you want to associate the user API access key with a user having a custom role, then create a new role, and add the necessary permissions.
Download the vRealize Operations Integration for Tanzu CloudHealth PAK file from VMware Marketplace, and save the PAK file to a temporary folder on your local system. Please note that the Management Pack names may temporarily display the old names (vRealize Operations and CloudHealth) in the Marketplace portal. We will soon update them with the new names (VMware Aria Operations and Tanzu CloudHealth, respectively).
Note - The vRealize Operations Integration for Tanzu CloudHealth version 1.0.5 includes an updated Log4j version 2.17.0 to resolve the Log4j vulnerability.
Prerequisites
Follow these steps to install the adapter instance:
The PAK file signature is valid
. The upload might take several minutes to complete.I accept the terms of this agreement
checkbox and click Next.Best Practices
Prerequisite
If you use Aria Operations cloud setup and collector as a cloud proxy for this integration, upgrade the HAProxy buffer size. The default HAProxy configuration will not support the Virtual Machines collection. Therefore you need to upgrade the HAProxy buffer size using the following instructions.
vim /etc/haproxy/haproxy.cfg
Add the following lines under the global
section:
global
log localhost local0 info
log localhost local1 info
chroot /etc/haproxy
ca-base /etc/ssl/certs
crt-base /etc/ssl/private
maxconn 32768
tune.bufsize 65536
Restart the HAProxy service service haproxy restart
NoteThis prerequisite is applicable only for Aria Operations cloud setup and not required for on-premises setup.
Follow these steps to configure the Adapter instance:
https://apps.cloudhealthtech.com/graphql
.config.property
file before the first collection begins.You can deploy the VMware Aria Operations Integration for Tanzu CloudHealth adapter in one or more existing on-prem VMware Aria Operations instances. In the Tanzu CloudHealth platform, you can view all the adapter instance details that you installed on the vRealize Operations Manager console.
All the configured adapter instance details appears on the page.
For FAQs on the Aria Operations Integration, see Tanzu CloudHealth Knowledge Base Article.
Buffer Size Overflow Error
If you get a Bad Request or Invalid Request error, you can now update the number of resource IDs you provide while calling the GET /api/resources/properties
API.
ssh root@<vrops-instance-address>
vmware
cd /usr/lib/vmware-vcops/user/plugins/inbound/CloudHealthHybridAdapter/conf/
config.properties
file.max.resources
value. Ensure that the value should be less than 500.The VMware Aria Operations Management Pack for VMware Tanzu CloudHealth ingests public cloud costs from Tanzu CloudHealth into Aria Operations.
Using the Aria Operations Management Pack, you can:
This Management Pack is part of a bi-directional integration between Tanzu CloudHealth and Aria Operations; giving users the flexibility to use their platform of choice to view multi-cloud costs. For information on the management pack that ingests data from Aria Operations into Tanzu CloudHealth, see Integrate with Aria Operations article.
To install and configure the Aria Operations Management Pack for Tanzu CloudHealth, see VMware Aria Operations for Integrations.
After creating a Tanzu CloudHealth integration, you can use the dashboards to view the accounts, regions, and services cost trends of the AWS, Azure and GCP accounts configured in this Management Pack.
Integrating Tanzu CloudHealth with VMware Aria Automation provides deployment and project-level spending transparency, which gives you greater cost visibility and optimization.
Aria Automation ingests Tanzu CloudHealth data to provide:
After integrating with Tanzu CloudHealth, you can also view the cost history for public cloud deployments directly in Tanzu CloudHealth. In the Tanzu CloudHealth platform, select the public cloud from the top menu for which you want to see the cost analysis. From the left menu, go to Reports > Cost and select History or Current. The cost analysis provides an overview of Perspective groups and their associated costs. For more information, see:
To configure Tanzu CloudHealth in Aria Automation, refer to Configure Tanzu CloudHealth Integration in Cloud Assembly.After creating a Tanzu CloudHealth integration, to view the pricing and cost information for public cloud deployments in Cloud Assembly, refer to How to view pricing for AWS and Azure.
- It can take 12 hours for Tanzu CloudHealth to synchronize any new vRealize Automation deployments.After the initial synchronization, Tanzu CloudHealth generates the cost for all perspective groups and sends the data to vRealize Automation.
- Currently, Cloud Assembly does not support pricing for GCP deployments.