Support overview

You can collect data about your environment, monitor the health of your cluster and virtual machines (VMs), and troubleshoot OKD Virtualization resources with the following tools.

Web console

The OKD web console displays resource usage, alerts, events, and trends for your cluster and for OKD Virtualization components and resources.

Table 1. Web console pages for monitoring and troubleshooting
Page | Description
--- | ---
Overview page | Cluster details, status, alerts, inventory, and resource usage
Virtualization → Overview tab | OKD Virtualization resources, usage, alerts, and status
Virtualization → Top consumers tab | Top consumers of CPU, memory, and storage
Virtualization → Migrations tab | Progress of live migrations
VirtualMachines → VirtualMachine → VirtualMachine details → Metrics tab | VM resource usage, storage, network, and migration
VirtualMachines → VirtualMachine → VirtualMachine details → Events tab | List of VM events
VirtualMachines → VirtualMachine → VirtualMachine details → Diagnostics tab | VM status conditions and volume snapshot status

Collecting data for Red Hat Support

When you submit a support case to Red Hat Support, it helps to include debugging information. The following topics describe how to gather it:

Collecting data about your environment

Configure Prometheus and Alertmanager and collect must-gather data for OKD and OKD Virtualization.

Collecting data about VMs

Collect must-gather data and memory dumps from VMs.

must-gather tool for OKD Virtualization

Configure and use the must-gather tool.
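
As a rough illustration of the must-gather step above, the following Python sketch assembles an `oc adm must-gather` command line that uses a custom image and output directory. The helper name `build_must_gather_command`, the default directory name, and the image reference `quay.io/kubevirt/must-gather` are assumptions made for this example. Use the image named in the must-gather documentation for your release.

```python
import shlex


def build_must_gather_command(image, dest_dir="must-gather-output"):
    """Return the argument list for an `oc adm must-gather` run.

    `image` is the must-gather image to run; `dest_dir` is the local
    directory where the collected data is written.
    """
    if not image:
        raise ValueError("a must-gather image reference is required")
    return [
        "oc", "adm", "must-gather",
        f"--image={image}",
        f"--dest-dir={dest_dir}",
    ]


# Assumed image reference, for illustration only. Replace it with the
# image that the must-gather documentation for your release specifies.
ASSUMED_IMAGE = "quay.io/kubevirt/must-gather"

command = build_must_gather_command(ASSUMED_IMAGE)

# Print the command so that it can be reviewed before it is run.
print(shlex.join(command))
```

The sketch only builds and prints the command. To run it, pass the returned list to `subprocess.run(command, check=True)` on a workstation where `oc` is logged in to the cluster.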

Monitoring

You can monitor the health of your cluster and VMs. For details about monitoring tools, see the Monitoring overview.

Troubleshooting

Troubleshoot OKD Virtualization components and VMs, and resolve issues that trigger alerts in the web console.

Events

View important lifecycle information for VMs, namespaces, and resources.

Logs

View and configure logs for OKD Virtualization components and VMs.

Runbooks

Diagnose and resolve issues that trigger OKD Virtualization alerts in the web console.

Troubleshooting data volumes

Troubleshoot data volumes by analyzing conditions and events.
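
To make the idea of analyzing conditions more concrete, here is a minimal Python sketch that summarizes the `status.conditions` list of a data volume object that is already parsed from JSON. The function name `summarize_dv_conditions`, the rule it applies (treat the volume as healthy when the `Ready` condition is `True`, otherwise list every condition that is not `True`), and the sample object are assumptions for illustration, not a documented procedure.

```python
def summarize_dv_conditions(data_volume):
    """Return (ready, findings) for a data volume parsed from JSON.

    `ready` is True when the Ready condition has status "True".
    `findings` lists every condition whose status is not "True",
    and is empty when the data volume is ready.
    """
    conditions = data_volume.get("status", {}).get("conditions", [])
    by_type = {cond.get("type"): cond for cond in conditions}

    if by_type.get("Ready", {}).get("status") == "True":
        return True, []

    findings = []
    for cond in conditions:
        if cond.get("status") != "True":
            findings.append(
                f"{cond.get('type')}: status={cond.get('status')}, "
                f"reason={cond.get('reason', '')}, "
                f"message={cond.get('message', '')}"
            )
    return False, findings


# Hypothetical sample object, for illustration only.
SAMPLE_DV = {
    "status": {
        "conditions": [
            {"type": "Bound", "status": "True", "reason": "Bound"},
            {"type": "Ready", "status": "False", "reason": "TransferRunning"},
            {
                "type": "Running",
                "status": "False",
                "reason": "Error",
                "message": "Unable to connect to http data source",
            },
        ]
    }
}

ready, findings = summarize_dv_conditions(SAMPLE_DV)
print("ready:", ready)
for line in findings:
    print(" -", line)
```

In practice, the input could come from `oc get datavolume <name> -o json` parsed with `json.loads`. The reason and message fields of the listed conditions, together with the events for the data volume, usually indicate where to look next.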