System Health alerts provide the following standard color-coding in the System Health page under NetMRI Settings:

  • Green: indicates no issues currently present in the category.
  • Yellow: Warning. Warning health alerts appear when an issue poses the potential for more severe problems in the future, or when a configuration issue should be addressed. For example, a disk utilization level of 70% in a NetMRI appliance, Operations Center, or a Collector in an Operations Center network raises a Warning alert, as does a detected VRF network that is not yet mapped to a network view.
  • Red: Critical. An issue that needs to be addressed as soon as possible. Critical alerts occur in cases where, for example, storage utilization is at 90% or higher, or a system fan fails or is removed from the appliance.
  • Grey: Offline. Alerts colored Grey appear only for Operations Center Collectors that are offline due to expected causes, such as a Collector being taken offline for replacement or changes to configuration.
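
The storage thresholds given as examples above can be sketched as a simple classifier. This is only an illustration: the function and enum names are hypothetical, the 70% and 90% values come from the examples in the text, and treating 70% as an inclusive lower bound for Warning is an assumption rather than documented NetMRI behavior.

```python
from enum import Enum


class Severity(Enum):
    GREEN = "OK"
    YELLOW = "Warning"
    RED = "Critical"


# Example thresholds quoted in the text. How NetMRI evaluates them
# internally is not documented here, so treat these as illustrative.
WARNING_PCT = 70.0
CRITICAL_PCT = 90.0


def classify_disk_utilization(percent_used: float) -> Severity:
    """Map a disk utilization percentage to a health color (hypothetical helper)."""
    if not 0.0 <= percent_used <= 100.0:
        raise ValueError("percent_used must be between 0 and 100")
    if percent_used >= CRITICAL_PCT:
        return Severity.RED      # "90% or higher" per the text
    if percent_used >= WARNING_PCT:
        return Severity.YELLOW   # assumed inclusive at 70%
    return Severity.GREEN


if __name__ == "__main__":
    for pct in (42.0, 70.0, 93.5):
        print(f"{pct:5.1f}% -> {classify_disk_utilization(pct).name}")
```

The Grey (Offline) state is left out of the sketch because it depends on Collector status rather than on a measured threshold.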

...

System Health alerts also support notification subscriptions (see Subscribing to Notifications for more information). System Health notifications fall into the following general types: System Hardware Alert, Software Health Alert, Processing Health Alert, Storage Health Alert, Network Health Alert, Platform Capacity Health Alert, and Collector Connectivity Health Alert.
Individual alert types gather under the seven basic System Health categories.

The following table provides a summary of the System Health alerts.

Health Alert Category | Alert Messages | Description
Hardware (see Details on Hardware Alerts for more information)

RAID Drive <X> Failed.

RAID Array Failed.

Fan <X> Failed.

Power Supply <X> Failed.

High Ambient Temperature.

High Internal Temperature.

RAID Battery Failed.

This category applies only to hardware-based NetMRI systems and will not appear for virtual machine-based NetMRI instances.
RAID messages apply only to appliances that directly support RAID, including the NT-2200 and NT-4000 models.
NetMRI 1102-A models do not support hardware monitoring alerts.
NT-1400 and NT-2200 systems do not report Ambient Temperatures.
Double-clicking any hardware Issue that appears in this category opens the Settings –> Notifications –> Hardware Status page.
Network (see Details on Network Alerts for more information)

High rate of network errors on MGMT port.

Network link down on MGMT port.

High rate of network errors on SCAN port.

Network link down on SCAN port.

General network connectivity issues on the NetMRI appliance.

Errors related to sending jumbo frames are excluded from the triggers of the alert messages "NETW000: High number of network errors on management port" and "NETW001: High number of network errors on SCAN port".

Platform Capacity (see Details on Network Alerts for more information)
Number of interfaces <count> exceeds Platform Interface Limit of <limit>.
Number of end hosts <count> exceeds Platform SPM End Host Limit of <limit>.
Number of devices <count> exceeds Platform Total Device Limit of <limit>.
Reflects issues where the number of discovered network devices, interfaces, or end hosts exceeds the platform limits for the appliance. Does not apply to licensed limits. Platform limit values can be found on the Settings icon –> Setup –> Settings Summary page.
Processing (see Details on Processing Alerts for more information)
Processing Capacity is being exceeded.
Processing alerts reflect issues where the system processing capacity is being exceeded in the current system configuration.
Software (see Details on Software Alerts for more information)
A software problem was detected.
A software problem was detected during Weekly Maintenance.
In all cases, contact Customer Support for assistance.
Storage (see Details on Storage Alerts for more information)

Low on disk space

Critically low on disk space

Cannot Connect to remote archive storage

Could not save the archive to remote storage  <hostname>

Disk <X> Failed.

Low on Disk Space indicates that System Health recommends preventive action to increase available disk space in the appliance.
Critically Low on Disk Space indicates an impending failure due to insufficient disk space.
Collector Connectivity (see Details on Operation Center Collector Alerts for more information)
Connection to Collector <X> lost.
Collector <X> Reset.
Collector <X> is Rebooting.
Issues associated with collector reachability and connectivity in an Operation Center deployment.

Configuration (see Details on Configuration Alerts for more information)

New unassigned VRF discovered.
Warning notification that a VRF network has been discovered and should be placed into a network view by the administrator.

Details on Software Alerts

...

Platform Capacity alerts do not necessarily reflect a problem in the NetMRI system. Each NetMRI appliance has an advisory limit on the number of discovered interfaces, discovered devices, and discovered end host devices that it is expected to support, based on the disk space and system processing capabilities inherent in the appliance model. These values are called the Platform Capacity and are also reflected in the NetMRI Configuration values shown under the Settings icon –> Setup –> Settings Summary page.
Unlike other System Alert categories, Platform Capacity warnings always appear when all three of the advisory system limits (number of managed interfaces, number of end host devices, and number of discovered devices) are exceeded by the appliance. Note that the Processing category (see also Details on Processing Alerts) provides the same three warnings, along with others, in its alerts category. When any of these three limits is exceeded as the result of a processing issue, one of the Platform Capacity warnings also appears in the notification. These limits are not enforced, and the NetMRI appliance continues to operate normally; excess devices continue to appear in the Discovered Devices table. (For related information, see Understanding Platform Limits, Licensing Limits and Effective Limits.)
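
To make the advisory nature of these limits concrete, the following minimal sketch compares discovered counts against platform limits and builds warning text in the format of the alert messages listed in the summary table. The class and function names are hypothetical, the counts and limits are made-up sample values, and checking each limit independently is an assumption; the sketch only reports and blocks nothing, mirroring the statement that the limits are not enforced.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class CapacityCheck:
    """One advisory platform limit (hypothetical structure for illustration)."""
    label: str       # noun used in the message, e.g. "interfaces"
    limit_name: str  # limit name as it appears in the alert text
    count: int       # currently discovered count
    limit: int       # advisory platform limit


def capacity_warnings(checks: Iterable[CapacityCheck]) -> List[str]:
    """Return one warning string per exceeded limit; nothing is enforced."""
    return [
        f"Number of {c.label} {c.count} exceeds {c.limit_name} of {c.limit}."
        for c in checks
        if c.count > c.limit
    ]


if __name__ == "__main__":
    # Sample values only; real limits are shown on the Settings Summary page.
    sample = [
        CapacityCheck("interfaces", "Platform Interface Limit", 120000, 100000),
        CapacityCheck("end hosts", "Platform SPM End Host Limit", 5000, 10000),
        CapacityCheck("devices", "Platform Total Device Limit", 2500, 2000),
    ]
    for message in capacity_warnings(sample):
        print(message)
```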

...