Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

System Health is a NetMRI feature to provide a view of the system health of the NetMRI appliance. NetMRI provides two visual inputs to notify and assist the administrator in responding to issues in the NetMRI appliance:

...

Banner system health messages appear only in yellow (warning) and red (critical). Click the banner text to  To display the System Health page with its alert listings, click the banner text.
To hide system health banners for NetMRI users:

  1. In the NetMRI UI, go to the Settings icon > User Admin > Roles.
  2. Click the Action icon for the role you want to edit and then select Edit.
  3. Click Privilages Privileges.
  4. In the View: System Health Banner row, click the Delete icon.
  5. Click Yes.

Anchor
Categories of Health Status
Categories of Health Status
Anchor
bookmark769
bookmark769
Categories of Health Status

...

Health Alert Category

Alert Messages

Description

Hardware (see the Details on Hardware Alerts for topic for more information)

RAID Drive <X> Failed.

RAID Array Failed.

Fan <X> Failed.

Power Supply <X> Failed.

High Ambient Temperature.

High Internal Temperature.

RAID Battery Failed.

RAID Array Failed.

This category applies only to hardware-based NetMRI systems and will not appear for virtual machine-based NetMRI instances.
RAID messages apply only to appliances that directly support RAID, including the NT-2200 and NT-4000 models.
NetMRI 1102-A models do not support hardware monitoring alerts.
NT-1400 and NT-2200 systems do not report Ambient Temperatures.
Double-clicking any hardware Issue that appears in this category opens the Settings icon > Notifications > Hardware Status page.

Network (see the Details on Network Alerts topic for more information)

High rate of network errors on MGMT port.

Network link down on MGMT port.

High rate of network errors on SCAN port.

Network link down on SCAN port.

General network connectivity issues on the NetMRI appliance.

Errors related to sending jumbo frames are excluded from the triggers of the alert messages "NETW000: High number of network errors on management port" and "NETW001: High number of network errors on SCAN port".

Platform Capacity (see the Details on Network Alerts topic for more information)

Number of interfaces <count> exceeds Platform Interface Limit of <limit>.
Number of end hosts <count> exceeds Platform SPM End Host Limit of <limit>.
Number of devices <count> exceeds Platform Total Device Limit of <limit>.

Reflects issues where the current level of discovered network devices, interfaces or end hosts is exceeding the platform limits for the appliance. Does not apply to licensed limits. Platform limit values can be located on the Settings icon > Setup > Settings Summary page.

Processing (see the Details on Processing Alerts topic for more information)

Processing Capacity is being exceeded.

Processing Alerts reflect Issues where the system processing capacity is being exceeded in the current system configuration.

Software (see the   Details on Software Alerts topic for more information)

A software problem was detected.
A software problem was detected during Weekly Maintenance.

In all cases, contact Customer Support for assistance.

Storage (see the   Details on Storage Alerts topic for more information)

Low on disk space

Critically low on disk space

Cannot Connect to remote archive storage

Could not save archive to remote storage  <hostname>

Disk <X> Failed.

Low on Disk Space indicates that System Health recommends preventive action to increase available disk space in the appliance.
Critically Low on Disk Space indicates an impending failure due to insufficient disk space.

Collector Connectivity (see the   Details on Operation Center Collector Alerts topic for more information)

Connection to Collector <X> lost. Collector <X> Reset.
Collector <X> is Rebooting.

Issues associated with collector reachability and connectivity in an Operation Center deployment.

Configuration (see the   Details on Configuration Alerts topic for more information)

New unassigned VRF discovered.

Warning notification that a VRF network has been discovered and should be placed into a network view by the administrator.

...

Platform Capacity alerts do not necessarily reflect a problem in the NetMRI system. Each NetMRI appliance has an advisory limit in the number of discovered interfaces, discovered devices and discovered end host devices that it is expected to support, based on disk space and system processing capabilities inherent in the appliance model. These values are called the Platform Capacity and are also reflected in the NetMRI Configuration values shown on the Settings icon > Setup > Settings Summary page.
Unlike other System Alert categories, Platform Capacity warnings will always appear when all three of the advisory system limits (Number of managed interfaces, Number of end hosts devices, number of discovered devices) are exceeded by the appliance. Note that the Processing category (also see the Details on Processing Alerts topic) provides the same three warnings (along with others) in its alerts category. When any of these three limits is violated as the result of a processing issue, one of the Platform Capacity warnings also will appear in the notification. These limits are not enforced and the NetMRI appliance operates normally; excess devices continues to appear in the Discovered Devices table. (For related information, see Understanding Platform Limits, Licensing Limits and Effective Limits.)

...