Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Banner System Health messages appear only in yellow (Warning) and red (Critical). Click directly on the banner text to display the System Health page with its alert listings.
You may disallow the System Health banners from appearing to non-Admin NetMRI users, by opening the Settings > General Settings –> > Advanced Settings page and choosing the Hide the system banners from non-admin users setting. (It is on the last page of Advanced Settings, under User Administration.) Click the Action icon and choose Edit, choose Yes and click OK.

...

Health Alert Category

Alert Messages

Description

Hardware 

RAID Drive <X> Failed.

RAID Array Failed.

Fan <X> Failed.

Power Supply <X> Failed.

High Ambient Temperature.

High Internal Temperature.

RAID Battery Failed.

RAID Array Failed.

This category applies only to hardware-based NetMRI systems and will not appear for virtual machine-based NetMRI instances.
RAID messages apply only to appliances that directly support RAID, including the NT-2200 and NT-4000 models.
NetMRI 1102-A models do not support hardware monitoring alerts.
NT-1400 and NT-2200 systems do not report Ambient Temperatures.
Double-clicking any hardware Issue that appears in this category opens the Settings –> > Notifications –> > Hardware Status page.

Network 

High rate of network errors on MGMT port.

Network link down on MGMT port.

High rate of network errors on SCAN port.

Network link down on SCAN port.

General network connectivity issues on the NetMRI appliance.

Errors related to sending jumbo frames are excluded from the triggers of the alert messages "NETW000: High number of network errors on management port" and "NETW001: High number of network errors on SCAN port".

Platform Capacity 

Number of interfaces <count> exceeds Platform Interface Limit of <limit>.
Number of end hosts <count> exceeds Platform SPM End Host Limit of <limit>.
Number of devices <count> exceeds Platform Total Device Limit of <limit>.

Reflects issues where the current level of discovered network devices, interfaces or end hosts is exceeding the platform limits for the appliance. Does not apply to licensed limits. Platform limit values can be located in the Settings icon –> > Setup –> > Settings Summary page.

Processing 

Processing Capacity is being exceeded.

Processing Alerts reflect Issues where the system processing capacity is being exceeded in the current system configuration.

Software 

A software problem was detected.
A software problem was detected during Weekly Maintenance.

In all cases, contact Customer Support for assistance.

Storage 

Low on disk space

Critically low on disk space

Cannot Connect to remote archive storage

Could not save archive to remote storage  <hostname>

Disk <X> Failed.

Low on Disk Space indicates that System Health recommends preventive action to increase available disk space in the appliance.
Critically Low on Disk Space indicates an impending failure due to insufficient disk space.

Collector Connectivity 

Connection to Collector <X> lost. Collector <X> Reset.
Collector <X> is Rebooting.

Issues associated with collector reachability and connectivity in an Operation Center deployment.

Configuration 

New unassigned VRF discovered.

Warning notification that a VRF network has been discovered and should be placed into a network view by the administrator.

...

Platform Capacity alerts do not necessarily reflect a problem in the NetMRI system. Each NetMRI appliance has an advisory limit in the number of discovered interfaces, discovered devices and discovered end host devices that it is expected to support, based on disk space and system processing capabilities inherent in the appliance model. These values are called the Platform Capacity and are also reflected in the NetMRI Configuration values shown under the Settings icon –> > Setup –> > Settings Summary page.
Unlike other System Alert categories, Platform Capacity warnings will always appear when all three of the advisory system limits (Number of managed interfaces, Number of end hosts devices, number of discovered devices) are exceeded by the appliance. Note that the processing category provides the same three warnings (along with others) in its alerts category. When any of these three limits is violated as the result of a processing issue, one of the Platform Capacity warnings also will appear in the notification. These limits are not enforced and the NetMRI appliance operates normally; excess devices continues to appear in the Discovered Devices table. 

...

Double-clicking on any hardware alert opens the alert in the Settings –> > Notifications –> > Hardware Status page

Alert Message

User Action

RAID Drive <X> Failed

Replace the hard disk with a replacement drive authorized by Infoblox.

RAID Array Failed

Contact Customer Support.

Fax <X> Failed

Replace the system fan. Appears only in systems where system fans are user-replaceable, as with the NetMRI NT-2200 and NT-4000 devices. Fan assemblies must be replaced with authorized Infoblox parts. Contact Customer Support if this message appears in systems where fans are not user-replaceable.

Power Supply <X> Failed

Check Power Supply operation. Message appears only for systems in which a redundant 1+1 power supply configuration is available and running in the device in question. (For a single-power-supply system, the appliance simply shuts down.) The alerts also allow for the possibility that a power supply is unplugged.

Ambient temperature is high. Internal temperature is high.

Both messages may appear for the same system, with internal temperature being affected by the ambient temperature. Reduce the ambient temperature where possible; if the Internal temperature remains high, look for a Fan Failed error message along with the Internal Temperature message. Contact Customer Support if an Internal Temperature is High issue persists when conditions are otherwise optimal.

Critical — RAID Battery failed.

Contact Customer Support.

RAID Array Degraded.

The RAID array is not fully operational due to a disk in the process of rebuilding or a disk being removed. If a disk has been removed in preparation for replacement, this issue will also appear, and will clear when the replacement is finished rebuilding. If you know that no disk replacement operation has been started with the appliance and this issue appears, contact Customer Support.

...