100 Series Alarm Messages¶
Alarm Severities
One or more of the following severity levels is associated with each alarm.
CriticalIndicates that a platform service affecting condition has occurred and immediate corrective action is required. (A mandatory platform service has become totally out of service and its capability must be restored.)
MajorIndicates that a platform service affecting condition has developed and urgent corrective action is required. (A mandatory platform service has developed a severe degradation and its full capability must be restored.)
- or -
An optional platform service has become totally out of service and its capability should be restored.
MinorIndicates that a platform non-service affecting fault condition has developed and corrective action should be taken in order to prevent a more serious fault. (The fault condition is not currently impacting / degrading the capability of the platform service.)
WarningIndicates the detection of a potential or impending service affecting fault. Action should be taken to further diagnose and correct the problem in order to prevent it from becoming a more serious service affecting fault.
Alarm ID: 100.101  | 
Platform CPU threshold exceeded; threshold x%, actual y% . CRITICAL @ 95% MAJOR @ 90%  | 
Entity Instance  | 
host=<hostname>  | 
Degrade Affecting Severity:  | 
critical  | 
Severity:  | 
[‘critical’, ‘major’]  | 
Proposed Repair Action  | 
Monitor and if condition persists, contact next level of support.  | 
Management Affecting Severity  | 
major  | 
Alarm ID: 100.103  | 
Memory threshold exceeded; threshold x%, actual y% . CRITICAL @ 90% MAJOR @ 80%  | 
Entity Instance  | 
host=<hostname> OR host=<hostname>.memory=total OR host=<hostname>.memory=platform OR host=<hostname>.numa=node<number>  | 
Degrade Affecting Severity:  | 
critical  | 
Severity:  | 
[‘critical’, ‘major’]  | 
Proposed Repair Action  | 
Monitor and if condition persists, contact next level of support; may require additional memory on Host.  | 
Management Affecting Severity  | 
none  | 
Alarm ID: 100.104  | 
host=<hostname>.filesystem=<mount-dir> File System threshold exceeded; threshold x%, actual y% . CRITICAL @ 90% MAJOR @ 80% OR host=<hostname>.volumegroup=<volumegroup-name> Monitor and if condition persists, consider adding additional physical volumes to the volume group.  | 
Entity Instance  | 
host=<hostname>.filesystem=<mount-dir> OR host=<hostname>.volumegroup=<volumegroup-name>  | 
Degrade Affecting Severity:  | 
critical  | 
Severity:  | 
[‘critical’, ‘major’]  | 
Proposed Repair Action  | 
Reduce usage or resize filesystem.  | 
Management Affecting Severity  | 
critical  | 
Alarm ID: 100.106  | 
‘OAM’ Port failed.  | 
Entity Instance  | 
host=<hostname>.port=<port-name>  | 
Degrade Affecting Severity:  | 
major  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Check cabling and far-end port configuration and status on adjacent equipment.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 100.107  | 
‘OAM’ Interface degraded. OR ‘OAM’ Interface failed.  | 
Entity Instance  | 
host=<hostname>.interface=<if-name>  | 
Degrade Affecting Severity:  | 
major  | 
Severity:  | 
[‘critical’, ‘major’]  | 
Proposed Repair Action  | 
Check cabling and far-end port configuration and status on adjacent equipment.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 100.108  | 
‘MGMT’ Port failed.  | 
Entity Instance  | 
host=<hostname>.port=<port-name>  | 
Degrade Affecting Severity:  | 
major  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Check cabling and far-end port configuration and status on adjacent equipment.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 100.109  | 
‘MGMT’ Interface degraded. OR ‘MGMT’ Interface failed.  | 
Entity Instance  | 
host=<hostname>.interface=<if-name>  | 
Degrade Affecting Severity:  | 
major  | 
Severity:  | 
[‘critical’, ‘major’]  | 
Proposed Repair Action  | 
Check cabling and far-end port configuration and status on adjacent equipment.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 100.110  | 
‘CLUSTER-HOST’ Port failed.  | 
Entity Instance  | 
host=<hostname>.port=<port-name>  | 
Degrade Affecting Severity:  | 
major  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Check cabling and far-end port configuration and status on adjacent equipment.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 100.111  | 
‘CLUSTER-HOST’ Interface degraded. OR ‘CLUSTER-HOST’ Interface failed.  | 
Entity Instance  | 
host=<hostname>.interface=<if-name>  | 
Degrade Affecting Severity:  | 
major  | 
Severity:  | 
[‘critical’, ‘major’]  | 
Proposed Repair Action  | 
Check cabling and far-end port configuration and status on adjacent equipment.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 100.114  | 
NTP configuration does not contain any valid or reachable NTP servers. NTP address <IP address> is not a valid or a reachable NTP server.  | 
Entity Instance  | 
host=<hostname>.ntp host=<hostname>.ntp=<IP address>  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
[‘major’, ‘minor’]  | 
Proposed Repair Action  | 
Monitor and if condition persists, contact next level of support.  | 
Management Affecting Severity  | 
none  | 
Alarm ID: 100.118  | 
Controller cannot establish connection with remote logging server.  | 
Entity Instance  | 
host=<hostname>  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
minor  | 
Proposed Repair Action  | 
Ensure Remote Log Server IP is reachable from Controller through OAM interface; otherwise contact next level of support.  | 
Management Affecting Severity  | 
none  | 
Alarm ID: 100.119  | 
<hostname> does not support the provisioned PTP mode OR <hostname> PTP clocking is out-of-tolerance OR <hostname> is not locked to remote PTP Primary source OR <hostname> GNSS signal loss state:<state> OR <hostname> 1PPS signal loss state:<state>  | 
Entity Instance  | 
host=<hostname>.ptp OR host=<hostname>.ptp=no-lock OR host=<hostname>.ptp=<interface>.unsupported=hardware-timestamping OR host=<hostname>.ptp=<interface>.unsupported=software-timestamping OR host=<hostname>.ptp=<interface>.unsupported=legacy-timestamping OR host=<hostname>.ptp=out-of-tolerance OR host=<hostname>.instance=<instance>.ptp=out-of-tolerance OR host=<hostname>.interface=<interface>.ptp=signal-loss  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
[‘major’, ‘minor’]  | 
Proposed Repair Action  | 
Monitor and if condition persists, contact next level of support.  | 
Management Affecting Severity  | 
none  | 
Alarm ID: 100.120  | 
Controllers running mismatched kernels.  | 
Entity Instance  | 
host=<hostname>.kernel=<kernel>  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
minor  | 
Proposed Repair Action  | 
Modify controllers using ‘system host-kernel-modify’ so that both are running the desired ‘standard’ or ‘lowlatency’ kernel.  | 
Management Affecting Severity  | 
none  | 
Alarm ID: 100.121  | 
Host not running the provisioned kernel.  | 
Entity Instance  | 
host=<hostname>.kernel=<kernel>  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Retry ‘system host-kernel-modify’ and if condition persists, contact next level of support.  | 
Management Affecting Severity  | 
major  | 
Alarm ID: 100.150  | 
service open file descriptor has reached its limit service open file descriptor is approaching to its limit  | 
Entity Instance  | 
host=<hostname>.resource_type=file-descriptor.service_name=<service-name>  | 
Degrade Affecting Severity:  | 
critical  | 
Severity:  | 
[‘critical’, ‘major’]  | 
Proposed Repair Action  | 
swact to the other controller if it is available  | 
Management Affecting Severity  | 
critical  |