900 Series Alarm Messages¶
Alarm Severities
One or more of the following severity levels is associated with each alarm.
CriticalIndicates that a platform service affecting condition has occurred and immediate corrective action is required. (A mandatory platform service has become totally out of service and its capability must be restored.)
MajorIndicates that a platform service affecting condition has developed and urgent corrective action is required. (A mandatory platform service has developed a severe degradation and its full capability must be restored.)
- or -
An optional platform service has become totally out of service and its capability should be restored.
MinorIndicates that a platform non-service affecting fault condition has developed and corrective action should be taken in order to prevent a more serious fault. (The fault condition is not currently impacting / degrading the capability of the platform service.)
WarningIndicates the detection of a potential or impending service affecting fault. Action should be taken to further diagnose and correct the problem in order to prevent it from becoming a more serious service affecting fault.
Alarm ID: 900.001  | 
Patching operation in progress.  | 
Entity Instance  | 
host=controller  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
minor  | 
Proposed Repair Action  | 
Complete reboots of affected hosts.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.002  | 
Patch host install failure. Command “sw-patch host-install” failed.  | 
Entity Instance  | 
host=<hostname>  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Undo patching operation. Check patch logs on the target host (i.e. /var/log/patching.log)  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.003  | 
A patch with state ‘obsolete’ in its metadata has been uploaded.  | 
Entity Instance  | 
host=controller  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
warning  | 
Proposed Repair Action  | 
Remove and delete obsolete patches.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.004  | 
The upgrade and running software version do not match. Command host-upgrade failed.  | 
Entity Instance  | 
host=<hostname>  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Reinstall host to update applied load.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.005  | 
System Upgrade in progress.  | 
Entity Instance  | 
host=controller  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
minor  | 
Proposed Repair Action  | 
No action required.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.006  | 
Device image update operation in progress.  | 
Entity Instance  | 
host=controller  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
minor  | 
Proposed Repair Action  | 
Complete reboots of affected hosts.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.007  | 
Kubernetes upgrade in progress.  | 
Entity Instance  | 
host=controller  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
minor  | 
Proposed Repair Action  | 
No action required.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.008  | 
Kubernetes rootca update in progress  | 
Entity Instance  | 
host=controller  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
minor  | 
Proposed Repair Action  | 
Wait for kubernetes rootca procedure to complete  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.009  | 
Kubernetes root CA update aborted, certificates may not be fully updated. Command “system kube-rootca-update-abort” has been run.  | 
Entity Instance  | 
host=controller  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
minor  | 
Proposed Repair Action  | 
Fully update certificates by a new root CA update.  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.010  | 
System Config update in progress  | 
Entity Instance  | 
host=controller  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
minor  | 
Proposed Repair Action  | 
Wait for system config update to complete  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.011  | 
System Config update aborted, configurations may not be fully updated  | 
Entity Instance  | 
host=<hostname>  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
minor  | 
Proposed Repair Action  | 
Lock the host, wait for the host resource in the deployment namespace to become in-sync, then unlock the host  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.020  | 
Deploy host completed with success  | 
Entity Instance  | 
host=<hostname>  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
warning  | 
Proposed Repair Action  | 
Unlock host  | 
Management Affecting Severity  | 
none  | 
Alarm ID: 900.021  | 
Deploy host failed  | 
Entity Instance  | 
host=<hostname>  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Check the logs for errors, fix the issues manually and retry  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.022  | 
Clean up deployment data  | 
Entity Instance  | 
host=<hostname>  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
warning  | 
Proposed Repair Action  | 
software deploy delete  | 
Management Affecting Severity  | 
none  | 
Alarm ID: 900.023  | 
Software release deploy operation in progress.  | 
Entity Instance  | 
host=controller  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
minor  | 
Proposed Repair Action  | 
Complete release deploy.  | 
Management Affecting Severity  | 
none  | 
Alarm ID: 900.024  | 
A release with state ‘unavailable’ is present.  | 
Entity Instance  | 
host=controller  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
warning  | 
Proposed Repair Action  | 
Delete obsolete releases using “software delete <release>”.  | 
Management Affecting Severity  | 
none  | 
Alarm ID: 900.101  | 
Software patch auto-apply in progress  | 
Entity Instance  | 
orchestration=sw-patch  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Wait for software patch auto-apply to complete; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.102  | 
Software patch auto-apply aborting  | 
Entity Instance  | 
orchestration=sw-patch  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Wait for software patch auto-apply abort to complete; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.103  | 
Software patch auto-apply failed. Command “sw-manager patch-strategy apply” failed.  | 
Entity Instance  | 
orchestration=sw-patch  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
critical  | 
Proposed Repair Action  | 
Attempt to apply software patches manually; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.201  | 
Software deploy auto-apply in progress  | 
Entity Instance  | 
orchestration=sw-deploy  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Wait for software deploy auto-apply to complete; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.202  | 
Software deploy auto-apply aborting  | 
Entity Instance  | 
orchestration=sw-deploy  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Wait for software deploy auto-apply abort to complete; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.203  | 
Software deploy auto-apply failed. Command “sw-manager update-strategy apply” failed  | 
Entity Instance  | 
orchestration=sw-deploy  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
critical  | 
Proposed Repair Action  | 
Attempt to apply software deploy manually; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.231  | 
Software deploy state out of sync  | 
Entity Instance  | 
orchestration=sw-deploy  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Wait for the deployment on the active controller to complete. If problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.301  | 
Firmware Update auto-apply in progress  | 
Entity Instance  | 
orchestration=fw-update  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Wait for firmware update auto-apply to complete; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.302  | 
Firmware Update auto-apply aborting  | 
Entity Instance  | 
orchestration=fw-update  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Wait for firmware update auto-apply abort to complete; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.303  | 
Firmware Update auto-apply failed. Command “sw-manager kube-rootca-update-strategy apply” failed.  | 
Entity Instance  | 
orchestration=fw-update  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
critical  | 
Proposed Repair Action  | 
Attempt to apply firmware update manually; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.501  | 
Kubernetes rootca update auto-apply in progress  | 
Entity Instance  | 
orchestration=kube-rootca-update  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Wait for kubernetes rootca update auto-apply to complete; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.502  | 
Kubernetes rootca update auto-apply aborting  | 
Entity Instance  | 
orchestration=kube-rootca-update  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Wait for kubernetes rootca update auto-apply abort to complete; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.503  | 
Kubernetes rootca update auto-apply failed. Command “sw-manager kube-upgrade-strategy apply” failed.  | 
Entity Instance  | 
orchestration=kube-rootca-update  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
critical  | 
Proposed Repair Action  | 
Attempt to apply kubernetes rootca update manually; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.601  | 
System config update auto-apply in progress  | 
Entity Instance  | 
orchestration=system-config-update  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Wait for system config update auto-apply to complete; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.602  | 
System config update auto-apply aborting  | 
Entity Instance  | 
orchestration=system-config-update  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
Wait for system config update auto-apply abort to complete; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.603  | 
System config update auto-apply failed. Command “sw-manager kube-upgrade-strategy apply” failed  | 
Entity Instance  | 
orchestration=system-config-update  | 
Degrade Affecting Severity:  | 
none  | 
Severity:  | 
critical  | 
Proposed Repair Action  | 
Attempt to apply system config update manually; if problem persists contact next level of support  | 
Management Affecting Severity  | 
warning  | 
Alarm ID: 900.701  | 
Node <hostname> tainted.  | 
Entity Instance  | 
host=<hostname>  | 
Degrade Affecting Severity:  | 
major  | 
Severity:  | 
major  | 
Proposed Repair Action  | 
“Execute ‘kubectl taint nodes <hostname> services=disabled:NoExecute-’ If it fails, Execute ‘system host-lock <hostname>’ followed by ‘system host-unlock <hostname>’. If issue still persists, contact next level of support.”  | 
Management Affecting Severity  | 
warning  |