900 Series Alarm Messages

Alarm Severities

One or more of the following severity levels is associated with each alarm.

Critical

Indicates that a platform service affecting condition has occurred and immediate corrective action is required. (A mandatory platform service has become totally out of service and its capability must be restored.)

Major

Indicates that a platform service affecting condition has developed and urgent corrective action is required. (A mandatory platform service has developed a severe degradation and its full capability must be restored.)

- or -

An optional platform service has become totally out of service and its capability should be restored.

Minor

Indicates that a platform non-service affecting fault condition has developed and corrective action should be taken in order to prevent a more serious fault. (The fault condition is not currently impacting / degrading the capability of the platform service.)

Warning

Indicates the detection of a potential or impending service affecting fault. Action should be taken to further diagnose and correct the problem in order to prevent it from becoming a more serious service affecting fault.


Alarm ID: 900.001

Patching operation in progress.

Entity Instance

host=controller

Degrade Affecting Severity:

none

Severity:

minor

Proposed Repair Action

Complete reboots of affected hosts.

Management Affecting Severity

warning


Alarm ID: 900.002

Patch host install failure. Command “sw-patch host-install” failed.

Entity Instance

host=<hostname>

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Undo patching operation. Check patch logs on the target host (i.e. /var/log/patching.log)

Management Affecting Severity

warning


Alarm ID: 900.003

A patch with state ‘obsolete’ in its metadata has been uploaded.

Entity Instance

host=controller

Degrade Affecting Severity:

none

Severity:

warning

Proposed Repair Action

Remove and delete obsolete patches.

Management Affecting Severity

warning


Alarm ID: 900.004

The upgrade and running software version do not match. Command host-upgrade failed.

Entity Instance

host=<hostname>

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Reinstall host to update applied load.

Management Affecting Severity

warning


Alarm ID: 900.005

System Upgrade in progress.

Entity Instance

host=controller

Degrade Affecting Severity:

none

Severity:

minor

Proposed Repair Action

No action required.

Management Affecting Severity

warning


Alarm ID: 900.006

Device image update operation in progress.

Entity Instance

host=controller

Degrade Affecting Severity:

none

Severity:

minor

Proposed Repair Action

Complete reboots of affected hosts.

Management Affecting Severity

warning


Alarm ID: 900.007

Kubernetes upgrade in progress.

Entity Instance

host=controller

Degrade Affecting Severity:

none

Severity:

minor

Proposed Repair Action

No action required.

Management Affecting Severity

warning


Alarm ID: 900.008

Kubernetes rootca update in progress

Entity Instance

host=controller

Degrade Affecting Severity:

none

Severity:

minor

Proposed Repair Action

Wait for kubernetes rootca procedure to complete

Management Affecting Severity

warning


Alarm ID: 900.009

Kubernetes root CA update aborted, certificates may not be fully updated. Command “system kube-rootca-update-abort” has been run.

Entity Instance

host=controller

Degrade Affecting Severity:

none

Severity:

minor

Proposed Repair Action

Fully update certificates by a new root CA update.

Management Affecting Severity

warning


Alarm ID: 900.010

System Config update in progress

Entity Instance

host=controller

Degrade Affecting Severity:

none

Severity:

minor

Proposed Repair Action

Wait for system config update to complete

Management Affecting Severity

warning


Alarm ID: 900.011

System Config update aborted, configurations may not be fully updated

Entity Instance

host=<hostname>

Degrade Affecting Severity:

none

Severity:

minor

Proposed Repair Action

Lock the host, wait for the host resource in the deployment namespace to become in-sync, then unlock the host

Management Affecting Severity

warning


Alarm ID: 900.020

Deploy host completed with success

Entity Instance

host=<hostname>

Degrade Affecting Severity:

none

Severity:

warning

Proposed Repair Action

Unlock host

Management Affecting Severity

none


Alarm ID: 900.021

Deploy host failed

Entity Instance

host=<hostname>

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Check the logs for errors, fix the issues manually and retry

Management Affecting Severity

warning


Alarm ID: 900.022

Clean up deployment data

Entity Instance

host=<hostname>

Degrade Affecting Severity:

none

Severity:

warning

Proposed Repair Action

software deploy delete

Management Affecting Severity

none


Alarm ID: 900.023

Software release deploy operation in progress.

Entity Instance

host=controller

Degrade Affecting Severity:

none

Severity:

minor

Proposed Repair Action

Complete release deploy.

Management Affecting Severity

none


Alarm ID: 900.024

A release with state ‘unavailable’ is present.

Entity Instance

host=controller

Degrade Affecting Severity:

none

Severity:

warning

Proposed Repair Action

Delete obsolete releases using “software delete <release>”.

Management Affecting Severity

none


Alarm ID: 900.101

Software patch auto-apply in progress

Entity Instance

orchestration=sw-patch

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Wait for software patch auto-apply to complete; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.102

Software patch auto-apply aborting

Entity Instance

orchestration=sw-patch

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Wait for software patch auto-apply abort to complete; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.103

Software patch auto-apply failed. Command “sw-manager patch-strategy apply” failed.

Entity Instance

orchestration=sw-patch

Degrade Affecting Severity:

none

Severity:

critical

Proposed Repair Action

Attempt to apply software patches manually; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.201

Software upgrade auto-apply in progress

Entity Instance

orchestration=sw-upgrade

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Wait for software upgrade auto-apply to complete; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.202

Software upgrade auto-apply aborting

Entity Instance

orchestration=sw-upgrade

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Wait for software upgrade auto-apply abort to complete; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.203

Software upgrade auto-apply failed. Command “sw-manager update-strategy apply” failed

Entity Instance

orchestration=sw-upgrade

Degrade Affecting Severity:

none

Severity:

critical

Proposed Repair Action

Attempt to apply software upgrade manually; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.231

Software deploy state out of sync

Entity Instance

orchestration=sw-upgrade

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Wait for the deployment on the active controller to complete. If problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.301

Firmware Update auto-apply in progress

Entity Instance

orchestration=fw-update

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Wait for firmware update auto-apply to complete; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.302

Firmware Update auto-apply aborting

Entity Instance

orchestration=fw-update

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Wait for firmware update auto-apply abort to complete; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.303

Firmware Update auto-apply failed. Command “sw-manager kube-rootca-update-strategy apply” failed.

Entity Instance

orchestration=fw-update

Degrade Affecting Severity:

none

Severity:

critical

Proposed Repair Action

Attempt to apply firmware update manually; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.501

Kubernetes rootca update auto-apply in progress

Entity Instance

orchestration=kube-rootca-update

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Wait for kubernetes rootca update auto-apply to complete; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.502

Kubernetes rootca update auto-apply aborting

Entity Instance

orchestration=kube-rootca-update

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Wait for kubernetes rootca update auto-apply abort to complete; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.503

Kubernetes rootca update auto-apply failed. Command “sw-manager kube-upgrade-strategy apply” failed.

Entity Instance

orchestration=kube-rootca-update

Degrade Affecting Severity:

none

Severity:

critical

Proposed Repair Action

Attempt to apply kubernetes rootca update manually; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.601

System config update auto-apply in progress

Entity Instance

orchestration=system-config-update

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Wait for system config update auto-apply to complete; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.602

System config update auto-apply aborting

Entity Instance

orchestration=system-config-update

Degrade Affecting Severity:

none

Severity:

major

Proposed Repair Action

Wait for system config update auto-apply abort to complete; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.603

System config update auto-apply failed. Command “sw-manager kube-upgrade-strategy apply” failed

Entity Instance

orchestration=system-config-update

Degrade Affecting Severity:

none

Severity:

critical

Proposed Repair Action

Attempt to apply system config update manually; if problem persists contact next level of support

Management Affecting Severity

warning


Alarm ID: 900.701

Node <hostname> tainted.

Entity Instance

host=<hostname>

Degrade Affecting Severity:

major

Severity:

major

Proposed Repair Action

“Execute ‘kubectl taint nodes <hostname> services=disabled:NoExecute-’

If it fails, Execute ‘system host-lock <hostname>’ followed by

‘system host-unlock <hostname>’.

If issue still persists, contact next level of support.”

Management Affecting Severity

warning