docs/doc/source/fault-mgmt/kubernetes/500-series-alarm-messages.rst
Ron Stone 52b70f81c2 Alarm Expiring or Expired Certificates
Added topic on new expiring/expired cert alarms.
Added 2x alarms to 500 series alarms messages page. NB. Details need to be confirmed.
Minor update for clarity around use of kubernetes edit ...
Added sample fm output
Updtes to alarm definitions based on events.yaml
Incorporated (Word) updates from Greg W.
Patchset 4 review updates.
Patchset 5 review updates.
Fixed merge conflict in sec/kub/index
Patchset 7 review updates.
Patchset 8 review update (note about cert expiry check frequency)

Story: 2008946
Task: 43568

Signed-off-by: Ron Stone <ronald.stone@windriver.com>
Change-Id: Ifeeba7484e49abcaf2d1ad2afc9afc876d479ded
2021-11-26 11:09:14 -05:00

3.1 KiB
Raw Blame History

500 Series Alarm Messages

The system inventory and maintenance service reports system changes with different degrees of severity. Use the reported alarms to monitor the overall health of the system.

Alarm ID: 500.100 initialization failed on host.
Entity Instance tenant=<tenant-uuid>
Degrade Affecting Severity: None
Severity: M
Proposed Repair Action Reinstall HTTPS certificate; if problem persists contact next level of support.

Alarm ID: 500.101 Developer patch certificate enabled.
Entity Instance host=controller
Degrade Affecting Severity: None
Severity: C
Proposed Repair Action Reinstall system to disable developer certificate and remove untrusted patches.

Alarm ID: 500.200 Certificate system certificate-show <uuid>' (mode=<ssl/ssl_ca/docker_registry/openstack/openstack_ca>) expiring soon on <date>. OR Certificate <Namespace>/<Certificate/Secret> expiring soon on <date>. OR Certificate <k8sRootCA/EtcdCA> expiring soon on <date>. system.certificate.k8sRootCA
Entity Instance system.certificate.mode=<mode>.uuid=<uuid> OR namespace=<namespace-name>.certificate=<certificate-name> OR namespace=<namespace-name>.secret=<secret-name>
Degrade Affecting Severity: None
Severity: M
Proposed Repair Action Renew certificate for the entity identified.
Alarm_Type: operational-violation
Probable_Cause: certificate-expiration
Service_Affecting: False
Suppression: False
Management_Affecting_Severity: none

Alarm ID: 500.210 Certificate system certificate-show <uuid>' (mode=<ssl/ssl_ca/docker_registry/openstack/openstack_ca>) expired. OR Certificate <Namespace>/<Certificate/Secret> expired. OR Certificate <k8sRootCA/EtcdRootCA> expired.
Entity Instance system.certificate.mode=<mode>.uuid=<uuid> OR namespace=<namespace-name>.certificate=<certificate-name> OR namespace=<namespace-name>.secret=<secret-name> OR system.certificate.k8sRootCA
Degrade Affecting Severity: None
Severity: C
Proposed Repair Action Renew certificate for the entity identified.
Inhibit_Alarms: Alarm_Type: operational-violation
Probable_Cause: certificate-expiration
Service_Affecting: False
Suppression: False
Management_Affecting_Severity: none