metal/mtce/src/heartbeat
Eric MacDonald a4238c2a35 Add 50 byte hostname support to maintenance
Hosts with hostnames longer than 31 characters do not
go online (locked-disabled-online) after installation.

This update enhances maintenance messaging to support
up to 50 byte/character hostnames.

System Install:
---------------
PASS: Verify system install
PASS: Verify AIO system install (regression)
PASS: Verify system install with long hostnames, deployment-config.yaml
PASS: Verify mtcAgent process startup/restart logs
PASS: Verify hbsAgent process startup/restart logs (active controller)
PASS: Verify hbsAgent process startup/restart logs (standby controller)
PASS: Verify hwmond process startup/restart logs
PASS: Verify guestAgent process startup/restart logs
PASS: Verify all common maintenance daemons startup/restart logs
PASS: Verify patch applies and removes cleanly

PASS: Verify long hostname Add ; inventory distribution
PASS: Verify short hostname Add ; inventory distribution

Long Hostname Handling:
-----------------------
PASS: Verify host name support for up to 50 and 51 byte hostnames

Heartbeat Monitoring:
---------------------
PASS: Verify cluster-host interface link down handling.
PASS: Verify graceful recovery from host reboot.
PASS: Verify pmond process failure and recovery cycle.

Maintenance Actions:
--------------------
PASS: Verify host install with 50 byte hostname
PASS: Verify host lock
PASS: Verify host unlock
PASS: Verify host reboot
PASS: Verify host reinstall
PASS: Verify host delete (no core dump / all daemon logs)
PASS: Verify host power-off
PASS: Verify host power-on
PASS: Verify BMC State Info
PASS: Verify lock and unlock storage node
PASS: Controller Swact over and Back
PASS: Verify thresholded heartbeat failure handling
PASS: Verify node locked flag file
PASS: Verify no core dumps during testiong

Hardware Monitor:
-----------------
PASS: Verify BMC Provisioning/Reprovisioning/Deprovisioning
PASS: Verify Inventory Add/Delete/Modify
PASS: Verify Sensor Model and Monitoring
PASS: Verify Sensor Model Relearn
PASS: Verify Alarming and Logs
PASS: Verify Sensor Action, Interval modification
PASS: Verify Critical Sensor Action handling (ignore, log, alarm, reset, power cycle)

Guest Agent:
------------
PASS: Verify inventory add and delete

Process Monitor:
----------------
PASS: Verify process monitor logs
PASS: Verify process monitor events into mtcAgent
PASS: Verify process monitor failure alarming and recovery clear.
PASS: Verify process monitor regression script (test-pmon.sh -c restart)
PASS: Verify process monitor regression script (test-pmon.sh -c kill)
PASS: Verify process monitor regression script (test-pmon-action.sh)
PASS: Verify critical process failure handling
PASS: Verify major process failure handling

Collectd Monitoring:
-----------------
PASS: Verify collectd monitoring for long hostname hosts

Regression:
-----------
PASS: Verify mtce daemon sigal handling (test-signals.sh)

Change-Id: If22ab081397ec1e8b24f20aad8c99f8079cb98a5
Closes-Bug: 1824429
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
2019-07-12 12:20:08 +00:00
..
Makefile Add EXTRALDFLAGS to linker in a number of Makefiles 2019-02-28 22:34:54 -06:00
hbsAgent.cpp Add 50 byte hostname support to maintenance 2019-07-12 12:20:08 +00:00
hbsAlarm.cpp Refactor infrastructure network in mtce code 2019-04-18 09:32:41 -04:00
hbsAlarm.h Refactor infrastructure network in mtce code 2019-04-18 09:32:41 -04:00
hbsBase.h Add 50 byte hostname support to maintenance 2019-07-12 12:20:08 +00:00
hbsClient.cpp Add 50 byte hostname support to maintenance 2019-07-12 12:20:08 +00:00
hbsCluster.cpp Make heartbeat cluster state change notifications immediate 2019-06-06 21:30:36 -04:00
hbsCluster.h Mtce: Add heartbeat cluster information for SM query 2018-10-05 22:47:17 +00:00
hbsPmon.cpp Decouple Guest-server/agent from stx-metal 2018-09-18 17:15:08 -04:00
hbsStubs.cpp MTCE: reading BMC passwords from Barbican secret storage. 2019-02-14 09:04:46 -05:00
hbsUtil.cpp Make heartbeat cluster state change notifications immediate 2019-06-06 21:30:36 -04:00
mtceHbsCluster.h Refactor infrastructure network in mtce code 2019-04-18 09:32:41 -04:00