Increase Maintenance Heartbeat period from 100 to 1000 msecs

This update changes the default Maintenance Heartbeat period
from 100 msecs to 1 second (1000 msecs).

Test Plan:

PASS: Verify full deployment of WRCP AIO DX Plus 1 worker
PASS: Verify full deployment of 2+4+2 Standard System
PASS: Verify heartbeat period default is 1 second

Regression:

PASS: Verify AIO DX enable handler heartbeat soak
PASS: Verify AIO DX add handler heartbeat soak
PASS: Verify Standard controller enable handler heartbeat soak
PASS: Verify Standard controller add handler heartbeat soak
PASS: Verify Standard worker node enable handler heartbeat soak
PASS: Verify Standard worker node add handler heartbeat soak

PASS: Verify heartbeat loss handling with new default heartbeat period
PASS: Verify MNFA handling with with new default heartbeat period

PASS: Verify hostwd quorum process failure fault detection and
      handling timing is not effected by new default heartbeat period.

PASS: Run WRCP DX Sanity on AIO DX and 2+4+2 Sandard system

Depends-On: https://review.opendev.org/c/starlingx/config/+/955893
Partial-Fix: 2117252
Change-Id: Iaae2cc0efca92aa751e9404c886ac569d238be86
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
This commit is contained in:
Eric MacDonald
2025-07-25 08:37:41 -04:00
parent 10aa511bda
commit 487d14aafc
3 changed files with 4 additions and 4 deletions

View File

@@ -32,7 +32,7 @@ see :ref:`The Life Cycle of a Host <the-life-cycle-of-a-host-93640aa2b707>`.
| c3a9... | platform | maintenance | heartbeat_degrade_threshold | 6 | | c3a9... | platform | maintenance | heartbeat_degrade_threshold | 6 |
| 9089... | platform | maintenance | heartbeat_failure_action | fail | | 9089... | platform | maintenance | heartbeat_failure_action | fail |
| 8df8... | platform | maintenance | heartbeat_failure_threshold | 10 | | 8df8... | platform | maintenance | heartbeat_failure_threshold | 10 |
| 16b5... | platform | maintenance | heartbeat_period | 100 | | 16b5... | platform | maintenance | heartbeat_period | 1000 |
| 4712... | platform | maintenance | mnfa_threshold | 2 | | 4712... | platform | maintenance | mnfa_threshold | 2 |
| 4ba7... | platform | maintenance | mnfa_timeout | 0 | | 4ba7... | platform | maintenance | mnfa_timeout | 0 |
+---------+----------+---------------+-----------------------------+-------+ +---------+----------+---------------+-----------------------------+-------+

View File

@@ -35,7 +35,7 @@ see :ref:`The Life Cycle of a Host <the-life-cycle-of-a-host-93640aa2b707>`.
| c3a9... | platform | maintenance | heartbeat_degrade_threshold | 6 | | c3a9... | platform | maintenance | heartbeat_degrade_threshold | 6 |
| 9089... | platform | maintenance | heartbeat_failure_action | fail | | 9089... | platform | maintenance | heartbeat_failure_action | fail |
| 8df8... | platform | maintenance | heartbeat_failure_threshold | 10 | | 8df8... | platform | maintenance | heartbeat_failure_threshold | 10 |
| 16b5... | platform | maintenance | heartbeat_period | 100 | | 16b5... | platform | maintenance | heartbeat_period | 1000 |
| 4712... | platform | maintenance | mnfa_threshold | 2 | | 4712... | platform | maintenance | mnfa_threshold | 2 |
| 4ba7... | platform | maintenance | mnfa_timeout | 0 | | 4ba7... | platform | maintenance | mnfa_timeout | 0 |
+---------+----------+---------------+-----------------------------+-------+ +---------+----------+---------------+-----------------------------+-------+
@@ -54,7 +54,7 @@ see :ref:`The Life Cycle of a Host <the-life-cycle-of-a-host-93640aa2b707>`.
**heartbeat_period** **heartbeat_period**
The time in milliseconds between heartbeat challenges from the The time in milliseconds between heartbeat challenges from the
controller to the other hosts (1001000 ms). The default is controller to the other hosts (1001000 ms). The default is
100 ms. 1000 ms.
**heartbeat_degrade_threshold** **heartbeat_degrade_threshold**
The number of consecutive missing responses to heartbeat challenges The number of consecutive missing responses to heartbeat challenges

View File

@@ -36,7 +36,7 @@ immediately in the event of a persistent loss of maintenance heartbeat.
| c3a9... | platform | maintenance | heartbeat_degrade_threshold | 6 | | c3a9... | platform | maintenance | heartbeat_degrade_threshold | 6 |
| 9089... | platform | maintenance | heartbeat_failure_action | fail | | 9089... | platform | maintenance | heartbeat_failure_action | fail |
| 8df8... | platform | maintenance | heartbeat_failure_threshold | 10 | | 8df8... | platform | maintenance | heartbeat_failure_threshold | 10 |
| 16b5... | platform | maintenance | heartbeat_period | 100 | | 16b5... | platform | maintenance | heartbeat_period | 1000 |
| 4712... | platform | maintenance | mnfa_threshold | 2 | | 4712... | platform | maintenance | mnfa_threshold | 2 |
| 4ba7... | platform | maintenance | mnfa_timeout | 0 | | 4ba7... | platform | maintenance | mnfa_timeout | 0 |
+---------+----------+---------------+-----------------------------+-------+ +---------+----------+---------------+-----------------------------+-------+