Increase Maintenance Heartbeat period from 100 to 1000 msecs

This update changes the default Maintenance Heartbeat period
from 100 msecs to 1 second (1000 msecs).

Test Plan:

PASS: Verify full deployment of WRCP AIO DX Plus 1 worker
PASS: Verify full deployment of 2+4+2 Standard System
PASS: Verify heartbeat period default is 1 second

Regression:

PASS: Verify AIO DX enable handler heartbeat soak
PASS: Verify AIO DX add handler heartbeat soak
PASS: Verify Standard controller enable handler heartbeat soak
PASS: Verify Standard controller add handler heartbeat soak
PASS: Verify Standard worker node enable handler heartbeat soak
PASS: Verify Standard worker node add handler heartbeat soak

PASS: Verify heartbeat loss handling with new default heartbeat period
PASS: Verify MNFA handling with new default heartbeat period

PASS: Verify hostwd quorum process failure fault detection and
      handling timing is not affected by new default heartbeat period.

PASS: Run WRCP DX Sanity on AIO DX and 2+4+2 Standard system

Depends-On: https://review.opendev.org/c/starlingx/config/+/955893
Partial-Fix: 2117252
Change-Id: Iaae2cc0efca92aa751e9404c886ac569d238be86
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
Eric MacDonald
2025-07-25 08:37:41 -04:00
parent 10aa511bda
commit 487d14aafc
3 changed files with 4 additions and 4 deletions


@@ -32,7 +32,7 @@ see :ref:`The Life Cycle of a Host <the-life-cycle-of-a-host-93640aa2b707>`.
 | c3a9... | platform | maintenance | heartbeat_degrade_threshold | 6 |
 | 9089... | platform | maintenance | heartbeat_failure_action | fail |
 | 8df8... | platform | maintenance | heartbeat_failure_threshold | 10 |
-| 16b5... | platform | maintenance | heartbeat_period | 100 |
+| 16b5... | platform | maintenance | heartbeat_period | 1000 |
 | 4712... | platform | maintenance | mnfa_threshold | 2 |
 | 4ba7... | platform | maintenance | mnfa_timeout | 0 |
 +---------+----------+---------------+-----------------------------+-------+


@@ -35,7 +35,7 @@ see :ref:`The Life Cycle of a Host <the-life-cycle-of-a-host-93640aa2b707>`.
 | c3a9... | platform | maintenance | heartbeat_degrade_threshold | 6 |
 | 9089... | platform | maintenance | heartbeat_failure_action | fail |
 | 8df8... | platform | maintenance | heartbeat_failure_threshold | 10 |
-| 16b5... | platform | maintenance | heartbeat_period | 100 |
+| 16b5... | platform | maintenance | heartbeat_period | 1000 |
 | 4712... | platform | maintenance | mnfa_threshold | 2 |
 | 4ba7... | platform | maintenance | mnfa_timeout | 0 |
 +---------+----------+---------------+-----------------------------+-------+
@@ -54,7 +54,7 @@ see :ref:`The Life Cycle of a Host <the-life-cycle-of-a-host-93640aa2b707>`.
 **heartbeat_period**
 The time in milliseconds between heartbeat challenges from the
 controller to the other hosts (100-1000 ms). The default is
-100 ms.
+1000 ms.
 **heartbeat_degrade_threshold**
 The number of consecutive missing responses to heartbeat challenges


@@ -36,7 +36,7 @@ immediately in the event of a persistent loss of maintenance heartbeat.
 | c3a9... | platform | maintenance | heartbeat_degrade_threshold | 6 |
 | 9089... | platform | maintenance | heartbeat_failure_action | fail |
 | 8df8... | platform | maintenance | heartbeat_failure_threshold | 10 |
-| 16b5... | platform | maintenance | heartbeat_period | 100 |
+| 16b5... | platform | maintenance | heartbeat_period | 1000 |
 | 4712... | platform | maintenance | mnfa_threshold | 2 |
 | 4ba7... | platform | maintenance | mnfa_timeout | 0 |
 +---------+----------+---------------+-----------------------------+-------+
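
Reviewer note: a rough way to see how the new default interacts with the
unchanged thresholds in the tables above. This is only a sketch. It assumes
the nominal detection time is the heartbeat period multiplied by the number
of consecutive missed responses, which this change does not state, and it
ignores any jitter or processing delay in the maintenance services. The
function names are hypothetical and are not part of any StarlingX API.

```python
# Threshold values copied from the service-parameter tables in this change.
HEARTBEAT_DEGRADE_THRESHOLD = 6
HEARTBEAT_FAILURE_THRESHOLD = 10


def nominal_detection_ms(period_ms: int, threshold: int) -> int:
    """Assumed model: one missed response per period, `threshold` in a row."""
    return period_ms * threshold


def summarize(period_ms: int) -> dict:
    """Nominal time to reach the degrade and failure thresholds."""
    return {
        "degrade_ms": nominal_detection_ms(period_ms, HEARTBEAT_DEGRADE_THRESHOLD),
        "failure_ms": nominal_detection_ms(period_ms, HEARTBEAT_FAILURE_THRESHOLD),
    }


if __name__ == "__main__":
    # Compare the old default (100 ms) with the new default (1000 ms).
    for period in (100, 1000):
        print(period, summarize(period))
```

Under that assumption, the failure threshold of 10 is reached after about
1 second with the old default and about 10 seconds with the new one, so
deployments that rely on sub-second detection may want to set
heartbeat_period explicitly.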