hfarhat 966dade9b8 Rook Migration: Stop SM failover during removal of ceph
During the migration when the services are deprovisioned,
SM may crash in certain scenarios, taking the standby host
down.

Since SM can recover from this, we are preventing the SM
failover audit to happen during the migration, by checking
for a signal file present in the system.

We log every 200 times, which gives us a log every 10 seconds,
since the audit runs every 50 ms.

Reintroducing previously reverted change:

https://review.opendev.org/c/starlingx/ha/+/979860

Test Plan:

  - PASS: DX VM redeploy migration
  - PASS: DX HW redeploy migration

Partial-bug: 2143592

Change-Id: If31313e526804074518c77dd77c70a6c90068139
Signed-off-by: Heitor Farhat <heitor.farhat@windriver.com>
2026-03-11 15:27:47 -03:00
2022-05-31 14:18:44 +00:00
2018-09-27 10:14:44 -07:00
2019-04-19 19:52:24 +00:00
2025-01-22 09:51:39 -05:00
2023-07-19 12:28:24 -03:00
2018-05-31 07:36:26 -07:00
2023-07-19 12:28:24 -03:00

ha

The starlingx/ha repository handles High Availability services1.

Its key component is the StarlingX Service Management (SM), which coordinates the StarlingX services.

This repository is not intended to be developed standalone, but rather as part of the StarlingX Source System, which is defined by the StarlingX manifest2.

References


  1. https://docs.starlingx.io/api-ref/ha↩︎

  2. https://opendev.org/starlingx/manifest.git↩︎

Description
StarlingX High Availability/Process Monitoring/Service Management
Readme 7 MiB
Languages
C 70.2%
Python 18.5%
Shell 7.6%
C++ 3.3%
Makefile 0.4%