StarlingX Integration and packaging
Go to file
Felipe Sanches Zanoni e0f83421e2 Start Ceph processes before start monitoring
The Ceph processes were not being started before Pmon started its
monitoring. Then Pmon was detecting Ceph processes failures when the
host was being initialized.

Besides that, the puppet manifest was creating the .ceph_started flag
at the wrong time, preventing controlling the Ceph processes at boot
up.

The ceph.sh script is modified to initialize all the Ceph processes
according to the system mode and the system type.

For a Simplex, all processes are started in sequence (mon, mds, osd).

For the Duplex, only the fixed monitor and the mds are started. SM
will start the other processes.

For the Standard, it will start with no parameter first, and then the
mds.

After all processes were initialized, the .ceph_started flag is
created so Pmon and SM can start monitoring Ceph processes.

The ceph.sh script will always return success to prevent the host to
not get enabled when Ceph fails to start. If any Ceph process fails
when starting, the Pmon and SM will try to recover them and raise
alarms accondingly.

Additional changes:
 - Added a 'forcestart' action to the ceph-init-wrapper script to
   bypass the .ceph_started flag. The 'start' action on the
   ceph-init-wrapper script verifies the .ceph_started_flag and will
   skip the initialization if it does not exist. Creating the flag
   before calling the 'start' command would trigger a race condition
   between Pmon/SM and the ceph.sh script.

 - Improved the logging to add timestamp to each line.

Test-Plan:
  PASS: AIO-SX deploy and lock/unlock, checking Ceph is running as
        expected and pmon log shows no errors for Ceph processes.

  PASS: For AIO-DX, AIO-DX+, Standard, Storage:
        Deploy, lock/unlock each host, DOR test, force reboot the
        active controller and force reboot the standby controller.
        Check Ceph is running as expected and the pmon log shows
        no errors for Ceph processes.

  PASS: Apply Ceph after the deploy (runtime) and check if the flag
        .ceph_started has been created.

  PASS: Deploy AIO-SX with Ceph configured with 1 OSD. Force corrupt
        OSD data by deleting some files from the disk. Reboot the
        host. Check the log /var/log/ceph/ceph-init.log if the start
        osd action returned error and check if the alarm raised for
        the OSD.

Partial-bug: 2083056

Signed-off-by: Felipe Sanches Zanoni <Felipe.SanchesZanoni@windriver.com>
Change-Id: I41cad8190616909f2a8be1d27c2ef8dd5a75a6a3
2024-10-08 17:45:10 -03:00
base Disable usm-initialize.service in presets 2024-08-01 12:29:01 -03:00
bmc Merge "Remove CentOS/OpenSUSE build support" 2024-05-22 15:14:42 +00:00
centos-debian-compat Update integ debian package ver based on git 2023-03-01 18:53:50 +00:00
ceph/ceph Start Ceph processes before start monitoring 2024-10-08 17:45:10 -03:00
config Revert "Dynamize Postgres Auth Method Definition" 2024-09-18 18:12:59 +00:00
database/mariadb/debian Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
devstack Relocated some packages to repo 'utilities' 2019-09-05 20:31:36 -04:00
doc Fix tox-docs failing sphinx 2023-08-29 16:52:04 -03:00
docker/python-docker/debian Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
filesystem Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
golang-github-dev fix golang-github-golang-jwt-jwt-dev url 2024-01-31 10:39:34 -05:00
gpu/gpu-operator Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
grub Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
kata-containers/debian Add kata containers support for Starlingx 2023-12-04 11:58:54 +00:00
kubernetes Set CoreDNS memory request to zero. 2024-09-16 07:39:38 -04:00
ldap Merge "Change ldapsetpasswd error message" 2024-06-03 16:24:57 +00:00
networking Correct support for gateway address checking in dual-stack 2024-09-09 10:30:31 -03:00
ostree Support the single driver switch 2024-07-11 07:45:02 +00:00
python Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
releasenotes Switch to newer openstackdocstheme and reno versions 2020-06-04 14:28:48 +02:00
requests-toolbelt Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
security Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
storage-drivers/trident-installer/debian Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
tools Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
virt Remove CentOS/OpenSUSE build support 2024-05-01 16:39:19 -04:00
.gitignore Add Docker Registry Token Server 2019-01-08 11:42:04 -05:00
.gitreview OpenDev Migration Patch 2019-04-19 19:52:31 +00:00
.pylintrc tox: fixed warnings 2023-09-06 17:54:55 -03:00
.yamllint Add .yamllint file 2021-09-09 19:05:36 +03:00
.zuul.yaml Update pylint test to use debian-bullseye nodeset 2024-08-23 12:33:23 -05:00
bindep.txt Fix pylint zuul jobs failing due to libvirt-python and pkgconfig 2019-07-04 14:14:39 -05:00
CONTRIBUTORS.wrs StarlingX open source release updates 2018-05-31 07:36:35 -07:00
debian_build_layer.cfg Add debian_build_layer.cfg file 2021-10-05 14:08:19 -04:00
debian_iso_image.inc Patch ca-certificates package 2024-07-15 13:47:07 +00:00
debian_pkg_dirs Patch ca-certificates package 2024-07-15 13:47:07 +00:00
debian_stable_docker_images.inc Disable n3000 container build 2024-07-03 10:43:52 -04:00
distroless_stable_docker_images.inc Remove Armada related packages from stx build 2023-09-27 18:58:13 +00:00
LICENSE StarlingX open source release updates 2018-05-31 07:36:35 -07:00
README.rst Followup opendev cleanup and test jobs 2019-04-21 09:23:19 -05:00
test-requirements.txt Update pylint test to use debian-bullseye nodeset 2024-08-23 12:33:23 -05:00
tox.ini Update pylint test to use debian-bullseye nodeset 2024-08-23 12:33:23 -05:00

integ

StarlingX Integration