3960 Commits

Author SHA1 Message Date
Vladimir Kozhukalov
693d3a2686 Update Chart.yaml apiVersion to v2
Change-Id: I66dcaedefd0640f8a7b5343363354ba539d70627
2024-12-16 16:48:03 -06:00
Vladimir Kozhukalov
b600c06628 Enable temporarily disabled jobs
Change I7bfdef3ea2128bbb4e26e3a00161fe30ce29b8e7
disabled some jobs that involve scripts from the
OSH git repo because these scripts had to be
aligned with the new values_overrides location and
directory structure.

Change-Id: I7d0509051c8cd563a3269e21fe09eb56dcdb8f37
2024-12-16 10:29:56 -06:00
Vladimir Kozhukalov
8b29037cec Move values overrides to a separate directory
This is the action item to implement the spec:
doc/source/specs/2025.1/chart_versioning.rst

Also add overrides env variables

- OSH_VALUES_OVERRIDES_PATH
- OSH_INFRA_VALUES_OVERRIDES_PATH

This commit temporarily disables all jobs that involve scripts
in the OSH git repo because they need to be updated to work
with the new values_overrides structure in the OSH-infra repo.
Once I4974785c904cf7c8730279854e3ad9b6b7c35498 is merged,
all these disabled test jobs must be re-enabled.

Depends-On: I327103c18fc0e10e989a17f69b3bff9995c45eb4
Change-Id: I7bfdef3ea2128bbb4e26e3a00161fe30ce29b8e7
2024-12-13 12:04:44 -06:00
Sergiy Markin
2b500465da [ceph] Fix for ceph-osd pods restart
This PS updates ceph-osd pod containers making
sure that osd pods are not stuck at deletion. In
this PS we are taking care of another background
process that has to be terminated by preStop
script.

Change-Id: Icebb6119225b4b88fb213932cc3bcf78d650848f
2024-12-12 01:01:28 +00:00
Sergiy Markin
c98ea9ca61 [ceph] Fix for ceph-osd pods restart
This PS updates ceph-osd pod containers making sure
that osd pods are not stuck at deletion.

It adds the missing lifecycle preStop action for the log-runner container.

Change-Id: I8d6853a457d3142c33ca6b5449351d9b05ffacda
2024-12-11 19:35:20 +00:00
Sergiy Markin
7811e90f4e [ceph] Fix for ceph-osd pods restart
This PS updates ceph-osd pod containers making sure
that osd pods are not stuck at deletion. It also
adds a similar lifecycle ondelete hook to kill
the log-runner container process before pod restart.

It also adds the wait_for_degraded_object function to
the helm-test pod to make sure that newly deployed pods
have joined the ceph cluster and it is safe to go
on with the next ceph-osd chart release upgrade.

Change-Id: Ib31a5e1a82526906bff8c64ce1b199e3495b44b2
2024-12-10 16:36:24 +00:00
Zuul
cd1ee943f9 Merge "Remove tini from ceph-osd chart" 2024-12-10 01:22:01 +00:00
SPEARS, DUSTIN (ds443n)
9c55663c87 Remove tini from ceph-osd chart
Remove tini from the ceph daemon because it did not resolve
the issue with the log runner process; that issue will be
resolved in another change, in the post-apply job.

Change-Id: I4ebb1d12e736d387e6e34354619a532dd50dfeae
2024-12-09 14:42:56 -05:00
Vladimir Kozhukalov
7fddc1a8d3 Bump K8s to v1.31
Change-Id: I384b10ef7b2da42d2227b4134e4ece4c5f9aa6d1
2024-12-02 22:00:51 -06:00
Zuul
daff3cbdea Merge "Remove 2023.1 build jobs" 2024-11-26 23:52:05 +00:00
Zuul
a2910463ac Merge "[mariadb] Add probes for exporter" 2024-11-22 10:20:46 +00:00
Zuul
36efabdba8 Merge "Allow to use default storage class" 2024-11-22 00:53:28 +00:00
Zuul
71bdab2ceb Merge "[mariadb] Add terminationGracePeriodSeconds" 2024-11-21 16:52:35 +00:00
Zuul
d3dd74e9a0 Merge "[mariadb] Use service IP to discover endpoints" 2024-11-21 00:18:46 +00:00
Zuul
b490e77dc2 Merge "[mariadb] Implement mariadb upgrade on start" 2024-11-20 14:52:49 +00:00
Zuul
67ce775ca1 Merge "[mariadb] Avoid using deprecated isAlive" 2024-11-20 04:44:35 +00:00
Vasyl Saienko
10e8b39a91 [mariadb] Add probes for exporter
Implement readiness/liveness probes for exporter

Change-Id: I7e73872dd35b8e6adf67d585e7d4d9250eca70c3
2024-11-19 14:28:01 +00:00
Vasyl Saienko
cae49d7455 Allow to use default storage class
When the name of the storage class is specified as default, do not add
the storageClassName option, so that Kubernetes picks a default.

Change-Id: I25c60e49ba770ce10ea2ec68c3555ffea49848fe
2024-11-19 07:49:37 +00:00
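The storage class rule described above can be sketched as follows. This is an illustrative Python rendering, not the chart's Helm template; the function name `build_pvc_spec` and the `ReadWriteOnce` access mode are assumptions made for the example.

```python
def build_pvc_spec(size, storage_class):
    """Build a PersistentVolumeClaim spec as a plain dict (hypothetical helper)."""
    spec = {
        "accessModes": ["ReadWriteOnce"],  # assumed for illustration
        "resources": {"requests": {"storage": size}},
    }
    # The literal name "default" means: leave storageClassName out entirely
    # so that Kubernetes can pick the cluster's default storage class.
    if storage_class != "default":
        spec["storageClassName"] = storage_class
    return spec


default_spec = build_pvc_spec("5Gi", "default")
named_spec = build_pvc_spec("5Gi", "general")
```

Here `default_spec` has no `storageClassName` key, while `named_spec` carries `"general"` (a made-up class name).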
Vasyl Saienko
bb236e0a98 [mariadb] Add terminationGracePeriodSeconds
Allow setting terminationGracePeriodSeconds for the server instance to give
it more time to shut down all clients gracefully.
Increase the timeout to 600 seconds by default.

Change-Id: I1f4ba7d5ca50d1282cedfacffbe818af7ccc60f2
2024-11-19 07:49:33 +00:00
Vasyl Saienko
c2269d70a2 [mariadb] Use service IP to discover endpoints
It was observed that under certain circumstances
galera instances can use old IP address of the node
after pod restart. This patch changes the value of
wsrep_cluster_address variable - instead of listing
all dns names of the cluster nodes the discovery service
IP address is used. In this case cluster_node_address is set to IP
address instead of DNS name - otherwise SST method will fail.

Co-Authored-By: Oleksii Grudev <ogrudev@mirantis.com>

Change-Id: I8059f28943150785abd48316514c0ffde56dfde5
2024-11-19 07:49:29 +00:00
Vasyl Saienko
65ec71b939 [mariadb] Implement mariadb upgrade on start
Call mysql_upgrade during start to check and upgrade if needed

Change-Id: I9c4ac1a5ea5f492282bb6bb1ee9923b036faa998
2024-11-19 06:17:24 +00:00
Vasyl Saienko
f0ad9daa5a [mariadb] Avoid using deprecated isAlive
The method was deprecated and later dropped, switch to is_alive()

Co-Authored-By: dbiletskiy <dbiletskiy@mirantis.com>

Change-Id: Ie259d0e59c68c9884e85025b1e44bcd347f45eff
2024-11-18 07:41:35 +00:00
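For reference, a minimal sketch of the replacement call. `Thread.isAlive()` was removed in Python 3.9, and `Thread.is_alive()` is the supported spelling. The worker function and variable names below are illustrative only and are not taken from the chart.

```python
import threading


def worker(stop_event):
    # Block until the main thread asks us to stop.
    stop_event.wait()


stop = threading.Event()
thread = threading.Thread(target=worker, args=(stop,))
thread.start()

alive_while_running = thread.is_alive()  # replaces the old thread.isAlive()

stop.set()
thread.join()
alive_after_join = thread.is_alive()
```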
Vladimir Kozhukalov
5d943fd72d Remove 2023.1 build jobs
The 2023.1 release is unmaintained since 2024-10-30.
See https://releases.openstack.org/

Change-Id: I8375b16338b172a5875b7a379df085020490305c
2024-11-15 14:40:51 -06:00
Zuul
c77bdd82d4 Merge "Update ceph-osd to be able to use tini" 2024-11-15 18:15:55 +00:00
Zuul
dcddce5bb7 Merge "ovn: fix resources" 2024-11-15 17:59:35 +00:00
Vasyl Saienko
174f6f5bd5 [mariadb] Refactor liveness/readiness probes
* Move all probes into single script to reduce code duplication
* Check free disk percent, fail when we consume 99% to avoid
  data corruption
* Do not restart container when SST is in progress

Change-Id: I6efc7596753dc988aa9edd7ade4d57107db98bdd
2024-11-13 18:37:52 +00:00
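The free disk check from the list above could look roughly like this. It is a sketch under assumptions: the function names are hypothetical, the percentage is computed as used/total, and the real probe script may measure disk usage differently.

```python
import shutil

FAIL_THRESHOLD_PERCENT = 99.0  # threshold named in the commit message


def percent_used(total, used):
    """Return used space as a percentage of total space."""
    return 100.0 * used / total


def disk_is_healthy(total, used, threshold=FAIL_THRESHOLD_PERCENT):
    # Fail the probe once consumption reaches the threshold.
    return percent_used(total, used) < threshold


def path_is_healthy(path, threshold=FAIL_THRESHOLD_PERCENT):
    usage = shutil.disk_usage(path)  # named tuple: total, used, free
    return disk_is_healthy(usage.total, usage.used, threshold)
```

A probe built this way would call `path_is_healthy()` on the data directory and report failure when it returns `False`.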
Vasyl Saienko
842f0f11dc [mariadb] Give more time on resolving configmap update conflicts
Make the 'data too old' timeout dependent on the state report interval.
Increase the timeout to 5 times the report interval.

Change-Id: I0c350f9e64b65546965002d0d6a1082fd91f6f58
2024-11-13 18:37:48 +00:00
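The relationship between the two values can be sketched as below. The names `data_timeout` and `is_too_old` are hypothetical, and the strict comparison is an assumption; only the factor of 5 comes from the commit message.

```python
import time

TIMEOUT_MULTIPLIER = 5  # factor stated in the commit message


def data_timeout(report_interval):
    """Seconds after which reported state counts as 'data too old'."""
    return TIMEOUT_MULTIPLIER * report_interval


def is_too_old(last_report_ts, report_interval, now=None):
    # 'now' is injectable so the check can be exercised without waiting.
    now = time.time() if now is None else now
    return (now - last_report_ts) > data_timeout(report_interval)
```

With a report interval of 30 seconds, for example, state is treated as stale only after more than 150 seconds without an update.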
Oleksii Grudev
cb3afe6f85 Prevent TypeError in get_active_endpoint function
Sometimes the "endpoints_dict" var can evaluate to None,
resulting in a "TypeError: 'NoneType' object is not iterable"
error. This patch catches the exception while getting
the list of endpoints and checks the value of
endpoints_dict. The number of active endpoints is also logged
for debug purposes.

Change-Id: I79f6b0b5ced8129b9a28c120b61e3ee050af4336
2024-11-13 18:37:44 +00:00
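A minimal sketch of the guard described above. The shape of `endpoints_dict` (endpoint name mapped to an active flag) and the `fetch_endpoints` callable are assumptions for illustration; the real get_active_endpoint function is not reproduced here.

```python
import logging

logger = logging.getLogger(__name__)


def get_active_endpoints(fetch_endpoints):
    """Return names of active endpoints, tolerating a missing result."""
    try:
        endpoints_dict = fetch_endpoints()
    except Exception as exc:
        logger.warning("Failed to get endpoints: %s", exc)
        return []
    # Iterating over None is what raises
    # "TypeError: 'NoneType' object is not iterable", so check first.
    if endpoints_dict is None:
        return []
    active = [name for name, is_active in endpoints_dict.items() if is_active]
    logger.debug("Active endpoints: %d", len(active))
    return active
```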
Vasyl Saienko
13a683b9c2 [mariadb] Remove useless retries on conflicts during cm update
The retries were originally added at [0] but they never worked.
To avoid a race condition, we pass the fixed revision that we expect to see
during the patch into safe_update_configmap. We can't organize retries
inside the function, as that would require changing the original revision, which
may happen only at the upper layer. Revert the patch partially.

[0] https://review.opendev.org/c/openstack/openstack-helm-infra/+/788886

Change-Id: I81850d5e534a3cfb3c4993275757c244caec8be9
2024-11-13 18:37:39 +00:00
Vasyl Saienko
ef707fa3f3 [mariadb] Stop running threads on sigkill
Stop the cluster monitor and leader election threads on sigkill.
This allows all threads from start.py to terminate and the process to
exit earlier than terminationGracePeriod in the statefulset.
Drop the preStop hook, which is redundant with the stop_mysqld() function call.

Change-Id: Ibc4b7604f00b1c5b3a398370dafed4d19929fd7d
2024-11-13 18:36:49 +00:00
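One common way to stop background threads from a signal handler is a shared `threading.Event`, sketched below. This is not the code from start.py: the function names are hypothetical, and because SIGKILL itself cannot be caught by a process, the sketch assumes the handler is registered for SIGTERM.

```python
import signal
import threading

stop_event = threading.Event()


def monitor_cluster(interval=0.05):
    # Event.wait() returns True as soon as the event is set, ending the loop.
    while not stop_event.wait(interval):
        pass  # periodic work would go here


def handle_stop(signum, frame):
    # Signal handler: ask every worker loop to finish.
    stop_event.set()


def install_handler():
    # Must be called from the main thread.
    signal.signal(signal.SIGTERM, handle_stop)


def start_workers():
    names = ("monitor", "leader-election")
    threads = [threading.Thread(target=monitor_cluster, name=n) for n in names]
    for t in threads:
        t.start()
    return threads
```

Once the event is set, each loop exits on its next wake-up, so the process can finish well before the grace period expires.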
Kaloyan Kotlarski
07ae16493e ovn: fix resources
Change-Id: I2b0c70550379dd214bc67869a7c74518b7004c7f
2024-11-13 15:30:41 +08:00
Vasyl Saienko
6d7fba0c43 [mariadb] Improve python3 compatibility
Decode byte sequence into string before printing log.

Change-Id: Icd61a1373f5c62afda0558dfadc2add9138cff6d
2024-11-11 11:37:26 +00:00
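A short sketch of why the decode step matters in Python 3: formatting a `bytes` object directly produces its `b'...'` representation rather than the text. The helper name `to_log_text` and the choice of UTF-8 with replacement of invalid bytes are assumptions.

```python
def to_log_text(raw):
    """Return text suitable for logging, decoding bytes when needed."""
    if isinstance(raw, bytes):
        # errors="replace" keeps logging from failing on invalid bytes.
        return raw.decode("utf-8", errors="replace")
    return raw


raw_output = b"WSREP: ready\n"
undecoded = str(raw_output)           # "b'WSREP: ready\\n'" with the b prefix
decoded = to_log_text(raw_output)     # "WSREP: ready\n"
```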
Vasyl Saienko
4aaa5fc778 [mariadb] Improve leader election on cold start
During cold start we pick the leader node by seqno. When a node is running
or was shut down ungracefully, seqno may stay at -1 until a periodic task
updates it based on the local grastate.dat or detects the latest seqno via
wsrep_recover. This patch adds an infinite waiter to the leader election
function that waits until all nodes report a seqno different from -1, to make
sure we detect the leader based on correct data.

Change-Id: Id042f6f4c915b21b905bde4d57d40e159d924772
2024-11-11 11:37:19 +00:00
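The waiting logic described above can be sketched like this. The function names, the dict of node name to seqno, and the rule of picking the highest seqno are assumptions made for illustration; the chart's actual election code may differ.

```python
import time


def wait_for_seqnos(get_seqnos, poll_interval=1.0, sleep=time.sleep):
    """Poll until every node reports a seqno other than -1 (no timeout)."""
    while True:
        seqnos = get_seqnos()
        if seqnos and all(value != -1 for value in seqnos.values()):
            return seqnos
        sleep(poll_interval)


def pick_leader(seqnos):
    # Assumed rule: the node with the highest seqno has the latest data.
    return max(seqnos, key=seqnos.get)


# Usage with canned reports: node "a" is still at -1 on the first poll.
reports = iter([{"a": -1, "b": 5}, {"a": 7, "b": 5}])
final = wait_for_seqnos(lambda: next(reports), sleep=lambda _: None)
leader = pick_leader(final)
```

In this usage example the first report is rejected because node "a" is still at -1, and the election only runs on the second report.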
Vasyl Saienko
4ee7ebda43 [mysql] Use constant for mysqld binary name
Change-Id: I996141242dac9978283e5d2086579c75d120ed8b
2024-11-11 09:45:03 +00:00
SPEARS, DUSTIN (ds443n)
09f1ccc64c Update ceph-osd to be able to use tini
Sometimes the pod fails to terminate correctly,
leaving zombie processes. Add option to use tini
to handle processes correctly. Additionally update
log-tail script to handle sigterm and sigint.

Change-Id: I96af2f3bef5f6c48858f1248ba85abdf7740279c
2024-11-06 09:51:55 -05:00
Zuul
f706586123 Merge "Mariadb chart updates" 2024-11-02 08:09:57 +00:00
Zuul
cf4611ffb5 Merge "Update grafana helm test" 2024-11-02 01:42:58 +00:00
Zuul
f10f516d2b Merge "ovn: make gateway label configurable" 2024-11-02 01:18:53 +00:00
Sergiy Markin
f630c152e8 Mariadb chart updates
This PS is for improvements for wait_for_cluster mariadb job.

Change-Id: I46de32243e3aaa98b7e3e8c132a84d7b65d657cc
2024-11-01 22:34:58 +00:00
SPEARS, DUSTIN (ds443n)
4da1347ee6 Update grafana helm test
Adds setting XDG_CONFIG_HOME and XDG_CACHE_HOME to
a writable path.

Change-Id: Ieb2a6ca587ecefe24d04392970c415409c8f5e1b
2024-11-01 09:38:17 -04:00
SPEARS, DUSTIN (ds443n)
42940f326a Update helm test for Elasticsearch
Removing the use of python during helm test script as it
is no longer in the image.

Change-Id: Id8feff1bee8c3f2dd277307d176f6a535c5f7ba6
2024-10-24 15:38:43 -04:00
okozachenko1203
5aed17d744 ovn: make gateway label configurable
Change-Id: I88ab77e61e9766e12eb3aff899e0d6dd24a8d3c0
2024-10-25 02:55:01 +11:00
Zuul
aad6fc07d7 Merge "Add 2024.2 overrides" 2024-10-16 20:16:32 +00:00
Zuul
25cbdf250c Merge "[helm-toolkit] Fix db-init and db-drop scripts" 2024-10-13 04:34:20 +00:00
Vladimir Kozhukalov
81da0879c3 [memcached] Fix statefulset spec format
Recently we switched from Deployment to Statefulset
to make it possible to work with memcached instances
directly w/o load balancer. The strategy field is not
valid for statefulsets, so here we remove it.

Change-Id: I52db7dd4563639a55c12850147cf256cec8b1ee4
2024-10-10 12:58:52 -05:00
Vladimir Kozhukalov
489b87a975 Add 2024.2 overrides
Change-Id: Ic43f14e212f4de6616b4255bdd5ce562c5bcf9b0
2024-10-10 13:52:24 +00:00
Vladimir Kozhukalov
ab2cfc1d64 [helm-toolkit] Fix db-init and db-drop scripts
Wrap queries into sqlalchemy.text before executing them.

Change-Id: I783bd05bdd529c73825311515e1390f3cc077c4f
2024-10-10 08:49:08 -05:00
Zuul
5d81f0f5a4 Merge "Add app.kubernetes.io/name label to openstack pods" 2024-10-07 18:20:24 +00:00
Zuul
42b3ccc564 Merge "[mariadb] Add cluster wait job" 2024-10-03 21:31:06 +00:00
jchialun
f9458d51b8 Add app.kubernetes.io/name label to openstack pods
This commit adds the recommended Kubernetes name label to pod definitions.

This label is used by FluxCD operators to correctly look for the
status of every pod.

Change-Id: I866f1dfdb3ca8379682e090aca4c889d81579e5a
Signed-off-by: Johnny Chia <johnny.chialung@windriver.com>
2024-10-03 11:40:58 -05:00