Rehome: Increase timeout for certs to be installed

This commit addresses 2 related bugs, in the sense that they
are related to ansible tasks that wait for certificates
to be installed.

Task 'Check admin-ep-cert.pem updated' depends on cert-mon to
install the certificate. Cert-mon may fail and reattempt after
10 minutes. This change increases that timeout for this task
to be larger than that. I'm also decreasing the delay in a
half as it is a quick stat operation, to allow it to be
detected quickly in most cases where the first cert-mon attempt
works.

Task 'Verify if there are 250.001 (config out-of-date) alarms'
is dependent on puppet to apply a config change to install
the certificate and sysinv to clear the alarm. When I
reproduced this issue it took 1 minute longer for the alarm
to clear. This change increases the timeout of the task in
about 50%.

Test Plan:

PASS: Rehome a subcloud with 1200 ms latency injected and
      50% cap on CPU capacity.

Closes-Bug: 2054462
Closes-Bug: 2054463
Change-Id: I017fab0ccb13629c63a7cd855470f0a777f06e22
Signed-off-by: Rei Oliveira <Reinildes.JoseMateusOliveira@windriver.com>
This commit is contained in:
Rei Oliveira 2024-02-20 15:57:12 -03:00
parent e1f3cf6c7d
commit 6335d7d491
2 changed files with 3 additions and 3 deletions

View File

@ -44,8 +44,8 @@
command: stat -c %Y "/etc/ssl/private/{{ sc_adminep_cert }}"
register: adminep_cert_modify_time
until: adminep_cert_modify_time.stdout|int > (start_time.stdout|int + 1)
retries: 12
delay: 30
retries: 63
delay: 10
- name: Check haproxy service restarted
shell: >-

View File

@ -81,7 +81,7 @@
source /etc/platform/openrc;
fm alarm-list --query alarm_id=250.001
register: alarm_subcloud
retries: 10
retries: 15
delay: 20
until: alarm_subcloud.stdout == ""
failed_when: false