Extend PXE boot retry timeout for RAX hosts

When extending the timeouts for jobs to execute with-in,
we've observed a case where RAX hosts are cutting off at
the time limit of 900 seconds (as being asserted by another
change set). This is both good and bad. We know the timeout
feature works, but the agent was not quite online yet.

As such, we should also auto-extend base retry timeouts
so there is hope for the job to complete.

Change-Id: I8efa3a52188de558a7964d1daafd2225e102e251
This commit is contained in:
Julia Kreger 2020-07-22 10:35:38 -07:00
parent b5ae75a406
commit 67e51af6d5
1 changed files with 7 additions and 0 deletions

View File

@ -173,6 +173,13 @@ if [[ "$hostdomain" =~ "rax" ]]; then
# these hosts for jobs to complete without issues.
new_timeout=$(echo "$IRONIC_TEMPEST_BUILD_TIMEOUT * 1.5 / 1" | bc)
IRONIC_TEMPEST_BUILD_TIMEOUT=$new_timeout
if [ -n "$IRONIC_PXE_BOOT_RETRY_TIMEOUT" ]; then
new_timeout=$(echo "$IRONIC_PXE_BOOT_RETRY_TIMEOUT * 1.5 / 1" | bc)
IRONIC_PXE_BOOT_RETRY_TIMEOUT=$new_timeout
fi
# TODO(TheJulia): If we have to do magically extend timeouts again,
# we should make a helper method...
fi
IRONIC_DEFAULT_API_VERSION=${IRONIC_DEFAULT_API_VERSION:-}