8850 Commits

Author SHA1 Message Date
Alex Schultz
2dfd0ea8ca Flatten Aodh service configuration
This change combines the previous puppet and docker files into a single
file that performs the docker service installation and configuration.
With this patch the baremetal version of aodh services have been
removed.

Co-Authored-By: Juan Antonio Osorio Robles <jaosorior@redhat.com>
Depends-On: https://review.rdoproject.org/r/#/c/16994/
Change-Id: I39645aff0365218d4b841ed0d9c964b3622f143a
Related-Blueprint: services-yaml-flattening
2018-12-06 15:07:05 +00:00
Alex Schultz
623924972f Update yaml-validate for deployment/ folders
Since we're looking at flattening the services into a deployment/
folder, we need to update the validation script to also handle this
directory structure. Additionally this change updates the service name
validation to ensure that the service name in matches the start of the
filename itself.

Change-Id: Ibb140a38b69a8780adf69362e0f437b3426f360d
Related-Blueprint: service-yaml-flattening
2018-12-06 08:06:10 -07:00
Zuul
79ce8a504b Merge "Use InternalApi network for openshift_master_cluster_hostname" 2018-12-04 17:36:34 +00:00
Zuul
0bfcc0fc73 Merge "Set balance to source for openshift_router endpoints" 2018-12-04 17:36:31 +00:00
Zuul
d73dedd4bc Merge "Use new ODL diagstatus heath check REST URL" 2018-12-04 17:36:29 +00:00
Zuul
2557bfdf14 Merge "Create clouds.yaml for the undercloud" 2018-12-04 16:30:30 +00:00
Zuul
aee048d3bb Merge "Add a gateway IP to the Management net in network_data.yaml" 2018-12-04 14:03:09 +00:00
Zuul
a80bb71642 Merge "ctlplane pre-alloc IPs - deprecated/non-deprecated role name" 2018-12-04 14:03:07 +00:00
Zuul
d062b3c6da Merge "Fix pacemaker tagging race condition" 2018-12-04 04:32:39 +00:00
Zuul
b01b1a7f72 Merge "Change step to start nova placement and make compute wait for it" 2018-12-03 02:33:06 +00:00
Zuul
e8420d7240 Merge "upgrade: remove Docker containers now managed by Podman" 2018-12-02 20:59:15 +00:00
Zuul
f071dd4c94 Merge "Add Storage network to IronicConductor role" 2018-12-02 18:05:36 +00:00
Zuul
04a94549f7 Merge "Fix ServiceData in docker/services/aodh-api.yaml" 2018-12-02 00:29:31 +00:00
Zuul
5ebb679074 Merge "Initial conversion of scenario001 to standalone" 2018-12-01 19:31:27 +00:00
Zuul
5998717379 Merge "Adds ci/environments/scenario004-standalone.yaml for new job" 2018-12-01 17:52:10 +00:00
Zuul
e5919ec448 Merge "Default NtpPool to an empty list" 2018-12-01 13:04:09 +00:00
Zuul
692ffe51c8 Merge "Add a new standalone scenario003 env file" 2018-12-01 12:35:26 +00:00
Marius Cornea
50de0269d0 Set balance to source for openshift_router endpoints
Currently the haproxy loadbalancer set by TripleO uses balance
roundrobin. According to this Openshift HA configuration example[1]
it uses balance source for both the API and the router backends.
This way we ensure that all traffic from a user’s session goes to
the same node(sticky sessions).

[1] https://blog.openshift.com/keeping-both-of-your-openshift-container-platforms-highly-available-with-keepalived-and-haproxy/

Change-Id: I0be466f7d30748fc46fc69f098bf5aa7eb980aa2
2018-11-30 13:57:32 -05:00
Marius Cornea
bd51cf5f44 Use InternalApi network for openshift_master_cluster_hostname
According to the inventory examples[1] openshift_master_cluster_hostname
points to an internal hostname/address set on the loadbalancer while
openshift_master_cluster_public_hostname points to the external.
This change sets openshift_master_cluster_hostname to use the InternalApi
network instead of the External network as it is at this moment.

[1] https://docs.openshift.com/container-platform/3.11/install/example_inventories.html

Change-Id: I9efab5b07682efd6b03da433801d636e7d324619
2018-11-30 13:25:03 -05:00
Daniel Farrell
da87906ac3 Use new ODL diagstatus heath check REST URL
OpenDaylight's Infrautils project has a new, recommended method for
checking when ODL is up and ready. Use this new diagstatus ODL NB REST
API endpoint vs the old netvirt:1 endpoint.

ODL Jira that tracked adding diagstatus REST API:

https://jira.opendaylight.org/browse/INFRAUTILS-33

RH BZ tracking moving to diagstatus:

https://bugzilla.redhat.com/show_bug.cgi?id=1642270

Change-Id: I44dc5ba7680a9c5db2d6070e813d9b0e31d6e811
Signed-off-by: Daniel Farrell <dfarrell@redhat.com>
2018-11-30 01:34:39 +00:00
Juan Antonio Osorio Robles
126991516a Default NtpPool to an empty list
This way we have the same default as we always have, with only the ntp
server set.

Change-Id: I8a0d8b2f333154e2e881ca788689c2d9f564b224
2018-11-29 20:19:33 +02:00
Jiri Stransky
3f57d7380d Fix pacemaker tagging race condition
Change I81bc48b53068c3a5ed90266a4fd3e62bfb017835 moved image fetching
and tagging for pacemaker-managed services from step 1 to step 2. This
is also a step when the services are started, which probably
introduced a race condition for environments where pacemaker cluster
consists of more than one machine.

During the deployment you can get a lot of pcmk failures like:

failed to pull image 192.168.24.1:8787/tripleomaster/centos-binary-mariadb:pcmklatest

This only happens on non-bootstrap nodes. On bootstrap node the order
is still correct, first download and tag image, and then start the
pcmk resources. However, if non-bootstrap nodes are slower with
downloading and tagging, pacemaker there might start the resources
before the images are tagged (as the starting of resources is
controlled globally from bootstrap node).

Change-Id: Id669cc9a296a8366c7c80a5ee509bdb964b62a04
Closes-Bug: #1805826
2018-11-29 12:02:21 +01:00
Zuul
b6b4201be1 Merge "Add ctlplane interface routes" 2018-11-29 10:45:53 +00:00
Zuul
225c70be40 Merge "Remove unnecessary logs due to misconfiguration from pacemaker" 2018-11-29 10:45:51 +00:00
Zuul
beccd1076f Merge "mysql: do not overwrite password file during docker-puppet" 2018-11-28 11:17:41 +00:00
Zuul
ba86e3944b Merge "Don't set boot_option=local on baremetal flavors" 2018-11-28 04:45:15 +00:00
Harald Jensås
7a1cd822f6 ctlplane pre-alloc IPs - deprecated/non-deprecated role name
By default, Compute role template set's the deprecated_param_ips
parameter in roles data. This forces the use of the deprecated
names in paramer_defaults when using predictable IPs for the
ctlplane network.

To allow the user to either use the deprecated role name, or the
non deprecated role name in parameters defaults extend the
ctlplane_fixed_ip_set contition to use or logic to test for data
in either the deprecated name parameter or the new parameter.

In the server resource use yaql to pick the first element that
is not empty. The non-deprecated parameter name is prioritiezed.

Change-Id: Iedc65064c5efaa618c3d54df10bf09296829efd2
Closes-Bug: #1805482
2018-11-27 20:20:50 +01:00
Zuul
22bf2b8364 Merge "Upgrades: Ensure idempotency of pacemaker services" 2018-11-27 15:58:24 +00:00
Harald Jensås
eaa8f8c2e9 Add Storage network to IronicConductor role
When Ironic uses the 'direct' deploy interface it requires
access to swift. To access swift it needs the storage
network.

Change-Id: Ie49b961bb276dff0e5afbf82b450caa57d17f6ff
2018-11-27 07:27:09 +00:00
Zuul
c464b3d206 Merge "Added Dell EMC SC multipath support" 2018-11-27 05:42:25 +00:00
Zuul
2b9e5d412d Merge "Stop upgrade if a task on one node fails" 2018-11-27 05:42:23 +00:00
Emilien Macchi
47b75398d1 upgrade: remove Docker containers now managed by Podman
For all containers where restart=always is configured and that are not
managed by Pacemaker (this part will be handled later), we remove these
containers at step 1 of post_upgrade_tasks.

Change-Id: Id446dbf7b0a18bd1d4539856e6709d35c7cfa0f0
2018-11-27 00:20:31 +00:00
Harald Jensås
055e252872 Add ctlplane interface routes
For the isolated networks we use the subnets host_routes
to set and get the routes for overcloud node interfaces.

This change add's this to the ctlplane interface.

Partial: blueprint tripleo-routed-networks-templates
Change-Id: Id4cf0cc17bc331ae27f8d0ef8f285050330b7be0
2018-11-26 17:49:31 +01:00
Martin Schuppert
cc61ff93ec Change step to start nova placement and make compute wait for it
There is a deployment race where nova-placement fails to start if
the nova api db migration have not finished before starting it.
We start nova placement early to make sure it is up before the
nova-compute services get started. Since in HA scenario there is
no sync in between the nodes on the current worked deployment step
we might have the situation that the placement service gets started
on C1/2 when the nova api db sync is not yet finished on C0.

We have two possibilities:
1) start placement later and verify that nova-computes recover correct
2) verify that db migration on nova_api db finished before start nova-
placement on the controllers

2) which was addressed via https://review.openstack.org/610966 showed
problems:
a) the docker/podman container failed to start with some file not found
error, therefore this was reverted in https://review.openstack.org/619607

b) when the scrip were running on different controllers at the same
time, the way how nova's db_version() is implemented has issues, which
is being worked on in https://review.openstack.org/619622

This patch addresses 1) and moves placement service start to step_4
and adds an additional task on the computes to wait until the placement
service is up.

Closes-Bug: #1784155

Change-Id: Ifb5ffc4b25f5ca266560bc0ac96c73071ebd1c9f
2018-11-26 17:44:47 +01:00
Zuul
b8206cc0f5 Merge "Fluentd deprecation releasenote" 2018-11-26 16:14:27 +00:00
Zuul
687c3cd90c Merge "Add customized libvirt-guests unit file to properly shutdown instances" 2018-11-26 15:03:50 +00:00
Harald Jensås
66142908cf Fix ServiceData in docker/services/aodh-api.yaml
The puppet aodh-api.yaml service uses the puppet
apache service. The apache server uses the cidr
map in ServiceData.

The docker service did not pass the ServiceData
to the puppet service template. The result is
that the properties resolved to ''.

Change-Id: I736e0fa4191fa130f882b09eb87256c62ac69143
2018-11-26 13:49:31 +00:00
Marios Andreou
386adcae2c Adds ci/environments/scenario004-standalone.yaml for new job
Adds environment file to be used in featureset override by the
standalone scenario4 job. Tracked by ci squad at [1]. Used by
the new job definition in [2]


[1] https://tree.taiga.io/project/tripleo-ci-board/us/339
[2] https://review.openstack.org/#/c/619520/
Change-Id: I3e971f5aa49fafaf82352011fcf3c9fc54cde21a
2018-11-26 08:06:54 +00:00
Zuul
0eeb0d48da Merge "Set Swift workers to 2" 2018-11-23 18:56:02 +00:00
Zuul
6d2200d5b8 Merge "Ensure directory exists before trying to use it in a container" 2018-11-23 17:14:23 +00:00
Zuul
4030103ca0 Merge "Remove validation part of the online database migration pre-upgrade." 2018-11-23 13:55:11 +00:00
Zuul
f407109bff Merge "Adjust haproxy-public-tls-inject.yaml bootstrap variable" 2018-11-23 13:55:06 +00:00
Zuul
cb86cc0a33 Merge "Python 3 compatibility: fix has_key" 2018-11-23 12:28:29 +00:00
Zuul
09f887391d Merge "Revert "Verify nova api migration finished before start placement"" 2018-11-23 12:28:22 +00:00
Steven Hardy
fd088e2fe7 Adjust haproxy-public-tls-inject.yaml bootstrap variable
In RDO CI we're seeing this undefined, but haproxy_short_bootstrap_node_name
is defined, which proves https://review.openstack.org/#/c/605046/ is included
and working.

The root cause is that the haproxy_public_tls_inject_service is actually
created via the haproxy template as a nested stack, so we need to use
haproxy_short_bootstrap_node_name instead

Change-Id: I870825140b8947a1845307b5bec1bcff387c15c0
Closes-Bug: #1804433
2018-11-23 06:26:46 +00:00
Zuul
82a79f3235 Merge "Run docker-puppet.py in debug mode." 2018-11-22 22:11:21 +00:00
Zuul
6ae97d1587 Merge "Sensu Deprecation note" 2018-11-22 21:49:09 +00:00
Zuul
444657edd2 Merge "Cleanup nova metadata port in nova api service" 2018-11-22 21:48:08 +00:00
Rafael Folco
244f6216fd Add a new standalone scenario003 env file
Convert scenario003 env to standalone.

Change-Id: Id371e895c67f215f0a923c850cac8c667117e609
Story: https://tree.taiga.io/project/tripleo-ci-board/us/338
2018-11-22 18:10:06 +00:00
Zuul
0bfeeccfe6 Merge "Allow enabling routed networks on undercloud update|upgrade" 2018-11-22 17:12:50 +00:00