765 Commits

Author SHA1 Message Date
Jenkins
ed17e83aae Merge "Add ubuntu-mariadb to configure_mirror.sh" 2017-03-06 10:00:16 +00:00
Jenkins
42c2803bfc Merge "Clean up unused nodepool osic-cloud1-disk provider" 2017-03-04 13:38:34 +00:00
Jenkins
b76dfb5194 Merge "Drop osic-cloud1-disk flavor from nodepool conf" 2017-03-04 01:39:55 +00:00
Jenkins
257618fddf Merge "Revert "Temporarily pull osic from the pool."" 2017-03-03 15:42:47 +00:00
Sean Dague
952fa10522 Revert "Temporarily pull osic from the pool."
This reverts commit b7de2e2511f8d9739ecc80be1c08d1b0e9c73c7e.

Change-Id: Ic7666e11e6b27b8dd4d8b27301930f88f8795c98
2017-03-03 15:14:28 +00:00
Jeremy Stanley
2be4ca1e0d Merge "Temporarily pull osic from the pool." 2017-03-03 14:55:00 +00:00
Sean Dague
b7de2e2511 Temporarily pull osic from the pool.
OSIC appears to be unable to create nodes right now; pull it from
the pool to let patch merges flow again.

Change-Id: Ia14b53b9e0ccd3b2e0ba97b0b6d156f4dea96fe4
2017-03-03 09:51:52 -05:00
Jenkins
98ac038980 Merge "openstack-repos: fail on missing project list" 2017-03-03 08:49:13 +00:00
Jeremy Stanley
29511810cc Clean up unused nodepool osic-cloud1-disk provider
Now that this empty provider is no longer in use, clean up the
vestigial entry for it.

Change-Id: Ib91db38b35af0d01e7a0d0d7e7f3fa0dd612e4d4
2017-03-02 23:51:04 +00:00
Jeremy Stanley
1792663836 Drop osic-cloud1-disk flavor from nodepool conf
As part of resource restructuring within OSIC, we're ceasing use of
512 nodes with their "disk" flavor, and continuing to use 256 each
of their "s3500" and "s3700" SSD-backed flavors. Independent of
this, they will be adjusting these flavors from 8vCPU to 4vCPU so
that we can bring our peak utilization in line with their current
availability.

This is expected to be a temporary measure, until they're able to
complete deployment of a dedicated 35-host OpenStack environment
where we can start adjusting overcommit ratios (hopefully driving
our cap back up again somewhat).

Change-Id: I0244362a61264bb36e5c1e1562684a0136944e49
2017-03-02 23:37:17 +00:00
Paul Belanger
062fa627f6 Add ubuntu-mariadb to configure_mirror.sh
Change-Id: I20d9091f6540dd8074820963132233cb1d2cd1c3
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-03-02 14:13:31 -05:00
Jenkins
91c1a6afcb Merge "Update nodepool elements for openSUSE Leap 42.2" 2017-03-01 08:28:13 +00:00
Ian Wienand
ce0a039508 openstack-repos: fail on missing project list
If you specify a CUSTOM_PROJECTS_LIST_URL that doesn't exist you get a
message but the script continues.  Since you clearly meant to specify
something we should bail so the user can correct the problem.

Additionally, add some documentation to the build-image.sh script to
describe how to use this to cut down build times considerably for
initial testing purposes.

Change-Id: I02d1cd341f793471b1ad4374e98031ea53db7f49
2017-03-01 17:17:56 +11:00
Paul Belanger
bf29a5b406 Add nodepool-id to infracloud-chocolate
Because we want to share infracloud-chocolate between nodepool.o.o and
nl01.o.o, we need to use the new 'nodepool-id' for the provider.
Otherwise, nodepool.o.o will proceed to delete the instances that
nl01.o.o launches.

Change-Id: Ic5d1f0ebf7eb92c841ec09502e5134d89c8f6e97
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-02-28 12:35:43 -05:00
Dirk Mueller
f694c91bcb Update nodepool elements for openSUSE Leap 42.2
stackviz: In general newer openSUSE distributions switched to nodejs6 LTS,
but Leap 42.2 is still on 4.x.

zuul-worker: No need for libselinux enablement in openSUSE

puppet: clean up the puppet repo that was added in the install_puppet.sh
code part

Change-Id: If7bf3c799ebb74aeb36c4b1b607b0454993f0ed6
2017-02-28 10:54:53 +01:00
Paul Belanger
4e4f75f010 Bring infracloud-chocolate back online
Now that the Pike PTG is over, bring infracloud-chocolate back online
for production. We do this to avoid nodepool-builder issues (we are
not deleting diskimages from disk).

Moving forward, we'll need to update nodepool to properly support 2
nodepool launchers using a shared tenant.

Change-Id: I7fd90939db9fcb0edc1afc4f62248801f3c20ca5
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-02-27 08:18:18 -05:00
Paul Belanger
6f3bb9d7f5 Add nl01.o.o support for configure_mirror.sh
Because we are using the 'zuul' user to SSH into our zuul workers, we
need to add it to configure_mirror.sh and still support the jenkins
user.

Change-Id: I79eff670f56d06c49a16e1330a73889e7b8482f1
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-02-22 10:07:21 -05:00
David Shrewsbury
f5e34d6106 Bump min-ready to 2 on zuulv3 nodepool
Change-Id: I8dbff85761aee596228245519826e3d7a5537c33
2017-02-21 11:56:52 -05:00
Paul Belanger
36af7299d5 Fixes for feature/zuulv3 testing of nl01.o.o
Disable our ready-scripts, as we have a permission issue with the zuul
user.

Replace jenkins user with zuul user.

Switch our infracloud provider back to the original, otherwise we
cannot find an image to launch.

Change-Id: Ib3d6cf45308be3efdf2aa6c855b26c85b1db3804
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-02-21 08:46:14 -05:00
Jenkins
29ed3b5f8f Merge "Disable infracloud-chocolate for zuulv3 dev 2/2" 2017-02-20 18:03:30 +00:00
Jenkins
c09bafd29b Merge "Disable infracloud-chocolate for zuulv3 dev 1/2" 2017-02-20 16:20:41 +00:00
Joshua Hesketh
ccdbc46ab6 Disable infracloud-chocolate for zuulv3 dev 2/2
We'll be using chocolate for nodepool+zuulv3 dev during ptg.

We don't want to just set max-servers to 0 because nodepool will
still attempt to manage and clean up servers and get in the way
of our v3 deployment.

Change-Id: If090190b541922ece7e67fa9107e521c050fa484
2017-02-20 11:01:12 -05:00
Joshua Hesketh
aefb74d7d5 Disable infracloud-chocolate for zuulv3 dev 1/2
Start by cleaning up running servers.

Change-Id: I6bb6c65a0c9d31bb3ad05b6acb78e7d158c77746
2017-02-20 11:00:02 -05:00
Paul Belanger
e138df9951 Initial commit of nodepoolv3.yaml file
This is our first attempt at using nl01.o.o.

Change-Id: I5ac3634331cc732e8e69a3f8f79f4afa77e7ee50
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-02-20 10:35:14 -05:00
Andreas Jaeger
963fb25c30 Revert "Revert "Revert "Temporarily disable osic-cloud1"""
Let's try again enabling OSIC.

This reverts commit e7e4e5ec74118c984fa7d2af8f56770d9390c305.

Change-Id: I9b4f22737e53470e7d1108d94c79f8176feeeeb0
2017-02-17 07:27:01 +00:00
Paul Belanger
e7e4e5ec74 Revert "Revert "Temporarily disable osic-cloud1""
This reverts commit 96bb76406a82d2f4fe8654f18e009643ae74b1eb.

Change-Id: I32666a54b4aa72d3aadd2ed9a596191bcb0a972f
2017-02-16 23:33:19 +00:00
Paul Belanger
96bb76406a Revert "Temporarily disable osic-cloud1"
osic-cloud1 is back to normal.

This reverts commit bc22cb207d5e32895c3a76098be8ef3f525962a0.

Change-Id: I48ed5449431c9d251722d8c1f5152b0c9c8fad4a
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-02-16 12:47:26 -05:00
Paul Belanger
bc22cb207d Temporarily disable osic-cloud1
We are seeing a large increase in failures with osic-cloud1. For the
moment, let's disable the cloud so our other providers can absorb the
load.

Once we know the issue (looks to be networking) has been resolved, we
can revert this patch.

Change-Id: If875faa3ac7aa71020ed16f28ee702fc00d67ce2
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-02-16 12:11:46 -05:00
Paul Belanger
4d8a14e494 Revert "Disable tripleo-test-cloud-rh1 cloud"
This reverts commit e5ae018c7b7a812df8cf3e5c98421860c824e11e.

Change-Id: I7e0f0047a7df305727c597f54152e74a151bd790
2017-02-15 21:23:21 +00:00
Paul Belanger
e5ae018c7b Disable tripleo-test-cloud-rh1 cloud
Currently, we are seeing a large influx of data being uploaded to
logs.o.o. As a result, we are decreasing our retention policy on
logs.o.o but cannot stay ahead of the curve.

Until we have reclaimed HDD space, disable tripleo-test-cloud-rh1.

Change-Id: I4d7dc07552662ddd72aef74cef9650e6e06f65d3
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-02-15 15:56:01 -05:00
Emilien Macchi
39cf700ab1 nodepool: set 1 for min-ready instead of 0
Switch min-ready from 0 to 1 for 4-nodes environment in nodepool.

Change-Id: Ida6a1fa180f345cc2d5da0b3fc96bbf0601fd0cf
2017-01-31 13:00:55 -05:00
Ben Nemec
27377ab163 Bump rh1 max-servers to 70
The move to 65 has gone smoothly, and we should have capacity for
even more jobs.  We could probably go straight to 75 (which is the
most we should run with the current te-broker setup), but larger
jumps in the number of testenvs have caused problems in the past so
I'd prefer to stick with 5 at a time.  Note that 5 extra testenvs
will use more than 150 GB of memory, so this increase is not as
small as it looks.

Change-Id: If83b507ba6ca97fc850d3a9caf570c4950949914
2017-01-24 19:55:12 +00:00
Jenkins
1e09fcbb20 Merge "Drop template from diskimage names" 2017-01-23 15:40:38 +00:00
Paul Belanger
736cee2421 Remove fedora-24 from openstack-infra
Now that we have migrated all our jobs to fedora-25, we can remove
fedora-24 from project-config.

Change-Id: I3a58c785c12f779060a6dca5cd40ddcc46429971
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-01-22 12:02:33 -05:00
Ben Nemec
29e0ecf8ce Bump max-servers to 65 in rh1
We've made a number of changes recently in tripleo-ci so we're
making more efficient use of the rh1 cloud and should now have the
capacity to run more jobs at once.

Change-Id: I99cb92d7ada4b89bcac62dfc24846c04b03eb9f5
2017-01-19 15:22:11 +00:00
Paul Belanger
b50ccd90b1 Revert "Reduce our 3-node labels to min-ready 0"
When we restart zuul, gearman information about these nodes is lost.
Revert for now, and hopefully address this in zuulv3.

This reverts commit 23495050235dd80331e256782f6db6a85e577e22.

Change-Id: I32d331f065be041cd11c71c11702be93b3d42831
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-01-15 10:14:43 -05:00
Kevin Carter (cloudnull)
2417a543c2 Revert "Disable the OSIC cloud for the time being"
The DNS issues have been resolved.

This reverts commit 2e1a256b0c9aceabe008d50cf12cfa9cbb1262d4.

Change-Id: I7b6b2f4ad61f99c0d4b3748dfa88dcb9708357e4
2017-01-15 02:00:33 +00:00
Kevin Carter
2e1a256b0c Disable the OSIC cloud for the time being
The people who manage our DNS broke it. Sadly we need to disable the
environment until we can get whatever they did undone.

Change-Id: I2427c84e7f0d980d6fb8aeba5f25be7c693319b2
Signed-off-by: Kevin Carter <kevin.carter@rackspace.com>
2017-01-13 21:32:46 -06:00
Paul Belanger
25968cc69a Add support for fedora (25+) AFS mirrors
Now that fedora-25 is online, we can start using the new AFS mirror for
fedora. Keep in mind, we didn't mirror fedora-24, since fedora-25 will
be replacing it.

Once fedora-24 has been removed from nodepool.yaml, we can remove our
conditional check.

Change-Id: I1bd510c6a568b71b604c27d6cae16d2654063d0c
Depends-On: I762e55fb7fe0c5ebfb7810545ddb4d5ccfedf2ff
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-01-13 16:11:37 -05:00
Paul Belanger
56fc1a6ed3 Drop template from diskimage names
This is a cosmetic change for nodepool. Diskimages will now be uploaded
as ubuntu-xenial-1484177984 instead of
template-ubuntu-xenial-1484177984.

Change-Id: Ib4ced36ff8ae5a7e99ce5ecedac16b2dd36fa6a3
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-01-13 13:39:48 -05:00
Paul Belanger
dc0d1b2e15 Start building fedora-25 DIBs
Bring fedora-25 online with a 1 min-ready node. This is needed to
start the migration from fedora-24 to fedora-25.

Change-Id: I99b852d3f440814fcf2f2a7518c413cdeeeb4106
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-01-12 17:25:44 -05:00
Paul Belanger
4723032ac0 Remove script-dir from nodepool
Switch back to master branch, until we are ready to run zuulv3 branch
again in production.

Change-Id: Id93a3c64282a31545a66dc1da740596d3db3909f
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-01-12 17:24:20 -05:00
Paul Belanger
2349505023 Reduce our 3-node labels to min-ready 0
While we have jobs using these labels today, they are still used very
little. Rather than having 6 nodes idle for days at a time, only
launch them when jobs need them.

On average it takes about 30 seconds for nodes to come online these
days.

Change-Id: I8e74b62f3556823e17468a4e2d22d42490fc44e6
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-01-09 22:01:10 +01:00
Paul Belanger
4ba3c83b34 Revert "Disable chocolate until mirror is up again"
Our AFS mirror is back online. It looks like the compute server lost
network access on Jan 7; the solution was to reboot the server in
ironic.

This reverts commit 06ebd8a99e56ca586059a42f738b6b9fef921922.

Change-Id: I45587358e40d15bd8bbd660b0093096bdfb95c0c
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
2017-01-09 12:29:05 -05:00
Andreas Jaeger
06ebd8a99e Disable chocolate until mirror is up again
The region is currently failing tests since the mirror is down. Disable
cloud until everything is fixed.

Change-Id: I22a619e62067b47900bb6db7d01eedfdceaad3cf
2017-01-09 15:52:39 +01:00
Jenkins
192004a827 Merge "Introduce TripleO multinode (x4) experimental jobs with scenario005" 2017-01-09 11:29:42 +00:00
Monty Taylor
6ca6c2bd7c Update nodepool config to remove unneeded things
In an effort to be a good config example, remove config settings that
are no longer relevant.

Change-Id: I6cc3a18dc743f67f215b4d6307307d69d83a05f0
2017-01-08 09:09:08 -06:00
Jenkins
b8f219de9a Merge "Set DIB_DEBIAN_COMPONENTS: 'main,universe' for ubuntu-minimal" 2017-01-05 18:43:38 +00:00
Emilien Macchi
adaca85b72 Introduce TripleO multinode (x4) experimental jobs with scenario005
TripleO Architectures allows complex scenarios where we can choose which
services (run with Pacemaker in HA) live on which node.

http://specs.openstack.org/openstack/tripleo-specs/specs/ocata/composable-ha-architecture.html

To test this architecture, we need a 4-nodes deployment where:
- 1 node is for undercloud
- 3 nodes are for Pacemaker cluster with composable services

scenario005 will deploy this complex architecture and make sure the
OpenStack cloud is working well, as we do with other scenarios.

This patch aims to:
- introduce 4nodes scenario in OpenStack Nodepool.
- introduce scenario005, in experimental pipeline
- configure scenario005 to run only when needed:
  - check experimental
  - only starting from ocata
  - only when we patch pacemaker files in puppet-tripleo and
    tripleo-heat-templates.

Change-Id: Ic1d05ba7d572a291d2c77d85473394564699078c
2017-01-04 11:23:27 -05:00
Clark Boylan
a126f3953f Make journald log collection persistent
Some job runs have overrun the journald ring buffer, which is used by
default, so older logs were lost during those runs. Update the journald
configuration to store the journal persistently so that we can reliably
retrieve those logs when jobs complete.

Change-Id: I5626ce76878287be220a8803f9dfe9a9da950d5b
2017-01-03 09:39:06 -08:00