2841 Commits

Author SHA1 Message Date
Gage Hugo
3182b01d82 Remove divingbell job
This change removes the non-voting divingbell job from
openstack-helm-infra checks because it does not test
much functionality.

Change-Id: I343b4cdc98d637522ac854211a974cc86d49cae6
2020-10-30 13:29:22 -05:00
Chinasubbareddy Mallavarapu
7c8ca55ac0 [ceph-provisioners] Validate each storageclass created
This change makes the helm tests validate every storageclass that
is created.

Change-Id: I62dc11600d00fe2ec7babb1688e61d3eaa50100c
2020-10-28 22:14:49 +00:00
Zuul
e74674324b Merge "Add capability to delete a backup archive" 2020-10-28 20:17:20 +00:00
Parsons, Cliff (cp769u)
2d1fe882bb Add capability to delete a backup archive
This patchset adds the capability to delete any archives that are stored
in the local file system or archives that are stored on the remote RGW
data store.

Change-Id: I68cade39e677f895e06ec8f2204f55ff913ce327
2020-10-28 16:19:31 +00:00
Andrii Ostapenko
22cfea81d0 Split deployment script sets to improve stability
Change-Id: I848d6ad0ce52863bf4a13b96b2afbf79bfaf70fc
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-10-28 15:01:45 +00:00
okozachenko
63b7a0cd0f Update ingress tpl in helmtoolkit
- Check issuer type to distinguish the annotation between
clusterissuer and issuer
- Add one more annotation "certmanager.k8s.io/xx" for old version

Change-Id: I320c1fe894c84ac38a2878af33e41706fb067422
2020-10-28 07:06:51 +00:00
Andrii Ostapenko
42f3b3eaf5 Don't use opendev docker proxy
It looks like using the docker proxy is slower and less stable than pulling
from dockerhub directly, and that it contributes to some of the unstable builds.

This reverts commit e3f14aaff35364b84acedf53b3778111cbae0373.

Change-Id: I9735ad35ce9240f610479a56eaa38715defa2e04
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-10-27 10:33:40 -05:00
Zuul
757a353b70 Merge "postgresql: Revert "Add default reject rule ..."" 2020-10-24 02:27:40 +00:00
Kabanov, Dmitrii
d39abfe0f0 [ceph-osd] Update post apply job
This patch set updates the wait_for_pods() function in the post-apply script.
The change allows wait_for_pods() to pass once the required percentage
of OSDs is reached (REQUIRED_PERCENT_OF_OSDS). It also removes code
that is no longer needed.

Change-Id: I56f1292682cf2aa933c913df162d6f615cf1a133
2020-10-23 19:00:58 +00:00
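The threshold behavior described in d39abfe0f0 can be sketched as follows. This is a minimal Python illustration, not the chart's shell script: the function name `enough_osds_ready` and its parameters are hypothetical, and only the idea of a required percentage (REQUIRED_PERCENT_OF_OSDS) comes from the commit message.

```python
def enough_osds_ready(ready: int, expected: int, required_percent: int) -> bool:
    """Return True once the ready OSDs reach the required percentage.

    Hypothetical helper; `required_percent` stands in for the
    REQUIRED_PERCENT_OF_OSDS value named in the commit message.
    """
    if expected <= 0:
        # Nothing is expected, so there is nothing to wait for.
        return True
    # Integer arithmetic avoids floating-point rounding at the boundary.
    return ready * 100 >= expected * required_percent


# With 10 expected OSDs and a 75% threshold, 8 ready OSDs pass and 7 do not.
print(enough_osds_ready(8, 10, 75))  # True
print(enough_osds_ready(7, 10, 75))  # False
```

A wait loop built on such a check would stop polling as soon as it returns True instead of waiting for every OSD.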
Phil Sphicas
20288319af postgresql: Revert "Add default reject rule ..."
This reverts commit 982e3754a5755cc227552b6f1fcc195e8793589c.
"Add default reject rule end in Postgres pg_hba.conf to ensure all
connections must be explicitly allowed."

The original commit introduced a breaking change when installing with
the chart defaults - before, all remote connections with md5 auth were
allowed, and after the change, only explicit users are allowed.

This is fully overridable, but the original defaults are more
conservative.

Change-Id: Ib297e480bccd3ac7c0cf15985b3def2c8b3e889e
2020-10-23 17:50:50 +00:00
Phil Sphicas
c43331d67a postgresql: Optimize restart behavior
* add preStop hook to trigger Fast Shutdown
* disable readiness probe by default

When Kubernetes terminates a pod, the container runtime typically sends
a SIGTERM signal to pid 1 in each container [0]. PostgreSQL interprets
SIGTERM as a request to do a "Smart Shutdown" [1]. This can take minutes
(often exhausting the termination grace period), and during this time,
new connections are not being serviced.

Now that postgresql has a single replica, this behavior is undesirable.
If we kill the pod (e.g. in an upgrade), we probably want it to come
back as soon as possible.

This change adds a preStop hook that sends a SIGINT to postgresql in
order to trigger a "Fast Shutdown". In addition, the readiness probe is
disabled by default, since it adds no value in a single-replica
scenario.

0: https://kubernetes.io/docs/concepts/workloads/pods/pod-lifecycle/#pod-termination
1: https://www.postgresql.org/docs/9.6/server-shutdown.html

Change-Id: Ib5f3d2a49e55332604c91f9a011e87d78947dbef
2020-10-23 07:41:57 +00:00
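The signal choice described in c43331d67a follows PostgreSQL's documented shutdown modes: SIGTERM requests a Smart Shutdown, SIGINT a Fast Shutdown, and SIGQUIT an Immediate Shutdown. A minimal Python sketch of that mapping is below. The helper names are hypothetical, the sketch assumes a Unix-like host, and the chart itself implements the hook in its pod template, not in Python.

```python
import os
import signal

# PostgreSQL's documented shutdown modes and the signals that trigger them.
SHUTDOWN_SIGNALS = {
    "smart": signal.SIGTERM,      # wait for existing sessions to end (what pod termination sends)
    "fast": signal.SIGINT,        # abort open transactions and exit promptly
    "immediate": signal.SIGQUIT,  # exit without a clean shutdown; recovery runs on restart
}


def shutdown_signal(mode: str) -> signal.Signals:
    """Look up the signal for a shutdown mode (hypothetical helper)."""
    return SHUTDOWN_SIGNALS[mode]


def request_shutdown(postmaster_pid: int, mode: str = "fast") -> None:
    """Send the signal for `mode` to the postmaster process."""
    os.kill(postmaster_pid, shutdown_signal(mode))
```

Defaulting to "fast" mirrors the intent of the preStop hook: the single replica should stop quickly so that its replacement can start serving connections.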
Phil Sphicas
a10699c4e0 postgresql: Allow probe tweaking
Uses the standard helm-toolkit macros for liveness and readiness probes,
allowing them to be enabled or disabled, and params to be overridden.

The existing hard-coded settings are preserved as the chart defaults.

Change-Id: Idd063e6b8721126c88fa22c459f93812151d7b64
2020-10-23 06:52:45 +00:00
KHIYANI, RAHUL (rk0850)
b4d0793b98 Add pod/container security context template to create_db.yaml
This enables the runAsUser and ReadOnly-fs flags to be overridden in
values.yaml

Change-Id: I2e5cbd57f90ef1f5c09b7a54cd04d92dcfd8edc5
2020-10-22 20:50:25 +00:00
Zuul
a7cfefddb5 Merge "Fix spacing inconsistencies with flags" 2020-10-22 15:39:33 +00:00
Zuul
9332c2961e Merge "Fix ks-user script case matching for domain" 2020-10-21 17:51:04 +00:00
Smith, David (ds3330)
9d9aaa8948 Fix spacing inconsistencies with flags
Change-Id: Ia8f7437071a8865f1470412ad616b67a38142719
2020-10-21 13:44:07 +00:00
Tin Lam
62b10c7d49 chore(pkg): updates the chart packaging
Part 2. This patch set adjusts the URL once the initial packages are
made available.

Change-Id: Idfb69146d606b43c98c552d1d2c5680ccd503282
Signed-off-by: Tin Lam <tin@irrational.io>
2020-10-21 00:58:16 -05:00
Tin Lam
738c89b342 fix(job): fixes the post job
This corrects the ability to sync artifacts to tarballs.o.o.

Change-Id: Icb2b6653f263aaab173d1479d05c0209e7390c50
Signed-off-by: Tin Lam <tin@irrational.io>
2020-10-20 22:43:10 -05:00
Tin Lam
da81705a47 fix(post): fixes publish job
This fixes a typo in the publish job.

Change-Id: I077feb29a8764a0b3031b34b462779c911baaee3
Signed-off-by: Tin Lam <tin@irrational.io>
2020-10-19 11:53:46 -05:00
Gage Hugo
cddf665c16 Fix ks-user script case matching for domain
Some services attempt to recreate the default domain
with both the values of "default" and "Default". Since this
domain already exists when keystone is deployed, this
creates redundant API calls that only result in conflicts.

This change enables nocasematch for string checking in order
to avoid making multiple unnecessary calls to keystone.

Change-Id: I698fd420dc41eae211a511269cb021d4ab7a5bfc
2020-10-19 05:03:58 +00:00
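The effect of the case-insensitive check in cddf665c16 can be sketched in Python. The script itself relies on the shell's nocasematch option; the function below is a hypothetical stand-in that only illustrates why "default" and "Default" should be treated as the same domain.

```python
from typing import Iterable


def domain_exists(requested: str, existing: Iterable[str]) -> bool:
    """Case-insensitive membership test for a domain name (hypothetical helper)."""
    wanted = requested.casefold()
    return any(name.casefold() == wanted for name in existing)


existing_domains = ["Default"]

# Both spellings match the existing domain, so no create call is needed.
for name in ("default", "Default"):
    if not domain_exists(name, existing_domains):
        print(f"would create domain {name!r}")
```

Because both spellings match, the loop issues no create request, which is the redundant call the commit aims to avoid.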
Tin Lam
e5c776e5c4 chore(pkg): updates the chart packaging
This patch set updates the ability to package (and subsequently publish)
the charts in the OpenStack-Helm-Infra repository.

Change-Id: I6175325b0e7a668c22a7ec3ab08cae51ad4f9ab8
Signed-off-by: Tin Lam <tin@irrational.io>
2020-10-17 08:42:53 +00:00
Zuul
a282491ba6 Merge "[ceph-client] fix the logic to disable the autoscaler on pools" 2020-10-17 01:47:54 +00:00
Chinasubbareddy Mallavarapu
c3f921c916 [ceph-client] fix the logic to disable the autoscaler on pools
This fixes the logic to disable the autoscaler on pools, as
it does not consider newly created pools.

Change-Id: I76fe106918d865b6443453b13e3a4bd6fc35206a
2020-10-16 21:17:07 +00:00
Stephen Taylor
16b72c1e22 [ceph-osd] Synchronization audit for the ceph-volume osd-init script
There are race conditions in the ceph-volume osd-init script that
occasionally cause deployment and OSD restart issues. This change
attempts to resolve those and stabilize the script when multiple
instances run simultaneously on the same host.

Change-Id: I79407059fa20fb51c6840717a083a8dc616ba410
2020-10-16 18:30:57 +00:00
Tin Lam
3a2d0f83b4 chore(charts): addresses issues with chart publish
This change attempts to address the chart publish issue. It also makes
the job periodic.

Change-Id: I806da82a7eb07ce8e83ae8c023a014fa3b917193
Signed-off-by: Tin Lam <tin@irrational.io>
2020-10-16 15:15:35 +00:00
Zuul
af712da863 Merge "Update image version from v2.0.0-alpha to v2.0.0-alpha-1" 2020-10-15 15:34:22 +00:00
Chinasubbareddy Mallavarapu
321b8cb7e3 [ceph-osd] Logic improvement for used osd disk detection
This improves the logic to detect used OSD disks so that scripts will
not zap the OSD disks aggressively.

It also adds a debugging mode for pvdisplay commands to capture more logs
during failure scenarios, and reads the OSD force repair flag from
values.

Change-Id: Id2996211dd92ac963ad531f8671a7cc8f7b7d2d5
2020-10-15 13:13:28 +00:00
Andrii Ostapenko
825d0a5f3d Switch back to openstack-helm-single-node for support job
Jobs running on the current node label do not seem to be stable.

Also fixes ssl support job name duplication.

Change-Id: I1b9a3b674818fb7a322d05b4f849188f1484e47b
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-10-15 07:03:14 -05:00
Xiaoguang(William) Zhang
57e58c388d Update image version from v2.0.0-alpha to v2.0.0-alpha-1
Fix missing labels issue (kube-state-metrics issue #1239)

Change-Id: I7ff246b424885fda7d7aac2e95a30ef1994ec545
2020-10-14 16:37:02 -04:00
KHIYANI, RAHUL (rk0850)
53480c39f0 Add missing pod/container flags to alerta
Adding runAsUser and readOnly-fs flag to alerta pod/container.

The security context templates are added here:
https://github.com/openstack/openstack-helm-infra/blob/master/alerta/templates/deployment.yaml#L52

Change-Id: I1136f776ffd7caf54310288a162563ef4e565bbe
2020-10-14 11:41:07 -05:00
Zuul
ed229cf605 Merge "[ceph-osd] Fix the sync issue between osds when using shared disk for metadata" 2020-10-11 16:41:50 +00:00
Chris Wedgwood
da1117e257 [PostgreSQL] Use explicit entrypoint for prometheus exporter
It appears having `args:` without `command:` causes some combinations
of kubernetes & container runtimes to not work as expected.

Change-Id: Id9d692632066de410ca5f13bbfe13d1899b93819
2020-10-11 13:53:34 +00:00
Chinasubbareddy Mallavarapu
6a0feecaef [ceph-osd] Fix the sync issue between osds when using shared disk for metadata
This fixes the synchronization between ceph OSDs that use a shared
disk for metadata, as they conflict while preparing the metadata disk.

A lock is added while the first OSD prepares the shared metadata disk so that
other OSDs wait for the lock. A udev settle is also added in a few places to get
the latest tags on LVM devices.

Change-Id: I018bd12a3f02cf8cd3486b9c97e14b138b5dac76
2020-10-11 04:04:53 +00:00
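The locking approach described in 6a0feecaef can be illustrated with an advisory file lock. This is a minimal Python sketch under assumptions: the lock path and the `prepare` callback are hypothetical, and the chart's script is a shell script that may use a different locking mechanism.

```python
import fcntl
import os
import tempfile
from typing import Callable


def with_disk_lock(lock_path: str, prepare: Callable[[], None]) -> None:
    """Run `prepare` while holding an exclusive advisory lock on `lock_path`.

    A second process calling this with the same path blocks in flock()
    until the first one releases the lock.
    """
    with open(lock_path, "w") as lock_file:
        fcntl.flock(lock_file, fcntl.LOCK_EX)  # blocks until the lock is free
        try:
            prepare()
        finally:
            fcntl.flock(lock_file, fcntl.LOCK_UN)


# Hypothetical usage: serialize preparation of a shared metadata disk.
prepared = []
lock_path = os.path.join(tempfile.gettempdir(), "shared-metadata-disk.lock")
with_disk_lock(lock_path, lambda: prepared.append("metadata disk"))
```

The lock only serializes cooperating processes that use the same lock file; it does not protect the disk from anything that ignores the lock.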
Andrii Ostapenko
3e52027702 Adds vexxhost based nodeset
Adds ubuntu-bionic-expanded nodeset based on ubuntu-bionic-expanded-vexxhost
that is 16c/~32GB
Switches long running support job to new nodeset to reduce build time.

Change-Id: Iec27c00bf54efa7d686b3176998fc8ad6c9f287e
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-10-09 10:32:06 -05:00
Zuul
a20afe253f Merge "adding archiving to postgres" 2020-10-08 22:34:50 +00:00
Zuul
4e5fe5ec10 Merge "[Libvirt] Values overrides for external Ceph Cinder backend" 2020-10-08 20:09:55 +00:00
Apurva Gokani
85cbd6f04b adding archiving to postgres
To safeguard postgres from WAL files clogging up
the pg_xlog directory, this change does the following:
1) Adds postgres archiving to move the WAL files to a different directory.
2) Makes sure that the archive is on a different persistent volume.

Change-Id: I59bc76f27384d4f3836ef609855afcc33a7b99d0
2020-10-08 13:14:03 -05:00
Zuul
27d6f4aa70 Merge "Add conntrack required for k8s 1.18.x" 2020-10-08 17:12:42 +00:00
Huang, Sophie (sh879n)
df3918b4fe [Libvirt] Values overrides for external Ceph Cinder backend
An example values_overrides file is added to show how to
override the Libvirt manifest to configure an additional
externally managed Ceph Cinder backend.

Change-Id: I8e7a294059a2d98fb7854a281a29dcff80530d2b
2020-10-08 15:01:58 +00:00
Andrii Ostapenko
9d7d4e4ba6 Add conntrack required for k8s 1.18.x
Change-Id: I8c85e11ad984fa283139b3d71d52bc7c3fb8e900
Signed-off-by: Andrii Ostapenko <andrii.ostapenko@att.com>
2020-10-07 18:34:34 +00:00
Zuul
72658e2712 Merge "Revert "Prometheus: Render Rules as Templates"" 2020-10-07 15:11:14 +00:00
Zuul
f5caa0c7da Merge "Add extensible command line flags to Alertmanager" 2020-10-06 19:32:03 +00:00
Zuul
d75d0d8f00 Merge "[ceph-osd] Don't try to prepare OSD disks that are already deployed" 2020-10-06 18:17:51 +00:00
Zuul
c658ac37df Merge "Prometheus: Add configurable readiness/liveness Probes" 2020-10-06 16:11:57 +00:00
Steven Fitzpatrick
cdd0f33d0c Revert "Prometheus: Render Rules as Templates"
This reverts commit fb7fc87d237ce569666f7bd041adea6007549138.

I first submitted that as a way to add dynamic capability to the
prometheus rules (they infamously don't support ENV variable
substitution there). However, this can be done easily with another solution,
and doing so would clean up the prometheus chart values significantly.

Change-Id: Ibec512d92490798ae5522468b915b49e7746806a
2020-10-06 15:21:18 +00:00
Stephen Taylor
38d9f35c05 [ceph-osd] Don't try to prepare OSD disks that are already deployed
This addresses an issue that can prevent some OSDs from being able
to restart properly after they have been deployed. Some OSDs try to
prepare their disks again on restart and end up crash looping. This
change fixes that.

Change-Id: I9edc1326c3544d9f3e8b6e3ff83529930a28dfc6
2020-10-05 18:40:48 -05:00
Zuul
1af5482e2d Merge "Alerta update to silence alert in Alertmanager" 2020-10-05 20:34:42 +00:00
Steven Fitzpatrick
f4bdb713c1 Prometheus: Add configurable readiness/liveness Probes
This change adds probes to the prometheus statefulset using the HTK
probe generation functions.

Change-Id: I249d662dd0d23dd964f7118af94c733bbdc5db92
2020-10-05 19:28:00 +00:00
Steven Fitzpatrick
2bdf4f8239 Add extensible command line flags to Alertmanager
Alertmanager is configured similarly to Prometheus. This change
brings the utils.command_line_flags template from the osh-infra
prometheus chart to Alertmanager, allowing these flags to be
configured in Values.yaml

Change-Id: Ieca94c09881bc52b62500efa4c6f8730b9208d3b
2020-10-05 17:05:26 +00:00
Zuul
006835b59e Merge "[kube-state-metrics] Update to make current" 2020-10-05 15:43:16 +00:00