tripleo-heat-templates/deployment
katarimanoj 811bf370be Fix DCN system upgrade failure to set ceph noout
In case of DCN setup with multiple stacks, the ceph task
"Set noout flag" is delegated to incorrect host and the
ceph command also fails to find  the conf and keyring.

This patch will ensure to
  - delegate the task to right host
  - run the ceph command for the specific cluster by
    using the correct conf and keyring files

Change-Id: Ib79b6bf3fb61a782661beaaeec85a2610c0f3833
2023-12-13 16:49:57 +05:30
..
aide Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
aodh Enable CAP_AUDIT_WRITE for some containers/steps 2023-07-03 18:13:52 +09:00
apache Apache: Use only required modules 2022-09-06 07:04:18 +00:00
auditd Use the new tripleo_auditd ansible role instead of puppet 2022-08-17 07:03:31 +00:00
barbican Enable CAP_AUDIT_WRITE for some containers/steps 2023-07-03 18:13:52 +09:00
ceilometer Ceilometer: Enable oslo.cache backend 2023-10-13 13:02:34 +05:30
ceph-ansible Add tag to skip ceph-ansible removal 2023-11-15 13:14:37 +05:30
cephadm Fix DCN system upgrade failure to set ceph noout 2023-12-13 16:49:57 +05:30
certs Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
cinder cinder-volume-pacemaker leapp upgrade with NFS 2023-09-29 17:23:54 +01:00
clients Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
container-image-prepare Support role specific ContainerImagePrepare 2022-12-23 14:42:20 +10:00
database Add certificate/key to db-upgrade temporary container 2023-11-01 13:59:17 +00:00
deprecated Install systemd-container package on minor update 2023-12-04 16:20:41 +01:00
designate Merge "add parameter to change ns_records of /etc/designate/pools.yaml" into stable/wallaby 2023-09-23 06:14:46 +00:00
etcd Role specific container support 2022-09-22 22:35:55 +09:00
experimental Move designate from experimental 2021-06-28 17:25:15 -02:30
frr Call tripleo_container_manage for FRR during updates 2023-09-14 07:42:11 +02:00
glance Reintroduce NFS mounts after leapp upgrade 2023-09-15 12:45:28 +01:00
gnocchi Reintroduce NFS mounts after leapp upgrade 2023-09-15 12:45:28 +01:00
haproxy Allow 1993 port for HAProxy stats page 2023-08-10 11:18:17 +00:00
heat Enable CAP_AUDIT_WRITE for some containers/steps 2023-07-03 18:13:52 +09:00
horizon Merge "Horizon: Support Strict-Transport-Security header" into stable/wallaby 2023-09-18 15:10:37 +00:00
image-serve Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
ipa Run TLS-E in pre_deploy_step_tasks 2023-07-10 09:31:15 +02:00
ipsec Filter excluded nodes from ansible delegates 2022-09-27 10:57:18 +00:00
ironic Enable CAP_AUDIT_WRITE for some containers/steps 2023-07-03 18:13:52 +09:00
iscsid Role specific container support 2022-09-22 22:35:55 +09:00
kernel Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
keystone Fix wrong parameter type of KeystoneCronTrustFlush* 2023-01-11 02:14:13 +00:00
logging [OSP=17.1] Use older rsyslog puppet class 2023-05-30 17:18:11 +02:00
login-defs Fix wrong parameter assignation 2022-08-17 07:35:58 +00:00
logrotate Filter excluded nodes from ansible delegates 2022-09-27 10:57:18 +00:00
manila Enable CAP_AUDIT_WRITE for some containers/steps 2023-07-03 18:13:52 +09:00
masquerade-networks Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
memcached Filter excluded nodes from ansible delegates 2022-09-27 10:57:18 +00:00
messaging Role specific container support 2022-09-22 22:35:55 +09:00
metrics [ffwd3] Switch UBI8 to UBI9 of collectd after OS upgrade 2023-07-17 11:39:42 +02:00
multipathd Bind /run/udev into the multipathd container 2023-02-14 12:28:56 -08:00
neutron Enable radvd to start as a side cart container by default 2023-10-04 15:03:32 -04:00
nova Merge "Install systemd-container package on minor update" into stable/wallaby 2023-12-06 18:43:30 +00:00
octavia Merge "Enable CAP_AUDIT_WRITE for some containers/steps" into stable/wallaby 2023-07-04 13:47:03 +00:00
openvswitch Add support for OVS PMD load based sleeping 2023-10-31 12:49:05 +00:00
ovn [ffwd3][update] Improve ovn update 2023-10-11 14:56:14 +00:00
pacemaker Pacemaker cleanup during scale_down task for Compute nodes 2023-08-08 17:15:55 +02:00
placement Enable CAP_AUDIT_WRITE for some containers/steps 2023-07-03 18:13:52 +09:00
podman Merge "Comment infra_image in container.conf" into stable/wallaby 2023-10-23 14:00:53 +00:00
qdr Role specific container support 2022-09-22 22:35:55 +09:00
rabbitmq [ffwd3] Fix node_names variable for cell stacks 2023-06-18 00:23:27 +02:00
rhsm Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
securetty Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
snmp Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
sshd Deprecate SshServerOptionsOverrides 2022-06-15 14:54:03 +05:30
swift Role specific container support 2022-09-22 22:35:55 +09:00
tests Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
time Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
timemaster Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
timesync Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
tls Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
tripleo-firewall Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
tripleo-packages [ffwd3] WA 2240185 fix schema v1 for podman 2023-10-05 12:58:14 +00:00
tuned Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
unbound Fix trailing slashes in bind-mounts and linting 2022-11-18 09:48:06 +09:00
undercloud Ansible package changed name to ansible-core, update exclude value 2023-02-16 08:15:49 +00:00
validations Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
vpp Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
README.rst Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30
containers-common.yaml Use server side env merging for ServiceNetMap/VipSubnetMap 2022-06-15 14:52:48 +05:30

README.rst

TripleO Deployments

This directory contains files that represent individual service deployments, orchestration tools, and the configuration tools used to deploy them.

Directory Structure

Each logical grouping of services will have a directory. Example: 'timesync'. Within this directory related timesync services would exist to for example configure timesync services on baremetal or via containers.

Filenaming conventions

As a convention each deployments service filename will reflect both the deployment engine (baremetal, or containers) along with the config tool used to deploy that service.

The convention is <service-name>-<engine>-<config management tool>.

Examples:

deployment/aodh/aodh-api-container-puppet.yaml (containerized Aodh service configured with Puppet)

deployment/aodh/aodh-api-container-ansible.yaml (containerized Aodh service configured with Ansible)

deployment/timesync/chrony-baremetal-ansible.yaml (baremetal Chrony service configured with Ansible)

deployment/timesync/chrony-baremetal-puppet.yaml (baremetal Chrony service configured with Puppet)

Building Kolla Images

TripleO currently relies on Kolla(Dockerfile) containers. Kolla supports container customization and we are making use of this feature within TripleO to inject puppet (our configuration tool of choice) into the Kolla base images. A variety of other customizations are being made via the tripleo-common/container-images/tripleo_kolla_template_overrides.j2 file.

To build Kolla images for TripleO adjust your kolla config1 to build your centos base image with puppet using the example below:

$ cat template-overrides.j2 {% extends parent_template %} {% set base_centos_binary_packages_append = ['puppet'] %} {% set nova_scheduler_packages_append = ['openstack-tripleo-common'] %}

kolla-build --base centos --template-override template-overrides.j2

Containerized Deployment Template Structure

Each deployment template may define a set of output values control the underlying service deployment in a variety of ways. These output sections are specific to the TripleO deployment architecture. The following sections are available for containerized services.

  • config_settings: This section contains service specific hiera data can be used to generate config files for each service. This data is ultimately processed via the container_puppet_config module in tripleo-ansible which generates config files for each service according to the settings here.

  • kolla_config: Contains YAML that represents how to map config files into the kolla container. This config file is typically mapped into the container itself at the /var/lib/kolla/config_files/config.json location and drives how kolla's external config mechanisms work.

  • docker_config: Data that is passed to tripleo_container_manage role to configure a container, or step of containers at each step. See the available steps documented below which are implemented by TripleO's cluster deployment architecture. If you want the tasks executed only once for the bootstrap node per a role in the cluster, use the /usr/bin/bootstrap_host_exec wrapper.

  • puppet_config: This section is a nested set of key value pairs that drive the creation of config files using puppet. Required parameters include:

    • puppet_tags: Puppet resource tag names that are used to generate config files with puppet. Only the named config resources are used to generate a config file. Any service that specifies tags will have the default tags of 'file,concat,file_line,augeas,cron' appended to the setting. Example: keystone_config
    • config_volume: The name of the volume (directory) where config files will be generated for this service. Use this as the location to bind mount into the running Kolla container for configuration.
    • config_image: The name of the container image that will be used for generating configuration files. This is often the same container that the runtime service uses. Some services share a common set of config files which are generated in a common base container.
    • step_config: This setting controls the manifest that is used to create container config files via puppet. The puppet tags below are used along with this manifest to generate a config directory for this container.
  • container_puppet_tasks: This section provides data to drive the puppet containers tooling directly. The task is executed for the defined steps before the corresponding docker_config's step. Puppet always sees the step number overrided as the step #6. It might be useful for initialization of things. Note that the tasks are executed only once for the bootstrap node per a role in the cluster. Make sure the puppet manifest ensures the wanted "at most once" semantics. That may be achieved via the <service_name>_short_bootstrap_node_name hiera parameters automatically evaluated for each service.

  • global_config_settings: the hiera keys will be distributed to all roles

  • service_config_settings: Takes an extra key to wire in values that are defined for a service that need to be consumed by some other service. For example: service_config_settings: haproxy: foo: bar This will set the hiera key 'foo' on all roles where haproxy is included.

Deployment steps

Similar to baremetal containers are brought up in a stepwise manner. The current architecture supports bringing up baremetal services alongside of containers. For each step the baremetal puppet manifests are executed first and then any containers are brought up afterwards.

Steps correlate to the following:

Pre) Containers config files generated per hiera settings. 1) Load Balancer configuration baremetal a) step 1 baremetal b) step 1 containers 2) Core Services (Database/Rabbit/NTP/etc.) a) step 2 baremetal b) step 2 containers 3) Early Openstack Service setup (Ringbuilder, etc.) a) step 3 baremetal b) step 3 containers 4) General OpenStack Services a) step 4 baremetal b) step 4 containers c) Keystone containers post initialization (tenant,service,endpoint creation) 5) Service activation (Pacemaker), online data migration a) step 5 baremetal b) step 5 containers

Update steps:

All services have an associated update_tasks output that is an ansible snippet that will be run during update in an rolling update that is expected to run in a rolling update fashion (one node at a time)

For Controller (where pacemaker is running) we have the following states:
  1. Step=1: stop the cluster on the updated node;
  2. Step=2: Pull the latest image and retag the it pcmklatest
  3. Step=3: yum upgrade happens on the host.
  4. Step=4: Restart the cluster on the node
  5. Step=5: Verification: Currently we test that the pacemaker services are running.

Then the usual deploy steps are run which pull in the latest image for all containerized services and the updated configuration if any.

Note: as pacemaker is not containerized, the points 1 and 4 happen in deployment/pacemaker/pacemaker-baremetal-puppet.yaml.

Input Parameters

Each service may define its own input parameters and defaults. Operators will use the parameter_defaults section of any Heat environment to set per service parameters.

Apart from sevice specific inputs, there are few default parameters for all the services. Following are the list of default parameters:

  • ServiceData: Mapping of service specific data. It is used to encapsulate all the service specific data. As of now, it contains net_cidr_map, which contains the CIDR map for all the networks. Additional data will be added as and when required.

  • ServiceNetMap: Mapping of service_name -> network name. Default mappings for service to network names are defined in ./overcloud-resource-registry-puppet.j2.yaml, which may be overridden via ServiceNetMap values added to a user environment file via parameter_defaults. Use parameter_merge_strategies to merge it with the defaults.

  • EndpointMap: Mapping of service endpoint -> protocol. Contains a mapping of endpoint data generated for all services, based on the data included in ../network/endpoints/endpoint_data.yaml.

  • RoleName: Name of the role on which this service is deployed. A service can be deployed in multiple roles. This is an internal parameter (should not be set via environment file), which is fetched from the name attribute of the roles_data.yaml template.

  • RoleParameters: Parameter specific to a role on which the service is applied. Using the format "<RoleName>Parameters" in the parameter_defaults of user environment file, parameters can be provided for a specific role. For example, in order to provide a parameter specific to "Compute" role, below is the format:

    parameter_defaults:
      ComputeParameters:
        Param1: value

Update Steps

Each service template may optionally define a update_tasks key, which is a list of ansible tasks to be performed during the minor update process. These are executed in a rolling manner node-by-node.

We allow a series of steps for the per-service update sequence via conditionals referencing a step variable e.g when: step|int == 2.

Pre-upgrade Rolling Steps

Each service template may optionally define a pre_upgrade_rolling_tasks key, which is a list of ansible tasks to be performed before the main upgrade phase, and these tasks are executed in a node-by-node rolling manner on the overcloud, similarly as update_tasks.

Upgrade Steps

Each service template may optionally define a upgrade_tasks key, which is a list of ansible tasks to be performed during the upgrade process.

Similar to the update_tasks, we allow a series of steps for the per-service upgrade sequence, defined as ansible tasks with a "when: step|int == 1" for the first step, "== 2" for the second, etc.

Steps correlate to the following:

  1. Perform any pre-upgrade validations.
  2. Stop the control-plane services, e.g disable LoadBalancer, stop pacemaker cluster and stop any managed resources. The exact order is controlled by the cluster constraints.
  3. Perform a package update and install new packages: A general upgrade is done, and only new package should go into service ansible tasks.
  4. Start services needed for migration tasks (e.g DB)
  5. Perform any migration tasks, e.g DB sync commands

Note that the services are not started in the upgrade tasks - we instead re-run puppet which does any reconfiguration required for the new version, then starts the services.

When running an OS upgrade via the tags system_upgrade_prepare and system_upgrade_run, or the combined tag system_upgrade, the steps corellate to the following:

  1. Any pre-service-stop actions. (system_upgrade_prepare)
  2. Stop all services. (system_upgrade_prepare)
  3. Post-service-stop actions like removing packages before the upgrade. (system_upgrade_prepare)
  4. Step reserved for the tripleo-packages service. Only package download for upgrade (under system_upgrade_prepare tag), and reboot for performing the offline upgrade (under system_upgrade_run tag) happens here.
  5. Any post-upgrade tasks (system_upgrade_run).

Nova Server Metadata Settings

One can use the hook of type OS::TripleO::ServiceServerMetadataHook to pass entries to the nova instances' metadata. It is, however, disabled by default. In order to overwrite it one needs to define it in the resource registry. An implementation of this hook needs to conform to the following:

  • It needs to define an input called RoleData of json type. This gets as input the contents of the role_data for each role's ServiceChain.
  • This needs to define an output called metadata which will be given to the Nova Server resource as the instance's metadata.

Keystone resources management

Keystone resources, such as users, roles, domains, endpoints, services, role assignments, are now managed by tripleo-keystone-resources Ansible role.


  1. See the override file which can be used to build Kolla packages that work with TripleO.↩︎