
Recently the openstacksdk functional test test_volume_attachment started failing frequently. It mostly failed during the tearDown step while trying to delete the volume, because a volume delete had already been issued by the server delete (which it shouldn't have been).

Looking into the issue, the problem turned out to be a race between the instance's BDM record being deleted (during volume attachment delete) and the server delete. The sequence of operations that triggers this issue is:

1. Delete the volume attachment
2. Wait for the volume to become available
3. Delete the server

In step (2), nova sends a request to cinder to delete the volume attachment[1], which puts the volume in the available state[2], BUT the operation to delete the BDM record is still ongoing on the nova side[3]. Hence we end up in a race: while nova is deleting the BDM record, we issue a server delete (an overlapping request), which in turn consumes that BDM record and sends requests to (which it shouldn't):

1. delete the attachment (which is already deleted, hence returns 404)
2. delete the volume

Later, when the functional test issues another request to delete the volume, we fail since the volume is already in the process of being deleted (by the server delete operation -- delete_on_termination is set to true).

This analysis could yield a number of fixes in nova and cinder, namely:

1. Nova could prevent the race between the BDM record being deleted and being used at the same time.
2. Cinder could detect that the volume is being deleted and return success for subsequent delete requests (instead of failing with 400 BadRequest).

This patch focuses on fixing this on the SDK side, where the flow of operations happens too fast, triggering the race condition. We introduce a wait mechanism to wait for the VolumeAttachment resource to be deleted, and then verify that the number of attachments on the server is 0 before moving to the tearDown that deletes the server and the volume.

There is a 1-second race window, which can be seen here:

1. Server delete starting at 17:13:49

   2024-06-05 17:13:49,892 openstack.iterate_timeout ****Timeout is 300 --- wait is 2.0 --- start time is 1717607629.892198 ----
   2024-06-05 17:13:49,892 openstack.iterate_timeout $$$$ Count is 1 --- time difference is 299.99977254867554
   2024-06-05 17:13:50,133 openstack.iterate_timeout Waiting 2.0 seconds

2. BDM record being deleted at 17:13:50 (already used by the server delete to issue the attachment and volume delete calls)

   *************************** 2. row ***************************
       created_at: 2024-06-05 17:13:11
       ...
       deleted_at: 2024-06-05 17:13:50
       ...
      device_name: /dev/vdb
        volume_id: c13a3070-c5ab-4c8a-bb7e-5c7527fdf0df
    attachment_id: a1280ca9-4f88-49f7-9ba2-1e796688ebcc
    instance_uuid: 98bc13b2-50fe-4681-b263-80abf08929ac
       ...

[1] 7dc4b1ea62/nova/virt/block_device.py (L553)
[2] 9f1292ad06/cinder/volume/api.py (L2685)
[3] 7dc4b1ea62/nova/compute/manager.py (L7658-L7659)
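In outline, the added wait behaves like a simple poll-until-gone loop. The sketch below is illustrative only (the `wait_for_gone` helper and the fake fetch are hypothetical, not the SDK's implementation; the SDK exposes comparable behaviour through its resource wait utilities):

```python
import time

def wait_for_gone(fetch, interval=0.01, timeout=1.0):
    """Poll ``fetch`` until the resource is gone (``fetch`` returns None).

    Hypothetical helper mirroring the idea of the fix: after deleting the
    volume attachment, keep polling until the VolumeAttachment resource has
    really disappeared (a GET returning 404 maps to None here) before
    tearDown deletes the server and the volume.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if fetch() is None:
            return True
        time.sleep(interval)
    raise TimeoutError('resource was not deleted in time')

# Stand-in for the attachment GET: pretend the record vanishes on the 3rd poll.
polls = {'count': 0}

def fake_attachment_get():
    polls['count'] += 1
    return None if polls['count'] >= 3 else {'id': 'a1280ca9'}

print(wait_for_gone(fake_attachment_get))  # True
```

Only once the loop confirms the attachment is gone does the test go on to check that the server reports zero attachments, which closes the window in which a server delete could consume the still-live BDM record.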
Closes-Bug: #2067869
Change-Id: Ia59df9640d778bec4b22e608d111f82b759ac610
openstacksdk
openstacksdk is a client library for building applications to work with OpenStack clouds. The project aims to provide a consistent and complete set of interactions with OpenStack's many services, along with complete documentation, examples, and tools.
It also contains an abstraction interface layer. Clouds can do many things, but there are probably only about 10 of them that most people care about with any regularity. If you want to do complicated things, the per-service oriented portions of the SDK are for you. However, if what you want is to be able to write an application that talks to any OpenStack cloud regardless of configuration, then the Cloud Abstraction layer is for you.
More information about the history of openstacksdk can be found at https://docs.openstack.org/openstacksdk/latest/contributor/history.html
Getting started
Authentication and connection management
openstacksdk aims to talk to any OpenStack cloud. To do this, it requires a configuration file. openstacksdk favours `clouds.yaml` files, but can also use environment variables. The `clouds.yaml` file should be provided by your cloud provider or deployment tooling. An example:
```yaml
clouds:
  mordred:
    region_name: Dallas
    auth:
      username: 'mordred'
      password: XXXXXXX
      project_name: 'demo'
      auth_url: 'https://identity.example.com'
```
openstacksdk will look for `clouds.yaml` files in the following locations:

- If set, the path indicated by the `OS_CLIENT_CONFIG_FILE` environment variable
- `.` (the current directory)
- `$HOME/.config/openstack`
- `/etc/openstack`
You can create a connection using the `openstack.connect` function. The cloud name can either be passed directly to this function or specified using the `OS_CLOUD` environment variable. If you don't have a `clouds.yaml` file and instead use environment variables for configuration, then you can use the special `envvars` cloud name to load configuration from the environment. For example:
```python
import openstack

# Initialize connection from a clouds.yaml by passing a cloud name
conn_from_cloud_name = openstack.connect(cloud='mordred')

# Initialize connection from a clouds.yaml using the OS_CLOUD envvar
conn_from_os_cloud = openstack.connect()

# Initialize connection from environment variables
conn_from_env_vars = openstack.connect(cloud='envvars')
```
Note
How this is all achieved is described in more detail below.
The cloud layer
openstacksdk consists of four layers which all build on top of each other. The highest level layer is the cloud layer. Cloud layer methods are available via the top level `Connection` object returned by `openstack.connect`. For example:
```python
import openstack

# Initialize and turn on debug logging
openstack.enable_logging(debug=True)

# Initialize connection
conn = openstack.connect(cloud='mordred')

# List the servers
for server in conn.list_servers():
    print(server.to_dict())
```
The cloud layer is based on logical operations that can potentially touch multiple services. The benefit of this layer is mostly seen in more complicated operations that take multiple steps and where the steps vary across providers. For example:
```python
import openstack

# Initialize and turn on debug logging
openstack.enable_logging(debug=True)

# Initialize connection
conn = openstack.connect(cloud='mordred')

# Upload an image to the cloud
image = conn.create_image(
    'ubuntu-trusty', filename='ubuntu-trusty.qcow2', wait=True)

# Find a flavor with at least 512M of RAM
flavor = conn.get_flavor_by_ram(512)

# Boot a server, wait for it to boot, and then do whatever is needed
# to get a public IP address for it.
conn.create_server(
    'my-server', image=image, flavor=flavor, wait=True, auto_ip=True)
```
The proxy layer
The next layer is the proxy layer. Most users will make use of this layer. The proxy layer is service-specific, so methods are available under service-specific connection attributes of the `Connection` object such as `compute`, `block_storage`, `image`, etc. For example:
```python
import openstack

# Initialize and turn on debug logging
openstack.enable_logging(debug=True)

# Initialize connection
conn = openstack.connect(cloud='mordred')

# List the servers
for server in conn.compute.servers():
    print(server.to_dict())
```
Note
A list of supported services is given below.
The resource layer
Below this there is the resource layer. This provides support for the basic CRUD operations supported by REST APIs and is the base building block for the other layers. You will typically not need to use this directly, but it can be helpful for operations where you already have a `Resource` object to hand. For example:
```python
import openstack
import openstack.config.loader
import openstack.compute.v2.server

# Initialize and turn on debug logging
openstack.enable_logging(debug=True)

# Initialize connection
conn = openstack.connect(cloud='mordred')

# List the servers
for server in openstack.compute.v2.server.Server.list(session=conn.compute):
    print(server.to_dict())
```
The raw HTTP layer
Finally, there is the raw HTTP layer. This exposes raw HTTP semantics and is effectively a wrapper around the requests API, with added smarts to handle things like authentication and version management. As such, you can use the requests API methods you know and love, like `get`, `post` and `put`, and expect to receive a `requests.Response` object in response (unlike the other layers, which mostly return objects that subclass `openstack.resource.Resource`). Like the resource layer, you will typically not need to use this directly, but it can be helpful for interacting with APIs that have not or will not be supported by openstacksdk. For example:
```python
import openstack

# Initialize and turn on debug logging
openstack.enable_logging(debug=True)

# Initialize connection
conn = openstack.connect(cloud='mordred')

# List servers
for server in conn.compute.get('/servers').json()['servers']:
    print(server)
```
Configuration
openstacksdk uses the `openstack.config` module to parse configuration. `openstack.config` will find cloud configuration for as few as one cloud and as many as you want to put in a config file. It will read environment variables and config files, and it also contains some vendor-specific default values so that you don't have to know extra info to use OpenStack:
- If you have a config file, you will get the clouds listed in it
- If you have environment variables, you will get a cloud named envvars
- If you have neither, you will get a cloud named defaults with base defaults
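The fallback order above can be sketched as a small decision function. This is an illustrative stand-in, not the SDK's actual loader logic (the real `openstack.config` loader merges these sources rather than picking just one):

```python
def config_source(has_clouds_yaml: bool, env: dict) -> str:
    """Illustrative sketch of the precedence described above: a config
    file wins, then OS_* environment variables, then built-in defaults.
    (The real openstack.config loader merges these sources.)
    """
    if has_clouds_yaml:
        return 'clouds.yaml'
    if any(key.startswith('OS_') for key in env):
        return 'envvars'
    return 'defaults'

print(config_source(True, {}))                          # clouds.yaml
print(config_source(False, {'OS_AUTH_URL': 'https://identity.example.com'}))  # envvars
print(config_source(False, {}))                         # defaults
```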
You can view the configuration identified by openstacksdk in your current environment by running the `openstack.config.loader` module (`python -m openstack.config.loader`).
More information at https://docs.openstack.org/openstacksdk/latest/user/config/configuration.html
Supported services
The following services are currently supported. A full list of all available OpenStack services can be found in the Project Navigator.
Note
Support here does not guarantee full-support for all APIs. It simply means some aspect of the project is supported.
| Service | Description | Cloud Layer | Proxy & Resource Layer |
|---|---|---|---|
| **Compute** | | | |
| Nova | Compute | ✔ | ✔ (`openstack.compute`) |
| **Hardware Lifecycle** | | | |
| Ironic | Bare metal provisioning | ✔ | ✔ (`openstack.baremetal`, `openstack.baremetal_introspection`) |
| Cyborg | Lifecycle management of accelerators | ✔ | ✔ (`openstack.accelerator`) |
| **Storage** | | | |
| Cinder | Block storage | ✔ | ✔ (`openstack.block_storage`) |
| Swift | Object store | ✔ | ✔ (`openstack.object_store`) |
| Manila | Shared filesystems | ✔ | ✔ (`openstack.shared_file_system`) |
| **Networking** | | | |
| Neutron | Networking | ✔ | ✔ (`openstack.network`) |
| Octavia | Load balancing | ✔ | ✔ (`openstack.load_balancer`) |
| Designate | DNS | ✔ | ✔ (`openstack.dns`) |
| **Shared services** | | | |
| Keystone | Identity | ✔ | ✔ (`openstack.identity`) |
| Placement | Placement | ✔ | ✔ (`openstack.placement`) |
| Glance | Image storage | ✔ | ✔ (`openstack.image`) |
| Barbican | Key management | ✔ | ✔ (`openstack.key_manager`) |
| **Workload provisioning** | | | |
| Magnum | Container orchestration engine provisioning | ✔ | ✔ (`openstack.container_infrastructure_management`) |
| **Orchestration** | | | |
| Heat | Orchestration | ✔ | ✔ (`openstack.orchestration`) |
| Senlin | Clustering | ✔ | ✔ (`openstack.clustering`) |
| Mistral | Workflow | ✔ | ✔ (`openstack.workflow`) |
| Zaqar | Messaging | ✔ | ✔ (`openstack.message`) |
| **Application lifecycle** | | | |
| Masakari | Instances high availability service | ✔ | ✔ (`openstack.instance_ha`) |