grenade/PLUGINS.rst
Dan Smith a3874c58d4 Indicate to plugin resource verification where we are
This makes the resource verify commands take a second argument that
indicates if we are pre-upgrade or post-upgrade. This may be needed
in order to validate things that will not be true before, but must
be true after an upgrade.

Change-Id: I6383c5f641bb8eb7fee6f1fbd41bca3a281d9636
2016-08-12 12:46:15 -07:00

243 lines
7.6 KiB
ReStructuredText

==============================
Modular Grenade Architecture
==============================
Grenade was originally created to demonstrate some level of upgrade
capacity for OpenStack projects. Originally this just included a small
number of services.
Proposed new basic flow:
- setup_grenade
- all the magic setup involved around err traps and filehandle redirects
- setup devstack trees
- setup_base
- run stack.sh to build the correct base environment
- verify_base
- for project in projects; do verify_project; done
- resources.sh create
- resources.sh verify pre-upgrade
- shutdown
- for project in projects; do shutdown; done
- snapshot.sh pre_upgrade (NOT YET IMPLEMENTED)
- resources.sh verify_noapi pre-upgrade
- upgrade ...
- resources.sh verify post-upgrade
- verify_target
- resources.sh destroy
Modular Components
==================
Assuming the following tree in target projects::
devstack/ - devstack plugin directory
upgrade/ - upgrade scripts
settings - adds settings for the upgrade path
upgrade.sh
snapshot.sh - snapshots the state of the service, typically a
database dump (NOT YET IMPLEMENTED)
from-juno/ - per release
within-juno/
from-kilo/
within-kilo/
resources.sh
This same modular structure exists in the grenade tree with::
grenade/
projects/
10_ceilometer/
settings
upgrade.sh
resources.sh
=================
resources.sh is a per-service resource create / verify / destroy
interface. What a service does inside a script is up to them.
You can assume your resource script will only be called if your
service is running in an upgrade environment. The script should return
zero on success for actions, and nonzero on failure.
Calling Interface
-----------------
The following is the supported calling interface
- resources.sh early_create
creates a set of sample resources that should survive very early in
the upgrade process. This should only be used for horizontal
resources that impact other services, that *have* to be available
before they do any of their setup. For instance setup of neutron
networks.
Do not use the phase unless you really know why ``create`` will not
work for you.
- resources.sh create
creates a set of sample resources that should survive
upgrade. Script should exit with a nonzero exit code if any
resources could not be created.
Example: create an instance in nova or a volume in cinder
- resources.sh verify (pre-upgrade|post-upgrade)
verify that the resources were created. Services are running at this
point, and the APIs may be expected to work. The second argument
indicates whether we are pre-upgrade or post-upgrade.
Example: use the nova command to verify that the test instance is
still ACTIVE, or the cinder command to verify that the volume is
still available.
- resources.sh verify_noapi
verify that the resources are still present. This is called in the
phase where services are stopped, and APIs are expected to not be
accessible. Resource verification at this phase my require probing
underlying components to make sure nothing has gone awry during
service shutdown. The second argument indicates whether we are
pre-upgrade or post-upgrade.
Example: check with libvirt to make sure the instance is actually
created and running. Bonus points for being able to ping the
instance, or otherwise check its live-ness. With cinder, checking
that the LVM volume exists and looks reasonable.
- resources.sh destroy
Resource scripts should be responsible and cleanup all their
resources when asked to destroy.
Calling Sequence
----------------
The calling sequence during a grenade run looks as follows:
- # start old side
- create (create will be called during the working old side)
- verify pre-upgrade
- # shutdown all services
- verify_noapi pre-upgrade
- # upgrade and start all services
- verify post-upgrade
- destroy
The important thing to remember is verify/verify_noapi will be called
multiple times, with multiple different versions of OpenStack. Those
phases of the script must not be rerunnable multiple times.
While create / destroy are only going to be called once in the current
interface, bonus points for also making those idempotent for
resiliancy in testing.
Supporting Methods
------------------
In order to assist with the checks listed the following functions
exist::
resource_save project key value
resource_get project key
This allow resource scripts to have memory, and keep track of things
like the allocated IP addresses, IDs, and other non deterministic data
that is returned from OpenStack API calls.
Environment
-----------
Resource scripts get called in a specific environment already set:
- TOP_DIR - will be set to the root of the devstack directory for the
BASE version of devstack incase this is needed to find files like a
working ``openrc``
- GRENADE_DIR - the root directory of the grenade directory.
The following snippet will give you access to both the grenade and
TARGET devstack functions::
source $GRENADE_DIR/grenaderc
source $GRENADE_DIR/functions
Best Practices
--------------
Do as many actions as non admin as possible. As early as you can in
your resource script it's worth allocating a user/project for the
script to run as. This ensures isolation against other scripts, and
ensures that actions don't only work because admin gets to bypass
safeties.
Test side effects, not just API actions. The point of these resource
survival scripts is to test that things created beyond the API / DB
interaction still work later. Just testing that data can be stored /
retrieved from the database isn't very interesting, and should be
covered other places. The value in the resource scripts is these side
effects. Actual VMs running, actual iscsi targets running, etc. And
ensuring these things are not disrupted when the control plane is
shifted out from under them.
Out of Tree Plugins
===================
A grenade plugin can be hosted out of tree in a project tree, similar
to external devstack plugins. There are a few subtle differences when
this happens.
The plugin structure will live under ``$project/devstack/upgrade/``
directory.
The plugin is enabled by adding::
enable_grenade_plugin <$project> <giturl> [branch]
To ``pluginrc`` in the ``GRENADE_DIR``. An additional rc file was
required due to sequencing of when plugin functions become available.
Note: For using this with ``openstack-infra/devstack-gate`` you would set this
by setting the GRENADE_PLUGINRC environment variable. For example to set it for
Ironic::
export GRENADE_PLUGINRC="enable_grenade_plugin ironic https://git.openstack.org/openstack/ironic"
Changing Devstack Localrc
-------------------------
There is also a mechanism that allows a ``settings`` file change the
devstack localrc files with the ``devstack_localrc`` function.
::
devstack_localrc <base|target> arbitrary stuff to add
Which will take all the rest of the stuff on that line and add it to
the localrc for either the base or target devstack.
Example settings
----------------
The following is a reasonable example ``settings`` for out of tree
plugin::
register_project_for_upgrade heat
register_db_to_save heat
devstack_localrc base enable_service h-api h-api-cfn h-api-cw h-eng heat
devstack_localrc target enable_service h-api h-api-cfn h-api-cw h-eng heat
This registers the project for upgrade, symbolicly enables the heat
database for dump during upgrade, and adds the heat services into the
service list for base and target.
It's expected that most ``settings`` files for out of tree plugins
will need equivalent lines.