
Overview
Percona XtraDB Cluster is a high availability and high scalability solution for MySQL clustering. Percona XtraDB Cluster integrates Percona Server with the Galera library of MySQL high availability solutions in a single product package which enables you to create a cost-effective MySQL cluster.
The percona-cluster charm deploys Percona XtraDB Cluster and provides DB services to those charms that support the 'mysql-shared' interface. The current list of such charms can be obtained from the Charm Store (the charms officially supported by the OpenStack Charms project are published by 'openstack-charmers').
Usage
Configuration
This section covers common configuration options. See file config.yaml for the full list of options, along with their descriptions and default values.
max-connections
The max-connections option sets the maximum number of allowed connections. The default is 600. This is an important option and is discussed in the Memory section below.
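For example, to raise the limit on an existing deployment (the value shown is purely illustrative and should be sized against the available memory):
juju config percona-cluster max-connections=2000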
min-cluster-size
The min-cluster-size option sets the number of percona-cluster units required to form its cluster. It is best practice to use this option as doing so ensures that the charm will wait until the cluster is up before accepting relations from other client applications.
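For example, to set it on an already-deployed three-unit application:
juju config percona-cluster min-cluster-size=3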
Deployment
To deploy a single percona-cluster unit:
juju deploy percona-cluster
To make use of DB services, simply add a relation between percona-cluster and an application that supports the 'mysql-shared' interface. For instance:
juju add-relation percona-cluster:shared-db keystone:shared-db
Passwords required for the correct operation of the deployment are automatically generated and stored by the application leader. The root password for mysql can be retrieved using the following command:
juju run --unit percona-cluster/0 leader-get root-password
Root user DB access is only usable from within one of the deployed units (access to root is restricted to localhost only).
High availability
When more than one unit is deployed with the hacluster application the charm will bring up an HA active/active cluster. The min-cluster-size option should be used (see description above).
To deploy a three-node cluster:
juju deploy -n 3 --config min-cluster-size=3 percona-cluster
There are two mutually exclusive high availability options: using virtual IP(s) or DNS. In both cases the hacluster subordinate charm is used to provide the Corosync and Pacemaker backend HA functionality.
See the OpenStack high availability appendix in the OpenStack Charms Deployment Guide for details.
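As a rough sketch of the virtual IP approach only (the VIP value, the alias percona-hacluster, and the ha endpoint names are assumptions; consult the deployment guide and the charm's config.yaml and metadata.yaml for the authoritative settings):
juju config percona-cluster vip=10.5.0.100
juju deploy hacluster percona-hacluster
juju add-relation percona-cluster:ha percona-hacluster:ha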
Actions
This section lists Juju actions supported by the charm.
Actions allow specific operations to be performed on a per-unit basis. To display action descriptions run juju actions percona-cluster. If the charm is not deployed then see file actions.yaml.
backup
bootstrap-pxc
complete-cluster-series-upgrade
mysqldump
notify-bootstrapped
pause
resume
set-pxc-strict-mode
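For example, to run the pause action on unit percona-cluster/0:
juju run-action --wait percona-cluster/0 pause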
Memory
Percona Cluster is extremely memory sensitive. Setting memory values too low will give poor performance. Setting them too high will create problems that are very difficult to diagnose. Please take time to evaluate these settings for each deployment environment rather than copying and pasting bundle configurations.
The Percona Cluster charm needs to be deployable in small, low-memory development environments as well as in high-performance production environments. The charm's opinionated configuration defaults favour the developer environment in order to ease initial testing. Production environments need to consider carefully the memory requirements for the hardware or cloud in use. Consult a MySQL memory calculator to understand the implications of the values.
Between the 5.5 and 5.6 releases a significant default was changed: the performance schema defaulted to on for 5.6 and later. This allocates all the memory that would be required to handle max-connections plus several other memory settings. With 5.5, memory was allocated during run-time as needed.
The charm now makes performance schema configurable and defaults to off (False). With the performance schema turned off memory is allocated when needed during run-time. It is important to understand this can lead to run-time memory exhaustion if the configuration values are set too high. Consult a MySQL memory calculator to understand the implications of the values.
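For example, assuming the option is named performance-schema (verify the exact name and values in config.yaml), it could be enabled with:
juju config percona-cluster performance-schema=true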
The value of max-connections should strike a balance between connection exhaustion and memory exhaustion. Occasionally connection exhaustion occurs in large production HA clouds with a value of less than 2000. The common practice became to set it unrealistically high (near 10k or 20k). In the move to 5.6 on Xenial this became a problem as Percona would fail to start up or behave erratically as memory exhaustion occurred on the host due to the performance schema being turned on. Even with the default now turned off, this value should be carefully considered against the production requirements and resources available.
MySQL asynchronous replication
This charm supports the MySQL asynchronous replication feature, which can be used to replicate databases between multiple Percona XtraDB Clusters. In order to set up master-slave replication of the "database1" and "database2" databases between the "pxc1" and "pxc2" applications, first configure the mandatory options:
juju config pxc1 databases-to-replicate="database1:table1,table2;database2"
juju config pxc2 databases-to-replicate="database1:table1,table2;database2"
juju config pxc1 cluster-id=1
juju config pxc2 cluster-id=2
and then relate them:
juju add-relation pxc1:master pxc2:slave
In order to set up master-master replication, add another relation:
juju add-relation pxc2:master pxc1:slave
In the same way, circular replication can be set up between multiple clusters.
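For example, a three-way ring between applications pxc1, pxc2 and a hypothetical third application pxc3 (pxc3 would need the same databases-to-replicate value and a unique cluster-id) could be formed with:
juju add-relation pxc1:master pxc2:slave
juju add-relation pxc2:master pxc3:slave
juju add-relation pxc3:master pxc1:slave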
Network Space support
This charm supports the use of Juju Network Spaces, allowing the charm to be bound to network space configurations managed directly by Juju. This is only supported with Juju 2.0 and above.
You can ensure that database connections and cluster peer communication are bound to specific network spaces by binding the appropriate interfaces:
juju deploy percona-cluster --bind "shared-db=internal-space cluster=internal-space"
Alternatively, configuration can be provided as part of a bundle:
    percona-cluster:
      charm: cs:xenial/percona-cluster
      num_units: 1
      bindings:
        shared-db: internal-space
        cluster: internal-space
The 'cluster' endpoint binding is used to determine which network space units within the percona-cluster deployment should use for communication with each other; the 'shared-db' endpoint binding is used to determine which network space should be used for access to MySQL databases services from other charms.
Note: Spaces must be configured in the underlying provider prior to attempting to use them.
Note: Existing deployments using the access-network configuration option will continue to function; this option is preferred over any network space binding provided for the 'shared-db' relation if set.
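For example, to point an existing deployment at a dedicated access network (the CIDR shown is illustrative only):
juju config percona-cluster access-network=10.0.0.0/24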
Limitations
Note that Percona XtraDB Cluster is not a 'scale-out' MySQL solution; reads and writes are channelled through a single service unit and synchronously replicated to other nodes in the cluster; reads/writes are as slow as the slowest node you have in your deployment.
Series Upgrade
Procedure
- Take a backup of all the databases:
  juju run-action mysql/N backup
- Get that backup off the mysql/N unit and somewhere safe:
  juju scp -- -r mysql/N:/opt/backups/mysql /path/to/local/backup/dir
- Pause all non-leader units and corresponding hacluster units. The leader node will remain up for the time being. This is to ensure the leader has the latest sequence number and will be considered the most up to date by the cluster.
  juju run-action hacluster/N pause
  juju run-action percona-cluster/N pause
- Prepare the leader node:
  juju upgrade-series $MACHINE_NUMBER prepare $SERIES
- Administratively perform the upgrade.
  - do-release-upgrade plus any further administratively required steps for the upgrade.
- Reboot
- Complete the series upgrade on the leader:
  juju upgrade-series $MACHINE_NUMBER complete
- Administratively validate that the leader node database is up and running (see the example after this procedure).
  - Connect to the database and check for expected data
  - Review "SHOW GLOBAL STATUS;"
- Upgrade the non-leader nodes one at a time following the same pattern summarized below:
  - juju upgrade-series $MACHINE_NUMBER prepare $SERIES
  - Administratively upgrade
  - Reboot
  - juju upgrade-series $MACHINE_NUMBER complete
  - Validate
- Finalize the upgrade by running the complete-cluster-series-upgrade action on the leader node. This action informs each node in the cluster that the upgrade process is complete cluster wide. It also updates the mysql configuration with all peers in the cluster.
  juju run-action mysql/N complete-cluster-series-upgrade
- Set future instances to the new series and set the source origin:
  juju set-series percona-cluster xenial
  juju config mysql source=distro
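As an illustration of the validation step above, a minimal check could be run from within a unit, since root access is restricted to localhost (the unit name is illustrative; the root password can be retrieved with leader-get as shown earlier):
juju ssh percona-cluster/0
mysql -u root -p -e "SHOW GLOBAL STATUS;"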
Upstream documentation
- Upgrading Percona XtraDB Cluster
- Percona XtraDB Cluster In-Place Upgrading Guide: From 5.5 to 5.6
- Galera replication - how to recover a PXC cluster
Cold Boot
In the event of an unexpected power outage and cold boot, the cluster will be unable to reestablish itself without manual intervention.
The cluster will be in scenario 3 or 6 from the upstream Percona Cluster documentation. Please read the upstream documentation as it provides context to the steps outlined here. In either scenario, it is necessary to choose a unit to become the bootstrap node.
Determine the node with the highest sequence number
This information can be found in the /var/lib/percona-xtradb-cluster/grastate.dat file. The charm will also display this information in the output of juju status.
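For example, to inspect the file directly on a unit (the unit number is shown for illustration):
juju run --unit percona-cluster/0 'cat /var/lib/percona-xtradb-cluster/grastate.dat'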
Example juju status after a cold boot of percona-cluster:
Unit Workload Agent Machine Public address Ports Message
keystone/0* active idle 0 10.5.0.32 5000/tcp Unit is ready
percona-cluster/0 blocked idle 1 10.5.0.20 3306/tcp MySQL is down. Sequence Number: 355. Safe To Bootstrap: 0
hacluster/0 active idle 10.5.0.20 Unit is ready and clustered
percona-cluster/1 blocked idle 2 10.5.0.17 3306/tcp MySQL is down. Sequence Number: 355. Safe To Bootstrap: 0
hacluster/1 active idle 10.5.0.17 Unit is ready and clustered
percona-cluster/2* blocked idle 3 10.5.0.27 3306/tcp MySQL is down. Sequence Number: 355. Safe To Bootstrap: 0
hacluster/2* active idle 10.5.0.27 Unit is ready and clustered
Note: An application leader is denoted by an asterisk in the Unit column.
In the above example all the sequence numbers match. This means we can bootstrap from any unit we choose.
In the next example the percona-cluster/2 node has the highest sequence number so we must choose that node to avoid data loss.
Unit Workload Agent Machine Public address Ports Message
keystone/0* active idle 0 10.5.0.32 5000/tcp Unit is ready
percona-cluster/0* blocked idle 1 10.5.0.20 3306/tcp MySQL is down. Sequence Number: 1318. Safe To Bootstrap: 0
hacluster/0* active idle 10.5.0.20 Unit is ready and clustered
percona-cluster/1 blocked idle 2 10.5.0.17 3306/tcp MySQL is down. Sequence Number: 1318. Safe To Bootstrap: 0
hacluster/1 active idle 10.5.0.17 Unit is ready and clustered
percona-cluster/2 blocked idle 3 10.5.0.27 3306/tcp MySQL is down. Sequence Number: 1325. Safe To Bootstrap: 0
hacluster/2 active idle 10.5.0.27 Unit is ready and clustered
Bootstrap the node with the highest sequence number
Run the bootstrap-pxc action on the node with the highest sequence number. In this example, it is unit percona-cluster/2, which happens to be a non-leader.
juju run-action --wait percona-cluster/2 bootstrap-pxc
Notify the cluster of the new bootstrap UUID
In the vast majority of cases, once the bootstrap-pxc action has been run and the model has settled, the output of the juju status command will look like this:
Unit Workload Agent Machine Public address Ports Message
keystone/0* active idle 0 10.5.0.32 5000/tcp Unit is ready
percona-cluster/0* waiting idle 1 10.5.0.20 3306/tcp Unit waiting for cluster bootstrap
hacluster/0* active idle 10.5.0.20 Unit is ready and clustered
percona-cluster/1 waiting idle 2 10.5.0.17 3306/tcp Unit waiting for cluster bootstrap
hacluster/1 active idle 10.5.0.17 Unit is ready and clustered
percona-cluster/2 waiting idle 3 10.5.0.27 3306/tcp Unit waiting for cluster bootstrap
hacluster/2 active idle 10.5.0.27 Unit is ready and clustered
If you observe the above output ("Unit waiting for cluster bootstrap") then the notify-bootstrapped action needs to be run on a unit. There are two possibilities:
- If the bootstrap-pxc action was run on a leader then run notify-bootstrapped on a non-leader.
- If the bootstrap-pxc action was run on a non-leader then run notify-bootstrapped on the leader.
In the current example, the first action was run on a non-leader so we'll run the second action on the leader, percona-cluster/0:
juju run-action percona-cluster/0 notify-bootstrapped --wait
After the model settles, the output should show all nodes in active and ready state:
Unit Workload Agent Machine Public address Ports Message
keystone/0* active idle 0 10.5.0.32 5000/tcp Unit is ready
percona-cluster/0* active idle 1 10.5.0.20 3306/tcp Unit is ready
hacluster/0* active idle 10.5.0.20 Unit is ready and clustered
percona-cluster/1 active idle 2 10.5.0.17 3306/tcp Unit is ready
hacluster/1 active idle 10.5.0.17 Unit is ready and clustered
percona-cluster/2 active idle 3 10.5.0.27 3306/tcp Unit is ready
hacluster/2 active idle 10.5.0.27 Unit is ready and clustered
The percona-cluster application is now back to a clustered and healthy state.
Bugs
Please report bugs on Launchpad.
For general charm questions refer to the OpenStack Charm Guide.