Monasca Database Persister

Go to file

Craig Bryant a7112fd30b Increase Persister Performance The main improvement comes from using the Influxdb Line Protocol. The encoding methods in line_utils.py are like the ones used in the influxdb client but optimized for our data Additional improvement comes from avoiding calls to encode('utf8') as the influxdb client already does that. On my test system, these changes increased the number of measurements processed from about 2200/second to about 3700/second. Measurement processing time is now dominated by Kafka. Approximately, 35% of time is spent reading from Kafka and approximately 22% of time is committing offsets. Only 10% of the time is spent writing to Influxdb. About 30% of the time is spent converting messages from the json string read from Kafka into the Line Protocol format for Influxdb. Once monasca-common is modified to use the faster kafka library, performance should be even better. I did try using ujson, but my tests showed it wasn't any faster than the json package. Change-Id: I2acf76d9a5f583c74a272e18350b9c0ad5883f95		2017-06-22 14:00:41 -06:00
common	Ensure the same branch is used for common build	2016-02-10 15:30:38 -07:00
etc/monasca	Granular logging control	2017-01-09 05:35:51 +00:00
java	Change Java version to 1.2.0	2016-12-15 16:59:28 +00:00
monasca_persister	Increase Persister Performance	2017-06-22 14:00:41 -06:00
tools	Sync tools/tox_install.sh	2016-08-30 20:17:06 +02:00
.gitignore	Add persister.py unit tests	2017-01-12 09:18:29 +00:00
.gitreview	Update .gitreview for new namespace	2015-10-17 22:30:59 +00:00
.testr.conf	Add persister.py unit tests	2017-01-12 09:18:29 +00:00
LICENSE	Added copyright header and LICENSE file.	2014-05-01 12:45:08 -06:00
pom.xml	Change Java version to 1.2.0	2016-12-15 16:59:28 +00:00
README.md	Show team and repo badges on README	2016-11-25 18:22:59 +01:00
requirements.txt	Updated from global requirements	2017-01-24 20:37:00 +00:00
run_maven.sh	Ensure the same branch is used for common build	2016-02-10 15:30:38 -07:00
setup.cfg	Add persister.py unit tests	2017-01-12 09:18:29 +00:00
setup.py	Updated from global requirements	2016-08-31 18:25:52 +00:00
test-requirements.txt	Updated from global requirements	2017-01-12 15:06:23 +00:00
tox.ini	Add persister.py unit tests	2017-01-12 09:18:29 +00:00

README.md

Team and repository tags

monasca-persister

The Monitoring Persister consumes metrics and alarm state transitions from the Message Queue and stores them in the Metrics and Alarms database.

Although the Persister isn't primarily a Web service it uses DropWizard, https://dropwizard.github.io/dropwizard/, which provides a nice Web application framework to expose an http endpoint that provides an interface through which metrics about the Persister can be queried as well as health status.

The basic design of the Persister is to have one Kafka consumer publish to a Disruptor, https://github.com/LMAX-Exchange/disruptor, that has output processors. The output processors use prepared batch statements to write to the Metrics and Alarms database.

The number of output processors/threads in the Persister can be specified to scale to more messages. To horizontally scale and provide fault-tolerance any number of Persisters can be started as consumers from the Message Queue.

Build

Requires monasca-common from https://github.com/openstack/monasca-common. Download and build following instructions in its README.md. Then build monasca-persister by:

mvn clean package

Configuration

A sample configuration file is available in java/src/deb/etc/persister-config.yml-sample.

A second configuration file is provided in java/src/main/resources/persister-config.yml for use with the vagrant "mini-mon" development environment.

TODO

Purge metrics on shutdown
Add more robust offset management in Kafka. Currently, the offset is advanced as each message is read. If the Persister stops after the metric has been read and prior to it being committed to the Metrics and Alarms database, the metric will be lost.
Add better handling of SQL exceptions.
Complete health check.
Specify and document the names of the metrics that are available for monitoring of the Persister.
Document the yaml configuration parameters.

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.