DEPRECATED, Monasca Transform and Aggregation Engine
Ashwin Agate 3feaf7400e Refresh monasca transform docs
* Refreshed monasca transform README.md

* Added new documentation on generic aggregation components

* Added new documentation on creating a new aggregation
  pipeline

* Added new documentation for internal data formats
  used by monasca transform namely record store data format
  and instance usage data format.

Change-Id: Id4cfb422ace2c59103c658e58e4ffb0d61a303f5
Story: 2001765
Task: 12167
2018-04-16 08:22:17 -07:00

README.md

Team and repository tags

Monasca Transform

monasca-transform is a data-driven aggregation engine that collects, groups, and aggregates existing individual Monasca metrics according to business requirements and publishes the new transformed (derived) metrics to the Monasca Kafka queue.

  • Since the new transformed metrics are published like any other metric in Monasca, alarms can be set and triggered on the transformed metrics.

  • Monasca Transform uses Apache Spark to aggregate data. Apache Spark is a highly scalable, fast, in-memory, fault-tolerant, and parallel data processing framework. All monasca-transform components are implemented in Python and use PySpark, Spark's Python API, to interact with Spark.

  • Monasca Transform transforms and aggregates incoming metrics in two phases.

    • In the first phase, a Spark Streaming application retrieves data from Kafka at a configurable stream interval (the default stream_interval is 10 minutes) and writes the data aggregated over that stream interval to the metrics_pre_hourly topic in Kafka.

    • In the second phase, which is kicked off every hour, all metrics in the metrics_pre_hourly topic in Kafka are aggregated again, this time over a larger interval of an hour. These hourly aggregated metrics are published to the metrics topic in Kafka.
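The two phases above can be illustrated with a minimal plain-Python sketch. It is not the monasca-transform implementation and does not use Spark or Kafka: the `aggregate` helper, the `requests.count` metric name, and the choice of summation as the aggregation are all assumptions made for illustration. It only shows how data aggregated per stream interval can be aggregated again over an hour.

```python
from collections import defaultdict

STREAM_INTERVAL_SECONDS = 600  # 10 minutes, the default stream interval
HOUR_SECONDS = 3600


def aggregate(records, interval_seconds):
    """Sum values per (metric name, interval start) bucket.

    Hypothetical helper: each record is a (name, timestamp, value) tuple.
    """
    totals = defaultdict(float)
    for name, timestamp, value in records:
        # Align the timestamp to the start of its interval.
        bucket_start = timestamp - (timestamp % interval_seconds)
        totals[(name, bucket_start)] += value
    return sorted((name, start, total) for (name, start), total in totals.items())


# Hypothetical raw metrics: (name, timestamp in seconds, value).
raw = [
    ("requests.count", 0, 1.0),
    ("requests.count", 300, 2.0),
    ("requests.count", 700, 3.0),
    ("requests.count", 3700, 4.0),
]

# Phase 1: aggregate per stream interval (stands in for metrics_pre_hourly).
pre_hourly = aggregate(raw, STREAM_INTERVAL_SECONDS)

# Phase 2: aggregate the phase 1 output again, this time per hour.
hourly = aggregate(pre_hourly, HOUR_SECONDS)
```

Re-aggregating the first-phase output works directly here because a sum of sums equals the overall sum; other aggregations, such as averages, would need extra bookkeeping (for example, carrying counts alongside totals).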

Use Cases handled by Monasca Transform

Please refer to the Problem Description section on the Monasca/Transform wiki.

Operation

Please refer to the How Monasca Transform Operates section on the Monasca/Transform wiki.

Architecture

Please refer to the Architecture and Logical processing data flow sections on the Monasca/Transform wiki.

To set up the development environment

monasca-transform uses DevStack as a common development environment. See the README.md in the devstack directory for details on how to include monasca-transform in a DevStack deployment.

Generic aggregation components

Monasca Transform uses a set of generic aggregation components that can be assembled into an aggregation pipeline.

Please refer to the generic aggregation components document for the list of generic aggregation components available.
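As a rough illustration of what assembling generic components into a pipeline can look like, here is a minimal plain-Python sketch. Every name in it (`keep_metric`, `sum_by`, `rename_to`, `build_pipeline`, and the `vm.cpu` metrics) is hypothetical and is not part of monasca-transform; the actual components and their configuration are described in the generic aggregation components document.

```python
from functools import reduce


def keep_metric(name):
    """Hypothetical component: keep only records for one metric name."""
    def component(records):
        return [r for r in records if r["name"] == name]
    return component


def sum_by(key):
    """Hypothetical component: sum record values grouped by a field."""
    def component(records):
        totals = {}
        for r in records:
            totals[r[key]] = totals.get(r[key], 0.0) + r["value"]
        return [{key: k, "value": v} for k, v in sorted(totals.items())]
    return component


def rename_to(new_name):
    """Hypothetical component: set the name of the derived metric."""
    def component(records):
        return [dict(r, name=new_name) for r in records]
    return component


def build_pipeline(*components):
    """Chain components so each one consumes the previous one's output."""
    def pipeline(records):
        return reduce(lambda data, component: component(data), components, records)
    return pipeline


pipeline = build_pipeline(
    keep_metric("vm.cpu"),
    sum_by("host"),
    rename_to("vm.cpu_agg"),
)

result = pipeline([
    {"name": "vm.cpu", "host": "a", "value": 1.0},
    {"name": "vm.cpu", "host": "a", "value": 2.0},
    {"name": "vm.cpu", "host": "b", "value": 5.0},
    {"name": "disk.io", "host": "a", "value": 9.0},
])
```

Because each component takes a list of records and returns a list of records, the same components can be reordered or reused in a different pipeline without modification.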

Create a new aggregation pipeline example

Generic aggregation components make it easy to build new aggregation pipelines for different Monasca metrics.

The "create a new aggregation pipeline" example shows how to create pre_transform_specs and transform_specs to build an aggregation pipeline for a new set of Monasca metrics, while leveraging the existing set of generic aggregation components.

Original proposal and blueprint

Original proposal: Monasca/Transform-proposal

Blueprint: monasca-transform blueprint