Project Architecture

Gnocchi consists of several services: an HTTP REST API (see rest), an optional statsd-compatible daemon (see statsd), and an asynchronous processing daemon. Data is received via the HTTP REST API and the statsd daemon. The asynchronous processing daemon, called gnocchi-metricd, performs operations (statistics computing, metric cleanup, etc.) on the received data in the background.

Both the HTTP REST API and the asynchronous processing daemon are stateless and scalable: additional workers can be added depending on the load.
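
As an illustration of this flow, a client can push measures through the REST API and let gnocchi-metricd aggregate them later in the background. The snippet below is only a minimal sketch using the Python requests library: the endpoint URL, token and metric UUID are placeholder assumptions, and the authoritative API description is in rest.

import requests

# Placeholder deployment values: adapt the endpoint, token and metric UUID.
GNOCCHI_ENDPOINT = "http://localhost:8041"
METRIC_ID = "aaaaaaaa-bbbb-cccc-dddd-eeeeeeeeeeee"
HEADERS = {"X-Auth-Token": "<token>"}

# Push raw measures; the API simply stores them for later processing.
measures = [
    {"timestamp": "2016-05-17T09:00:00", "value": 42.0},
    {"timestamp": "2016-05-17T09:01:00", "value": 43.5},
]
requests.post("%s/v1/metric/%s/measures" % (GNOCCHI_ENDPOINT, METRIC_ID),
              headers=HEADERS, json=measures)

# gnocchi-metricd then picks up the new measures asynchronously and computes
# the aggregates defined by the metric's archive policy.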

Back-ends

Gnocchi uses two different back-ends for storing data: one for storing the time series (the storage driver) and one for indexing the data (the index driver).

The storage is responsible for storing measures of created metrics. It receives timestamps and values, and pre-computes aggregations according to the defined archive policies.

The indexer is responsible for storing the index of all resources, along with their types and properties. Gnocchi only knows about resource types from the OpenStack project, but also provides a generic type so you can create basic resources and handle the resource properties yourself. The indexer is also responsible for linking resources with metrics.

How to choose back-ends

Gnocchi currently offers three storage drivers:

  • File
  • Swift
  • Ceph (preferred)

The drivers are based on an intermediate library, named Carbonara, which handles the time series manipulation, since none of these storage technologies handle time series natively.

The three Carbonara-based drivers work well and are as scalable as their back-end technology permits. Ceph and Swift are inherently more scalable than the file driver.

Depending on the size of your architecture, using the file driver and storing your data on a disk might be enough. If you need to scale the number of servers with the file driver, you can export and share the data via NFS among all Gnocchi processes. In any case, the Ceph and Swift drivers are far more scalable. Ceph also offers better consistency, and is hence the recommended driver.

How to plan for Gnocchi's storage

Gnocchi uses a custom file format based on its library Carbonara. In Gnocchi, a time series is a collection of points, where a point is a given measure, or sample, in the lifespan of a time series. The storage format is compressed using various techniques, therefore the size of a time series can be estimated, in the worst-case scenario, with the following formula:

number of points × 9 bytes = size in bytes

The number of points you want to keep is usually determined by the following formula:

number of points = timespan ÷ granularity

For example, if you want to keep a year of data with a one minute resolution:

number of points = (365 days × 24 hours × 60 minutes) ÷ 1 minute
number of points = 525 600

Then:

size in bytes = 525 600 × 9 = 4 730 400 bytes = 4 620 KiB

This is just for a single aggregated time series. If your archive policy uses the 8 default aggregation methods (mean, min, max, sum, std, median, count, 95pct) with the same "one year, one minute aggregations" resolution, the space used will go up to a maximum of 8 × 4.5 MiB = 36 MiB.
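
The computation above can be reproduced with a small helper. This is only the worst-case formula of this section written in Python, not an exact accounting of the on-disk format:

WORST_CASE_BYTES_PER_POINT = 9

def worst_case_size(timespan_seconds, granularity_seconds,
                    aggregation_methods=1):
    # Worst-case storage estimate, in bytes, for a single metric.
    points = timespan_seconds // granularity_seconds
    return points * WORST_CASE_BYTES_PER_POINT * aggregation_methods

one_year = 365 * 24 * 60 * 60
print(worst_case_size(one_year, 60))     # 4730400 bytes, i.e. ~4620 KiB
print(worst_case_size(one_year, 60, 8))  # ~36 MiB with the 8 default methods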

How to set the archive policy and granularity

In Gnocchi, the archive policy is expressed in number of points. If your archive policy defines a policy of 10 points with a granularity of 1 second, the time series archive will keep up to 10 points, each representing an aggregation over 1 second. This means the time series will retain at most 10 seconds of data (sometimes a bit more) between the most recent point and the oldest one. That does not mean it will be 10 consecutive seconds: there might be a gap if data is fed irregularly.

There is no expiry of data relative to the current timestamp. Also, you cannot delete old data points (at least for now).

Therefore, both the archive policy and the granularity depend entirely on your use case. Depending on the usage of your data, you can define several archive policies. A typical low-grained use case could be:

3600 points with a granularity of 1 second = 1 hour
1440 points with a granularity of 1 minute = 24 hours
720 points with a granularity of 1 hour = 30 days
365 points with a granularity of 1 day = 1 year

This would represent 6125 points × 9 bytes ≈ 54 KiB per aggregation method. If you use the 8 standard aggregation methods, your metric will take up to 8 × 54 KiB = 432 KiB of disk space.
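
The same worst-case estimate applies to a multi-definition policy by summing the points kept by each definition, as in this short Python sketch reusing the numbers above:

# Number of points kept by each definition of the example policy above.
points = 3600 + 1440 + 720 + 365   # 6125 points
per_aggregate = points * 9         # 55125 bytes, i.e. ~54 KiB
total = per_aggregate * 8          # 441000 bytes, i.e. 8 × 54 KiB ≈ 432 KiB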

Default archive policies

By default, 3 archive policies are created, using the default aggregation methods (listed in default_aggregation_methods, i.e. mean, min, max, sum, std, median, count, 95pct):

  • low (maximum estimated size per metric: 5 KiB)
    • 5 minutes granularity over 1 hour
    • 1 hour granularity over 1 day
    • 1 day granularity over 1 month
  • medium (maximum estimated size per metric: 139 KiB)
    • 1 minute granularity over 1 day
    • 1 hour granularity over 1 week
    • 1 day granularity over 1 year
  • high (maximum estimated size per metric: 1 578 KiB)
    • 1 second granularity over 1 hour
    • 1 minute granularity over 1 week
    • 1 hour granularity over 1 year
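
These estimated sizes follow the same worst-case formula as above. For example, the high policy keeps 3600 + 10080 + 8760 = 22 440 points, so:

22 440 points × 9 bytes × 8 aggregation methods = 1 615 680 bytes ≈ 1 578 KiB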