openstacksdk statsd records for API operations are not response code
specific. This change adds a glob to the statsd record path to include
all return code response metrics in our API graphs. I believe this
change may have come with the openstacksdk 0.103.0 update. We also
update paths for servers POST, server details, and flavor details as
they have changed.
Note the network info is empty which is why we don't get graphs for
them, but the paths appear correct. I think this may be beacuse we don't
need to query network info in any of our clouds currently.
While we are at it we stop updating the airship and inap cloud graphs
since those should be cleaned up and this keeps the review overhead
smaller.
Change-Id: I5a6b80118afaf3b7782a1d1c131787f208583799
I think I generally messed these up on the original import as every
stat seems to refer to the same thing. Over time, the layout of
openstacksdk stats has changed meaning this doesn't work at all now.
Use stats that are actually in graphite which should show the overall
health of API requests.
Change-Id: I6bd82b38d80db2b56a399f80132a723564f9bc40
grafyaml actually knows this is deprecated, but it's actually more
than that and doesn't work to refresh the variable at all now. "1"
means "on load" which is what we want.
Change-Id: I34ecdd30c2188cb7e6ec32e33c6a6e99b6240934
The minor updates are apparently due to us not having run the script
the last time it was updated with new urls.
Change-Id: I255d1e47b5cff29a3ed377b65ceab677ab1c272e
All of these dashboards are the same, and have mostly copied all the
same issues with them. This makes updating anything a massive pain.
This implements a single dashboard template with a small script to
create individual dashboards for each provider and its regions.
I have included a range of fixes. The y-axis format has changed in
later versions of grafana. The API time tracking is no longer scaled,
but we just tell grafana it is in ms and it displays it correctly.
The test nodes history graph is moved to the top, as it is probably
the most interesting graph (note this splits itself out per region, if
mulitple regions are selected). Values for "null as zero" are
consistently set. Various formatting fixes for the labels are
included.
Change-Id: I5fbffaec3c82aa1fce0947f771de67edd15f7dfc
These stats aren't updating any more. Unfortunately, I don't think
there's any current replacement as nodepool doesn't have any insight
into the job it is satisfying a request for.
Change-Id: Ib69fbda5ee019180cd8761d0ead474b426bce379
Since we now query a cloud for its quota information, lets track the
response rate in grafana.
Change-Id: Ie9e2727b5dc3d18f5e5fc37be89a9a5f9492eb47
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
Following the update to Zuul v3 some things changed:
- nodes.delete became nodes.deleting
- nodes.used became nodes.in-use but nodes.used is still relevant
as it's the status between 'in-use' and 'deleting'
- Add a panel for displaying failed nodes
Change-Id: I240d082115bd9078e45984d8fcff212a4e40e842
Depends-On: I6a89752d74ed7424267c3af3937ad01fb4bb8f86
Because the revert was from almost a year ago, the graph data
is incorrect. This patch brings it in sync with the other
graphs.
Change-Id: I967fa0747867ef7613b15e320826a1a2b604218f
First, thanks to vexxhost-ca-ymq-1 for providing resource for
openstack-infra. It was awesome!
Moving forward, it appears vexxhost-ca-ymq-1 could be better suited
for control plane server then nodepool resources. As such, remove
them from nodepool.
Change-Id: I31e3bae80a2cc3ca553be5ec1f1e0a82d6967186
Signed-off-by: Paul Belanger <pabelanger@redhat.com>
Now that nodepool has been switch to use shade, we need to update
grafana to use the new shade syntax for Server related tasks.
Change-Id: I7698d54d89bda5327ac434fd8e662f0fe58d7f5e
Signed-off-by: Paul Belanger <pabelanger@redhat.com>