Cosmos-sdk: Integrate Promotheus with gaiad and gaiacli

Created on 27 Jun 2018 · 7Comments · Source: cosmos/cosmos-sdk

@greg-szabo

Edit (2018-07-01):

It would be nice if the sdk and the hub ship their own metrics to promotheus.

Source

adrianbrink

Most helpful comment

Generally anything that requires state to be kept is not a good candidate for prometheus and as pointed out any quantitive information e.g. number of operations, errors, latencies and beyond that dimensional breakdown with tags e.g. operation type, endpoint, error type. There is some information which fits well into gauges which could value beyond operational insight. An interesting exercise would be to actually compile a list of potential metrics and see if they would work with the prometheus modle and if it is feasible to track.

xla on 5 Jul 2018

👍3

All 7 comments

AFAIK this just means upgrade for the new tendermint with prometheus support. though maybe we want to include metrics at the SDK level too

ebuchman on 30 Jun 2018

👍1

I'm a bit confused. Are we simply stating the we need to upgrade the version of Tendermint in the SDK or do we also want to expose additional separate SDK metrics (e.g. total gets, total puts, request metrics, validator stats, etc...)?

alexanderbez on 3 Jul 2018

I think the latter couldn't hurt.

alexanderbez on 3 Jul 2018

I think the request is to update to the latest tendermint.

But I'm also not sure prometheus is the correct tool for tracking info about the state machine. It would have to persist data and stay synced with the blockchain properly. More likely we should keep it focused on information about the running process, rather than getting involved with the SDK state machine. Though it could be used to track reads/writes to the underlying db and maybe latency spent in AVL store access. @xla does that sound right?

Getting metrics on the db/avl access sounds pretty useful, so let's leave this open for that.

ebuchman on 5 Jul 2018

Ok cool, so I'll boil this down to:

Update to latest TM
Expose Prom metrics on DB/IAVL+ ops

Correct?

alexanderbez on 5 Jul 2018

👍1

xla on 5 Jul 2018

👍3

Can we close this @ebuchman? Seems like we want to create a ticket for compiling a list of potential metrics (most likely gauges). Doesn't seem super high priority atm.

alexanderbez on 16 Jul 2018

👍1

Was this page helpful?

0 / 5 - 0 ratings