Tendermint: use an order-preserving, scan-friendly database key encoding

Created on 14 Mar 2020 · 3 comments · Source: tendermint/tendermint

The BlockStore currently uses this encoding for e.g. block metadata keys:

[]byte(fmt.Sprintf("H:%v", height))

This sorts keys lexicographically rather than numerically, so block 10 is ordered between blocks 1 and 100 rather than between 9 and 11. This makes efficient range scans impossible: to prune all blocks between 0 and 1000000, for example, we have to test explicitly for the existence of every single height rather than simply scanning the keys that actually exist. Since this does not scale, one must resort to workarounds such as short-circuiting a reverse iteration on the first missing key, which is not robust.
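The ordering problem can be reproduced outside the BlockStore. The sketch below is only an illustration: `stringKey` and `sortedStringKeys` are hypothetical helpers, not Tendermint functions, and `sort.Strings` stands in for the byte-wise key order that an ordered key-value store would use when iterating.

```go
package main

import (
	"fmt"
	"sort"
)

// stringKey mirrors the encoding quoted in the issue (hypothetical helper).
func stringKey(height int64) string {
	return fmt.Sprintf("H:%v", height)
}

// sortedStringKeys returns the keys for the given heights in byte-wise
// order, standing in for the iteration order of an ordered key-value store.
func sortedStringKeys(heights []int64) []string {
	keys := make([]string, 0, len(heights))
	for _, h := range heights {
		keys = append(keys, stringKey(h))
	}
	sort.Strings(keys)
	return keys
}

func main() {
	// Prints [H:1 H:10 H:100 H:11 H:9]: heights 10 and 100 sort before 9.
	fmt.Println(sortedStringKeys([]int64{1, 9, 10, 11, 100}))
}
```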

The encoding should instead use the big-endian binary representation of the number, or some other order-preserving encoding.
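A minimal sketch of what such a key could look like, assuming a fixed 8-byte width and non-negative heights. The `heightKey` and `heightFromKey` helpers and the reuse of the `H:` prefix are illustrative assumptions, not the encoding Tendermint adopted. The fixed width matters: big-endian bytes only compare in numeric order when every key has the same length.

```go
package main

import (
	"bytes"
	"encoding/binary"
	"fmt"
)

// heightKey builds a key from the "H:" prefix followed by the height as an
// 8-byte big-endian integer (hypothetical helper, assumes height >= 0).
func heightKey(height int64) []byte {
	key := make([]byte, 2+8)
	copy(key, "H:")
	binary.BigEndian.PutUint64(key[2:], uint64(height))
	return key
}

// heightFromKey decodes the height from a key produced by heightKey.
func heightFromKey(key []byte) int64 {
	return int64(binary.BigEndian.Uint64(key[2:]))
}

func main() {
	heights := []int64{1, 9, 10, 11, 100, 1000000}
	for i := 1; i < len(heights); i++ {
		prev, cur := heightKey(heights[i-1]), heightKey(heights[i])
		// Byte-wise comparison agrees with numeric comparison for every pair.
		fmt.Println(heights[i-1], "<", heights[i], "->", bytes.Compare(prev, cur) < 0)
	}
}
```

With keys like these, pruning a height range becomes a single iterator scan from `heightKey(from)` to `heightKey(to)` over the keys that actually exist, at the cost of a breaking change to the on-disk format.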

breaking encoding jank perf

erikgrinaker

❤2 🎉2

All 3 comments

This applies to the state database as well.

erikgrinaker on 24 Mar 2020

I believe this applies to the evidence database as well.

cmwaters on 3 Apr 2020

👍2

SQLite uses a varint encoding that preserves order: https://sqlite.org/src4/doc/trunk/www/varint.wiki

It's unclear if this is the same encoding which is used e.g. by the Go binary package: https://golang.org/pkg/encoding/binary/

But if that package preserves ordering as well, we should use varints.

erikgrinaker on 22 Apr 2020

👍1
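The question about Go's `encoding/binary` package can be checked directly. That package's varints store the least-significant 7-bit group first, so their byte-wise order does not follow numeric order, unlike the SQLite encoding linked above. The sketch below is only a quick check; `uvarintKey` is a hypothetical helper.

```go
package main

import (
	"bytes"
	"encoding/binary"
	"fmt"
)

// uvarintKey encodes n with Go's standard unsigned varint encoding
// (hypothetical helper wrapping binary.PutUvarint).
func uvarintKey(n uint64) []byte {
	buf := make([]byte, binary.MaxVarintLen64)
	return buf[:binary.PutUvarint(buf, n)]
}

func main() {
	a, b := uvarintKey(255), uvarintKey(256)
	fmt.Printf("255 -> % x\n", a) // prints "255 -> ff 01"
	fmt.Printf("256 -> % x\n", b) // prints "256 -> 80 02"
	// Prints false: the key for 255 sorts after the key for 256.
	fmt.Println(bytes.Compare(a, b) < 0)
}
```

So `binary.PutUvarint` output is not suitable as an ordered key on its own. A fixed-width big-endian integer or a purpose-built order-preserving varint would be needed instead.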
