Tendermint: p2p/conn: large (or a lot of) outgoing messages from other reactors can block consensus reactor from making progress

Created on 21 Dec 2020 · 3Comments · Source: tendermint/tendermint

See https://github.com/tendermint/tendermint/issues/5796 and https://github.com/tendermint/tendermint/issues/3920#issuecomment-748963728

What happened

Large messages (or many messages) produced by other reactors and scheduled to be send can temporarily block consensus reactor from making progress.

Why do you think it happens?

Either libs/flow library does not perform as expected or p2p/conn/connection scheduling logic is invalid.

What did you expect?

No halting. Tendermint top priority is exchanging votes and making blocks with a few transactions + evidence (if any). Consensus reactor messages should have a top priority, while other (e.g. mempool gossip, evidence) should have a lower priority.

p2p bug

Source

melekes

Most helpful comment

Ah, got it -- you're right, different issue. New P2P stack will handle this as well, by having separate outbound queues per peer with some scheduling policy, and dropping messages if a peer can't keep up to avoid blocking reactors.

erikgrinaker on 21 Dec 2020

👍2

All 3 comments

This is basically #2888. The new P2P stack will have separate queues per reactor channel.

erikgrinaker on 21 Dec 2020

This is basically #2888. The new P2P stack will have separate queues per reactor channel.

How's that? The issue here is incorrect dispatching (sending) while multiplexing over single TCP stream, while #2888 is about individual Reactor#Receive blocking receiving messages => sending != receiving. But you're right that if we adopt QUIC (independent streams), this issue will go away.

melekes on 21 Dec 2020

erikgrinaker on 21 Dec 2020

👍2

Was this page helpful?

0 / 5 - 0 ratings

Related issues

p2p: prevent bad peer from connecting to us for some time

melekes · 3Comments

replay doesn't work for fast sync

ebuchman · 3Comments

Can't run basic example

ddsvetlov · 3Comments

store/store.BlockStore.SaveBlock is not atomic

dshulyak · 3Comments

No empty blocks

ebuchman · 4Comments