
fuel-sync design #625

Draft — wants to merge 19 commits into base: master
Conversation

Collaborator

@xgreenx xgreenx commented Sep 18, 2022

Rendered

I described the implementation assuming that we can't change the block producer, or that we know the block producer at height X because Ethereum blocks are synced and we can get information about the producer.

The responsibility for deciding from whom and how to request the blocks lies on the P2P side. It is also possible to move this responsibility into FS, but that requires a corresponding modification of the traits.

@xgreenx xgreenx self-assigned this Sep 18, 2022
@xgreenx xgreenx added the question (Further information is requested), architecture (Something related to the architecture or the architecture description itself.), and fuel-sync labels Sep 19, 2022
@xgreenx xgreenx requested a review from a team September 19, 2022 13:13
@xgreenx xgreenx changed the title from "[WIP] fuel-sync design" to "[Ready for discussion] fuel-sync design" Sep 19, 2022
Contributor

@vlopes11 vlopes11 left a comment

Looks solid overall; a couple of comments.


1. Every service knows about other services related somehow to it. It
can be interpreted as: each service struct containing reference to other
services inside and using the `async/await` mechanism to interact.
Contributor

Shouldn't we require async/await only for operations that are I/O? It might be overkill to have everything async, unless ofc everything is I/O

Collaborator Author

Yes, I meant that we could call methods directly. I will rename it to "sync/async".
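The "call methods directly" resolution above could look like the following minimal sketch (all names here are illustrative, not the actual fuel-core API): a service holds a plain reference to a related service and calls it synchronously, reserving async only for genuinely I/O-bound work.

```rust
// Hypothetical sketch of rule 1: each service struct holds a reference to
// the services it depends on and interacts via direct (sync) calls.

struct TxPool {
    txs: Vec<u64>, // placeholder for real transactions
}

impl TxPool {
    /// Purely in-memory lookup: no I/O involved, so a plain sync method suffices.
    fn pending(&self) -> &[u64] {
        &self.txs
    }
}

struct FuelSync<'a> {
    // "Every service knows about other services": a direct reference.
    tx_pool: &'a TxPool,
}

impl<'a> FuelSync<'a> {
    fn includable_txs(&self) -> usize {
        // Direct call: no channels, no await.
        self.tx_pool.pending().len()
    }
}

fn main() {
    let pool = TxPool { txs: vec![1, 2, 3] };
    let sync = FuelSync { tx_pool: &pool };
    assert_eq!(sync.includable_txs(), 3);
}
```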

1. Every service knows about other services related somehow to it. It
can be interpreted as: each service struct containing reference to other
services inside and using the `async/await` mechanism to interact.
1. Every service knows nothing about other services. It can be interpreted
Contributor
This is ofc desirable, but then we need concrete types to be sent as message (via channels?). We don't know about the internals of the service, but then we need to know about the messages the service is ready to accept.

Or are we going to have a broker in the middle that will do that for us?

Collaborator Author

I didn't think about this rule a lot, because the usage of channels makes the code more asynchronous but less clear. Also, most services can work synchronously, and channels make development harder. For example, the current implementation of the TX pool uses channels: to get the list of transactions, we need to write a lot of boilerplate async code where we don't need it.
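The boilerplate being described could be sketched like this (a dependency-free illustration, not the actual txpool code): fetching the pool's transactions through a channel requires a request enum, a one-shot reply channel, and a running event loop, where a direct method call would be one line.

```rust
use std::sync::mpsc;
use std::thread;

// Hypothetical sketch of channel-based request/response: every query needs
// a message variant carrying its own reply channel.
enum PoolRequest {
    GetTransactions(mpsc::Sender<Vec<u64>>),
}

fn query_transactions() -> Vec<u64> {
    let (req_tx, req_rx) = mpsc::channel::<PoolRequest>();

    // The pool's event loop, running in its own thread.
    let pool = thread::spawn(move || {
        let txs: Vec<u64> = vec![10, 20, 30]; // placeholder transactions
        for req in req_rx {
            match req {
                PoolRequest::GetTransactions(reply) => {
                    let _ = reply.send(txs.clone());
                }
            }
        }
    });

    // Caller side: build a reply channel, send the request, block on the reply.
    let (reply_tx, reply_rx) = mpsc::channel();
    req_tx.send(PoolRequest::GetTransactions(reply_tx)).unwrap();
    let txs = reply_rx.recv().unwrap();

    drop(req_tx); // close the request channel so the event loop ends
    pool.join().unwrap();
    txs
}

fn main() {
    assert_eq!(query_transactions(), vec![10, 20, 30]);
}
```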

```rust
pub enum BlockBroadcast {
    NewSealedHeaders([Sealed<FuelBlockHeader>]),
    NewUnsealedHeaders([FuelBlockHeader]),
```
Contributor

nit: these channel types must be owned (here it's a semi-slice, not a vec or array)

Collaborator Author

Yes, it is only pseudocode representing the idea of what should be received from the channel.

After implementation, we will use more concrete types.
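One way the pseudocode could be made concrete with owned channel types, as the nit suggests, is sketched below. All type definitions here are illustrative stand-ins, not the final API.

```rust
// Hypothetical concrete versions of the pseudocode types, using owned `Vec`
// instead of the unsized `[..]` slices.

#[derive(Clone, Debug)]
struct FuelBlockHeader {
    height: u64,
}

#[derive(Clone, Debug)]
#[allow(dead_code)]
struct Sealed<T> {
    entity: T,
    signature: [u8; 64], // placeholder for the consensus seal
}

#[derive(Debug)]
enum BlockBroadcast {
    NewSealedHeaders(Vec<Sealed<FuelBlockHeader>>),
    NewUnsealedHeaders(Vec<FuelBlockHeader>),
}

fn main() {
    // An owned value like this can be sent through a channel without
    // lifetime issues.
    let broadcast = BlockBroadcast::NewUnsealedHeaders(vec![
        FuelBlockHeader { height: 1 },
        FuelBlockHeader { height: 2 },
    ]);
    if let BlockBroadcast::NewUnsealedHeaders(headers) = broadcast {
        assert_eq!(headers.len(), 2);
    }
}
```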

/// Example: If the transaction or blocker header is invalid, the punisher can decrease the reputation
/// of the data provider.
// TODO: `D` should implement some bound that allows retrieving the data's signer and allowing to broadcast it.
fn report<D>(&mut self, data: D, verdict: Verdict<Reason>);
Contributor

Shouldn't we have a mechanism to enforce the data to identify who is reporting? (ie AsRef<SomeKey>)

Collaborator Author

I think it is not essential "which" service reports it; more critical is "about what" we want to report. If we report about blocks, it can have one scoring system; if about transactions, another.

Also, we don't need to know about other services; we only need to know about the data (P2P already knows about the data).
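The point that reporting is keyed by the data type rather than the reporting service could be sketched as follows (names are illustrative): the reason enum is a type parameter, so blocks and transactions naturally get independent scoring systems.

```rust
// Hypothetical sketch of the `Punisher` idea: the verdict's reason type is
// specific to the reported data, not to the reporting service.

#[allow(dead_code)]
enum Verdict<Reason> {
    Positive,
    Negative(Reason),
}

#[allow(dead_code)]
enum BlockReason {
    InvalidSignature,
    NotLinked,
}

trait Punisher<Reason> {
    // `D` would carry enough information (e.g. the signer) for P2P to map
    // the report back to a peer, per the TODO in the quoted code.
    fn report<D>(&mut self, data: D, verdict: Verdict<Reason>);
}

struct P2pService {
    negative_reports: u32,
}

impl Punisher<BlockReason> for P2pService {
    fn report<D>(&mut self, _data: D, verdict: Verdict<BlockReason>) {
        // Stand-in for real reputation bookkeeping.
        if let Verdict::Negative(_) = verdict {
            self.negative_reports += 1;
        }
    }
}

fn main() {
    let mut p2p = P2pService { negative_reports: 0 };
    p2p.report("header-bytes", Verdict::Negative(BlockReason::InvalidSignature));
    assert_eq!(p2p.negative_reports, 1);
}
```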


pub trait BlocksFetcher: Punisher<BlockReason> + Gossiper {
/// Returns structure that can be used to subscribe for `BlockBroadcast`.
fn sender(&self) -> &Sender<BlockBroadcast>;
Contributor

Shouldn't we return the sender as owned? I know the user can clone it himself, but I think it's always going to be used as owned.

Collaborator Author

Returning the reference may be helpful if you want to check something but not subscribe, like is_closed or something else. On the interface level, we don't need to force the user to actually subscribe=) If he needs it, he will clone.

Contributor

A pattern we use now for owned senders is a subscribe method. We could have both an owned for creating a subscription and unowned version for read-only inquiries.

Collaborator Author

We can have both methods, but I want to keep the interface as minimal as possible=) sender adds flexibility, but I don't like the naming.

Member

Wouldn't this be a Receiver? Sender implies that anyone who obtains this could send a broadcast, instead of just listening for broadcasts.

Collaborator Author

You are right, the idea was to use Receiver to be able only to receive information.

I thought Sender.send required mutability, but it does not. So yeah, then it's better to name the method subscribe and return Receiver<...>
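The agreed resolution (a `subscribe` method returning an owned `Receiver`) could be sketched as below. In practice a multi-consumer broadcast channel (e.g. tokio's) would fit better; plain `std::sync::mpsc` is used here only to keep the sketch dependency-free, and all names are illustrative.

```rust
use std::sync::mpsc::{channel, Receiver, Sender};

// Hypothetical hub: `subscribe` hands out a receive-only handle, so a
// subscriber can never send broadcasts, only listen for them.
struct BlockBroadcastHub {
    subscribers: Vec<Sender<u64>>, // u64 stands in for `BlockBroadcast`
}

impl BlockBroadcastHub {
    fn subscribe(&mut self) -> Receiver<u64> {
        let (tx, rx) = channel();
        self.subscribers.push(tx);
        rx // the caller gets an owned, receive-only handle
    }

    fn broadcast(&self, msg: u64) {
        for sub in &self.subscribers {
            let _ = sub.send(msg); // ignore disconnected subscribers
        }
    }
}

fn main() {
    let mut hub = BlockBroadcastHub { subscribers: Vec::new() };
    let rx = hub.subscribe();
    hub.broadcast(42);
    assert_eq!(rx.recv().unwrap(), 42);
}
```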

fn sender(&self) -> &Sender<BlockBroadcast>;

/// Pings the block fetcher to fetch a new latest block header and broadcasts it.
fn ping(&mut self);
Contributor

Maybe we rename it? Ping is traditionally a request with a response (and the response is the indicator that everything is fine). In this case, we are not expecting a response, so we could instead call it heartbeat

AFAIU we are expecting the service to take some action in some channel. Do we have a way to enforce that in the signature? Otherwise, a service can implement this trait, and optionally implement it correctly/the intended way

Collaborator Author

Actually, maybe it's better to remove ping entirely and add this request to fetch.

I didn't get the second part=(

- Is it a new height?
- No -> Is it already in the blockchain?
- Yes -> Ignore(Maybe report some neutral error to P2P to prevent DDoS)
- No -> It is case of the [fork](#fork).
Contributor

Hmm, this one is a hard question. I'm not sure we can passively receive forks without it being a punishable action. Let's open a thread to discuss this one?

Collaborator Author

I guess you are talking about "Yes -> Ignore(Maybe report some neutral error to P2P to prevent DDoS)".

Yes, we need to punish. Actually, it should be solved on the P2P level, because P2P should punish nodes that send unrequested information.

Each node can report a new height once; each subsequent time, the reputation should decrease (if we didn't request it).

![img.png](diagrams/fuel-core-design-fuel-sync-part.png)

The FS is a connection point between:
- In the PoA version P2P and BI
Member

@Voxelot Voxelot Sep 24, 2022

I think we could generalize the connection point between FS and CM regardless of whether we are in PoS mode or not.

The benefit of this approach is that the synchronizer isn't required to contain business logic about how to validate the sealed block consensus metadata, and just defers that to whichever configuration the CM is running in (PoA or PoS).

Collaborator Author

We can=) I see CM as a descending service for FS in this case. So FS will execute something like a check(blocks: &[Sealed<Block>]) -> bool method and, afterward, decide what to do with the blocks (reject or commit).
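This FS -> CM connection could be sketched as one narrow trait method (all names here are illustrative): the synchronizer defers consensus validation to whatever the CM is configured as (PoA or PoS) and only decides commit vs. reject.

```rust
// Hypothetical sketch of the generalized FS -> CM boundary discussed above.

#[allow(dead_code)]
struct Sealed<B> {
    block: B,
    seal: u8, // placeholder for the consensus seal
}

trait ConsensusModule {
    /// FS only asks "are these sealed blocks valid?", with no knowledge of
    /// how the seal is checked (PoA signature, PoS metadata, ...).
    fn check(&self, blocks: &[Sealed<u64>]) -> bool;
}

struct PoA;
impl ConsensusModule for PoA {
    fn check(&self, blocks: &[Sealed<u64>]) -> bool {
        blocks.iter().all(|b| b.seal == 1) // stand-in for a signature check
    }
}

fn sync_step(cm: &dyn ConsensusModule, blocks: &[Sealed<u64>]) -> &'static str {
    // After the CM's verdict, FS decides what to do with the blocks.
    if cm.check(blocks) { "commit" } else { "reject" }
}

fn main() {
    let blocks = vec![Sealed { block: 1, seal: 1 }];
    assert_eq!(sync_step(&PoA, &blocks), "commit");
}
```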


Those rules are rough, and services can have cases where some functionality
requires direct access and some pub-sub model. So it should be decided
individually. But after comparison, the third rule covers all our needs
Member

> So it should be decided individually.

I think this is the correct approach, and focuses on using the right tool for each job instead of complicating simple APIs like the txpool with channels just for the sake of consistency.

fn sender(&self) -> &Sender<BlockBroadcast>;

/// Fetches the data(somehow, maybe in parallel) and broadcasts ready parts.
fn fetch(&mut self, request: FetchRequest);
Member

@Voxelot Voxelot Sep 24, 2022

This may be a good usecase for FuturesUnordered, which allows joining on a set of parallel tasks.

The consumer of the API would have the option of using poll_next if they want to process the data immediately as it becomes available instead of having to wait for all the futures to complete.

xgreenx and others added 2 commits September 25, 2022 19:00
Co-authored-by: Brandon Kite <brandonkite92@gmail.com>
Co-authored-by: Brandon Kite <brandonkite92@gmail.com>
@Voxelot Voxelot requested a review from a team September 26, 2022 17:11
}

// TODO: `D` should implement some bound that allows to serialize it for gossiping.
pub trait Gossiper<D> {
Member

@Voxelot Voxelot Sep 26, 2022

I think I'd prefer using a concrete enum over all gossipable data types instead of a generic param. This makes it very clear what kind of data is actually supported for gossip, and also gives us more compiler guarantees that gossiping is implemented across all of our types via exhaustive matching.
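The concrete-enum alternative to a generic `Gossiper<D>` could be sketched as below (names are illustrative): every gossipable type is a variant, so adding a new type forces every match site to handle it exhaustively.

```rust
// Hypothetical placeholders for the gossipable data types.
struct Transaction;
struct SealedBlockHeader;

// One closed enum over everything that may be gossiped.
enum GossipData {
    Transaction(Transaction),
    SealedHeader(SealedBlockHeader),
}

trait Gossiper {
    fn gossip(&self, data: GossipData) -> &'static str;
}

struct P2p;
impl Gossiper for P2p {
    fn gossip(&self, data: GossipData) -> &'static str {
        // Exhaustive match: the compiler flags any unhandled variant when a
        // new gossipable type is added.
        match data {
            GossipData::Transaction(_) => "tx topic",
            GossipData::SealedHeader(_) => "header topic",
        }
    }
}

fn main() {
    assert_eq!(P2p.gossip(GossipData::Transaction(Transaction)), "tx topic");
}
```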


The same approach can be used between the P2P and TP.
`Punisher` trait adds a mechanism for [reputation management](https://github.com/FuelLabs/fuel-core/issues/460).
`Gossiper` trait solves [gossiping of invalid transactions](https://github.com/FuelLabs/fuel-core/issues/595).
Contributor

I don't see how a trait solves this problem. Gossipsub Messages are already clearly defined in fuel-p2p; the referenced issue #595 is more about when there is a "valid" type, like a Transaction, that is invalid in terms of its state on the chain itself.

Collaborator Author

We have several validity checks:

  • Validate that the entity can be properly decoded from the bytes. This should be done on the P2P side.
  • Validate that transactions are valid in business logic (all input arguments are unique, all outputs are valid, use CheckedTransaction). That is not a P2P responsibility; it seems part of the TXPool.
  • Validate whether the transaction is already in the TXPool (we don't need to gossip the same transaction). This can be done only by the TXPool.

Member

> Validate whether the transaction is already in the TXPool (we don't need to gossip the same transaction). This can be done only by the TXPool.

It's debatable whether compact blocks over p2p are really useful. In practice, they can slow down consensus if nodes have to query peers for txs they are missing in their txpool. I think we should assume a broadcast of full blocks. Full blocks are also required for syncing old blocks, as the txs will no longer exist in anyone's txpool.

Collaborator Author

Why should nodes query transactions that are missing in the node's pool?

Member

Ah wait, I misunderstood: I thought validating whether a transaction is already in the txpool would be used for compact blocks. Re-read and realized it's just to prevent re-gossiping txs we've already received.

@Voxelot Voxelot requested a review from a team September 28, 2022 16:30
@xgreenx xgreenx changed the title from "[Ready for discussion] fuel-sync design" to "fuel-sync design" Sep 28, 2022
@xgreenx xgreenx marked this pull request as ready for review September 28, 2022 22:13
- Is it a valid header? Blocks should be linked, and signatures should match. After, forwards blocks to the CM to verify producer/consensus at the according height.
- No -> `Punisher::report`.
- Yes -> Next step.
- Don't know. The Ethereum part is not synced yet to this height. If the height is
Member

We can defer ethereum height checking to the BI, to consolidate this check between CM->BI and FS->BI.

If we are behind on the ethereum height, the BI should decide whether to wait on a better height or reject the block.

Collaborator Author

But it means that we should check the consensus details when the Ethereum block is available. This check should be done by the CM then; the CM can wait for a new height from the relayer.

Member

Ah, good point! Maybe we should check it in a few places then, since it'll be up to the CM to decide what details from Ethereum it needs (or not), whereas the block importer needs to provide some guarantees that the messages included in a block are going to exist before passing the block to the executor.

Collaborator Author

@xgreenx xgreenx Sep 28, 2022

Then it's decided=) SealValidator::validate is async. If there is no Ethereum block, then the CM waits for it (in PoA, this is the default behavior for all block headers; in PoS, we need to think about cases with a fake height). When BI asks the relayer for the information, it will already be available (because the CM awaited it). BlocksCommitter::commit is also async.

With this approach, we have a problem: we can have many pending async requests, and each consumes memory. Maybe we need to limit this somehow.

Member

The first place we can restrict this is via the active set size, since that's equivalent to the buffer size of blocks we're trying to sync.

The other way to restrict this is to validate blocks with the CM and commit to BI one at a time. This assumes that the CM doesn't wait for the eth height until it has already verified things like the signature first, to avoid this becoming an attack vector.

Collaborator Author

@xgreenx xgreenx Sep 29, 2022

Could you elaborate more on the second solution, please? I didn't get it=(

The CM can't verify the signature (that it was made by the correct signer) because it doesn't know who the signer is (Eth is not synced enough).

Member

@Voxelot Voxelot Sep 30, 2022

Ah true, I suppose we could use some heuristic where we check if our relayer node is synced and the da_height is greater than some margin to reject. This way we add some tolerance in case our relayer's eth node claims to be synced when it really isn't.

For example:

```mermaid
flowchart
    a["if block.da_height > relayer.da_height"] -- no --> b["check signature"]
    b -- valid --> s["success"]
    b -- invalid --> f["reject"]
    a -- yes --> c["wait for relayer to sync"]
    c --> d["is block.da_height > relayer.da_height + 10 (adjustable margin)"]
    d -- yes --> f
    d -- no --> a
```
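The margin heuristic from the flowchart could be sketched as a single decision function (names and the margin value are illustrative): tolerate a relayer that is slightly behind, but reject blocks whose da_height exceeds the relayer's height by more than an adjustable margin.

```rust
// Hypothetical one-shot version of the flowchart's decision; the real flow
// would loop through `WaitForRelayer` until the relayer catches up.

#[derive(Debug, PartialEq)]
enum SealCheck {
    CheckSignature, // relayer caught up: proceed to signature validation
    WaitForRelayer, // relayer behind, but within tolerance: wait and retry
    Reject,         // beyond the margin: treat the block as invalid
}

fn da_height_check(block_da: u64, relayer_da: u64, margin: u64) -> SealCheck {
    if block_da <= relayer_da {
        SealCheck::CheckSignature
    } else if block_da > relayer_da + margin {
        SealCheck::Reject
    } else {
        SealCheck::WaitForRelayer
    }
}

fn main() {
    assert_eq!(da_height_check(5, 10, 10), SealCheck::CheckSignature);
    assert_eq!(da_height_check(15, 10, 10), SealCheck::WaitForRelayer);
    assert_eq!(da_height_check(25, 10, 10), SealCheck::Reject);
}
```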

Collaborator Author

Yep, added the final description of what to do in the case when the Relayer is behind.

Member

@Voxelot Voxelot commented Jan 26, 2023

Redo to match the current state of sync without reputation. Add reputation in a follow-up.

@xgreenx xgreenx marked this pull request as draft August 9, 2023 12:15