
Optimistic Block #10584


Closed
11 of 12 tasks
Tracked by #46
Longarithm opened this issue Feb 7, 2024 · 2 comments
Assignees
Labels
A-stateless-validation Area: stateless validation

Comments


Longarithm commented Feb 7, 2024

Goal

Remove 2x latency of chunk execution by introducing OptimisticBlock.

Plan

  • Create OptimisticBlock
  • Remove block_hash dependency from runtime
  • Change Chain::get_update_shard_job to accept not only Block but also OptimisticBlock together with Chunks
  • Implement OptimisticBlockPool
  • Validate OptimisticBlock
  • Distribute OptimisticBlock

Testing (previous steps also include testing)

  • Basic TestLoop test
  • Run forknet
  • Invalid optimistic blocks
  • Optimistic blocks not distributed to everyone
  • Optimistic blocks with missing chunks or wrong chunks included
  • Optimistic block and block arriving together
@Longarithm (Member Author)

Zulip thread: https://near.zulipchat.com/#narrow/channel/295558-core/topic/Optimistic.20Block.20Design/near/467293949

Original description from 8 Feb 2024

The stateless validation implementation introduces a 2x latency in chunk execution. This is because validation of chunk (N+1) is blocked on the execution of chunk N, and that validation itself repeats the execution of chunk N.

It can be avoided by optimistic execution:

  • Once the chunk producer (CP) produces chunk N, it can go ahead and distribute it to the CP for chunk (N+1). Because they have the state, they can execute it before its inclusion into a block and record the resulting state proof.
  • Once the block producer (BP) includes chunk N into a block, the CP for (N+1) will already have the state proof for N ready, so it can immediately produce chunk (N+1) and send the state witness to the chunk validators (CVs).

It is a clear win after the stateless validation release.
For now, I believe we don't need to change the config delays, because stateless validation improves performance on its own.

Original context: https://docs.google.com/document/d/1k0NRMcLsDZp6C9pCRjNu5l7irDyRHsZ3VtKAKno_tFY/edit#heading=h.7ae0b4dh7648

Another picture of the current workflow I came up with while trying to understand this:

[image: diagram of the current workflow]

VanBarbascu added a commit to VanBarbascu/nearcore that referenced this issue Jan 17, 2025
This PR introduces the shape of the optimistic block described in near#10584. Along with it, I added the functions to create and sign it. In the next PR, I will link this into the block production path.
VanBarbascu added a commit to VanBarbascu/nearcore that referenced this issue Jan 17, 2025
…ks` on apply chunk (near#12746)

### Context

We want to improve chunk processing efficiency by applying chunks
optimistically (near#10584), when all partial chunks for the next height and
block metadata (from an `OptimisticBlock`) are already available. This
allows the results of chunk application to be reused when the actual
block is received, enabling the next chunk to be produced immediately.

Currently, this work is a step towards supporting `OptimisticBlock`.
While `OptimisticBlock` is not introduced yet, this refactor prepares
the codebase for its implementation by reducing data dependency on the
current block.

### Change

I replaced the dependency on `Block` with `ApplyChunkBlockContext` and
`chunk_headers: &Chunks`. We'll just need to add a conversion from
`OptimisticBlock` to `ApplyChunkBlockContext` later. Chunk headers are
simply taken from the block; for an optimistic block, they must be
supplied by ShardsManager.

Some APIs are refactored to reflect that change. 

### Next steps

* Convert `OptimisticBlock` to `ApplyChunkBlockContext`.
* Then, call `get_update_shard_job` for `OptimisticBlock` and reuse the
result when it is later called for the actual `Block`.

---------

Co-authored-by: Razvan Barbascu <r.barbascu@gmail.com>
github-merge-queue bot pushed a commit that referenced this issue Jan 20, 2025
This PR introduces the shape of the optimistic block described in #10584. Along with it, I added the functions to create and sign it. In the next PR, I will link this into the block production path.
github-merge-queue bot pushed a commit that referenced this issue Jan 23, 2025
We continue the implementation of Optimistic Block (#10584) by adding the
logic to produce the optimistic block as soon as the previous block is done.

If available, the optimistic block will be used in the production of the
actual block so that both use the same timestamp.

---------

Co-authored-by: Aleksandr Logunov <the.alex.logunov@gmail.com>
github-merge-queue bot pushed a commit that referenced this issue Jan 24, 2025
…12777)

#10584

There is another unexpected dependency on the block hash during chunk
application: it is used in `shuffle_receipt_proofs` to shuffle new
receipts targeting our shard. As the block hash is unknown in an
optimistic block, I replace it with the prev block hash via a protocol upgrade.

Additionally, I use `Chunks` instead of `Block` in
`collect_incoming_receipts_from_chunks`; this will be useful for the
optimistic block execution flow later.

## Security

A block producer could brute-force hashes to get a salt that gives a more
desirable order. But the block hash is prone to that as well, so the prev
hash has equivalent safety.

## Upgrade

I use the `BlockHeightForReceiptId` feature because it has a similar goal
and is going to be released soon. Adding a separate feature would, I
think, make the code harder to read.

## Testing

IMO it only makes sense to check the consistency of the shuffling; I don't
see much value in checking that a specific salt is used. So I claim that
running the existing tests is enough to cover this change.
github-merge-queue bot pushed a commit that referenced this issue Feb 5, 2025
Introduce the pool for storing OptimisticBlocks, joining them with
chunks, executing them, and reusing the cached results. #10584

The change is pretty big; however, I think it's important to merge it at
once because it already gives a working example. I'll describe the two
major changes, which should be sufficient for review.

### OptimisticBlockChunksPool

It receives the optimistic block (OB), currently from the block producer
itself only, and receives chunks from ShardsManager. Once both the OB and
all its chunks on top of some prev block have been received, it allows
taking the ready OB.

Some primitive throttling and garbage collection are required to ensure
that OBs are not executed many times and that the pool doesn't OOM when
there are forks. For that purpose, we maintain a `minimal_base_height` for
chunks and a `block_height_threshold` for blocks. Note that we **don't
remove** chunks immediately, because if a block is skipped, the chunks
should be reused to process the next OB.

This feature is independent, so I also implemented simple unit tests for
it.

### Processing OB

As discussed before, the result of chunk execution on top of an OB doesn't
impact any part of block processing and doesn't persist anything. It is
simply put into a cache, which can be reused when the actual block is
received.

This cache, however, needs a unique key to store results under. For that,
I introduce `CachedShardUpdateKey`, which includes the necessary fields of
the Block or OB, all the chunks, and the shard id (the index could also
work). Note that we need the chunk hashes because they define the prev
outgoing receipts, which in turn are used to generate the incoming
receipts for our chunk.

For execution, `BlocksInProcessing` is extended a bit to also hold OBs, so
that the number of parallel chunk executions for blocks and OBs together
is limited. The population happens in `postprocess_optimistic_block`.

### Testing

Finally, we are also able to write `test_optimistic_block`. For now, I
just check that there is at least **one** cache hit; let's think about
more complex cases later.

I'll resolve merge conflicts later.
@Longarithm (Member Author)

Performance update: https://near.zulipchat.com/#narrow/channel/295558-core/topic/Optimistic.20Block.20Design/near/499464703

With a 300ms block delay plus mirror traffic plus multi-shard synth-bm native transfers traffic, the block rate goes from 1.66 to 2.21 blocks per second (2.7 bps with just mirror traffic).

shreyan-gupta pushed a commit to shreyan-gupta/nearcore that referenced this issue Mar 28, 2025
shreyan-gupta pushed a commit to shreyan-gupta/nearcore that referenced this issue Mar 28, 2025