[chunks] Move chunk completion out of ShardsManager #7622

robin-near · 2022-09-16T02:26:07Z

Move handling of chunk completion to a ClientActor message (ShardsManagerResponse), because this will eventually be async. So no more passing DoneApplyingChunksCallback. Add support of this message to TestEnv; tests that rely on chunk part messaging now need to manually handle such messages (because the TestEnv doesn't have real actors). Fixed tests that need this.
Move ReedSolomonWrapper inside ShardsManager. However Client still has a copy because it actually needs to encode the parts to produce a chunk - at least the merkle root is needed.
Does not touch ChainStore yet. Leaving that till later.
Minor refactoring to push chunk part processing logic from Client into ShardsManager (otherwise Client needs to fetch header and stuff).

mzhangmzz

Overall looks good! I have two comments:

It might be better to achieve the callback of on_chunk_complete not through actor messages. My concern of changing that to actor messages is that it shares the same queue as all other messages that ClientActor receives and if the queue is long, the callback is delayed. You can still have a ClientActorAdapterForShardsManager trait which has on_chunk_complete function and let client actor implement that function.
Maybe we should send the whole chunk instead of only chunk header to client and move the logic of saving chunks from ShardsManager to client.

mzhangmzz · 2022-09-20T15:55:27Z

chain/client/src/client.rs

-        prev_block_hash: &CryptoHash,
-        apply_chunks_done_callback: DoneApplyChunkCallback,
-    ) {
+    pub fn check_incomplete_chunks(&mut self, prev_block_hash: &CryptoHash) {


Maybe we should move this function to ShardsManger too?

Yeah, I just don't wanna do too much in this PR yet. So... kinda pausing at an arbitrary point.

mzhangmzz · 2022-09-20T20:49:29Z

chain/chunks/src/lib.rs

@@ -1760,6 +1812,23 @@ impl ShardsManager {
        Ok(result)
    }

+    pub fn process_partial_encoded_chunk_response(


I think this function is not used right now

mzhangmzz · 2022-09-20T21:12:26Z

Adding @matklad as a reviewer too

robin-near · 2022-09-21T03:41:54Z

Thanks! How would we invoke on_chunk_complete not through actor messages? We need a &mut Client for that call...

matklad · 2022-09-21T12:29:16Z

Do we have some broader context here? What is the end-goal of the refactor and such?

matklad · 2022-09-21T12:38:29Z

chain/chunks/src/client.rs

+    ChunkCompleted(ShardChunkHeader),
+}
+
+pub trait ClientAdapterForShardsManager: MsgRecipient<ShardsManagerResponse> {}


Let's maybe hide actix from the API here?

diff --git a/chain/chunks/src/client.rs b/chain/chunks/src/client.rs index ed2b2ba58..e207c0cd1 100644 --- a/chain/chunks/src/client.rs +++ b/chain/chunks/src/client.rs @@ -2,12 +2,18 @@ use actix::Message; use near_network::types::MsgRecipient; use near_primitives::sharding::ShardChunkHeader; +pub trait ClientAdapterForShardsManager { + fn did_complete_chunk(&self, chunk_header: ShardChunkHeader); +} + #[derive(Message)] #[rtype(result = "()")] pub enum ShardsManagerResponse { ChunkCompleted(ShardChunkHeader), } -pub trait ClientAdapterForShardsManager: MsgRecipient<ShardsManagerResponse> {} - -impl<A: MsgRecipient<ShardsManagerResponse>> ClientAdapterForShardsManager for A {} +impl<A: MsgRecipient<ShardsManagerResponse>> ClientAdapterForShardsManager for A { + fn did_complete_chunk(&self, chunk_header: ShardChunkHeader) { + self.do_send(ShardsManagerResponse::ChunkCompleted(chunk_header)) + } +} diff --git a/chain/chunks/src/lib.rs b/chain/chunks/src/lib.rs index 990d7cbee..b376f353d 100644 --- a/chain/chunks/src/lib.rs +++ b/chain/chunks/src/lib.rs @@ -85,7 +85,7 @@ use std::sync::Arc; use std::time::{Duration, Instant}; use chrono::DateTime; -use client::{ClientAdapterForShardsManager, ShardsManagerResponse}; +use client::ClientAdapterForShardsManager; use near_primitives::time::Utc; use rand::seq::IteratorRandom; use rand::seq::SliceRandom; @@ -1967,7 +1967,7 @@ impl ShardsManager { self.encoded_chunks.remove_from_cache_if_outside_horizon(chunk_hash); self.requested_partial_encoded_chunks.remove(chunk_hash); debug!(target: "chunks", "Completed chunk {:?}", chunk_hash); - self.client_adapter.do_send(ShardsManagerResponse::ChunkCompleted(header)); + self.client_adapter.did_complete_chunk(header); } /// Send the parts of the partial_encoded_chunk that are owned by `self.me` to the

In generally we want to move away from it, so better if stuff doesn't call actix APIs directly

Thanks @matklad; @mzhangmzz, is this the same thing you were talking about? It doesn't solve the fact that this will go on the same client actor queue though. I don't see how to avoid that...

Let me do this for the next PR; I need to change the API anyway.

mzhangmzz · 2022-09-21T15:15:43Z

Do we have some broader context here? What is the end-goal of the refactor and such?

@matklad The end goal is to move ShardsManager to its own thread(actor)

matklad · 2022-09-21T15:18:05Z

Is there some issue on that, so that we can mention it in the RP description, to make sure it gets surfaced in git blame in the future?

mzhangmzz · 2022-09-21T20:35:09Z

Is there some issue on that, so that we can mention it in the RP description, to make sure it gets surfaced in git blame in the future?

Good point. @robin-near do you mind creating a github and a jira issue to track this project?

mzhangmzz · 2022-09-21T20:48:41Z

Could you also run nayduck tests? Since this PR touches quite a few things

robin-near requested a review from mzhangmzz September 16, 2022 02:26

robin-near force-pushed the async branch from 747c085 to af2ca8c Compare September 19, 2022 21:55

robin-near changed the title ~~Draft for moving stuff out of ShardsManager~~ [chunks] Move chunk completion out of ShardsManager Sep 19, 2022

robin-near marked this pull request as ready for review September 19, 2022 22:36

robin-near requested a review from a team as a code owner September 19, 2022 22:36

mzhangmzz reviewed Sep 20, 2022

View reviewed changes

mzhangmzz requested a review from matklad September 20, 2022 21:12

matklad reviewed Sep 21, 2022

View reviewed changes

robin-near linked an issue Sep 21, 2022 that may be closed by this pull request

Move ShardsManager chunk management logic to a separate Actix thread #7662

Closed

robin-near force-pushed the async branch from 4f84f3f to 99813b0 Compare September 23, 2022 16:07

mzhangmzz approved these changes Sep 23, 2022

View reviewed changes

Move chunk completion out of ShardsManager.

3f5b40a

robin-near force-pushed the async branch from 99813b0 to 3f5b40a Compare September 23, 2022 23:57

robin-near merged commit 1b48959 into near:master Sep 24, 2022

nikurt pushed a commit that referenced this pull request Sep 26, 2022

Move chunk completion out of ShardsManager. (#7622)

201a24f

nikurt pushed a commit that referenced this pull request Nov 9, 2022

Move chunk completion out of ShardsManager. (#7622)

eebd8ae

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[chunks] Move chunk completion out of ShardsManager #7622

[chunks] Move chunk completion out of ShardsManager #7622

robin-near commented Sep 16, 2022 •

edited

Loading

mzhangmzz left a comment

mzhangmzz Sep 20, 2022

robin-near Sep 21, 2022

mzhangmzz Sep 20, 2022

mzhangmzz commented Sep 20, 2022

robin-near commented Sep 21, 2022

matklad commented Sep 21, 2022

matklad Sep 21, 2022

robin-near Sep 21, 2022

robin-near Sep 22, 2022

mzhangmzz commented Sep 21, 2022

matklad commented Sep 21, 2022

mzhangmzz commented Sep 21, 2022

mzhangmzz commented Sep 21, 2022

[chunks] Move chunk completion out of ShardsManager #7622

[chunks] Move chunk completion out of ShardsManager #7622

Conversation

robin-near commented Sep 16, 2022 • edited Loading

mzhangmzz left a comment

Choose a reason for hiding this comment

mzhangmzz Sep 20, 2022

Choose a reason for hiding this comment

robin-near Sep 21, 2022

Choose a reason for hiding this comment

mzhangmzz Sep 20, 2022

Choose a reason for hiding this comment

mzhangmzz commented Sep 20, 2022

robin-near commented Sep 21, 2022

matklad commented Sep 21, 2022

matklad Sep 21, 2022

Choose a reason for hiding this comment

robin-near Sep 21, 2022

Choose a reason for hiding this comment

robin-near Sep 22, 2022

Choose a reason for hiding this comment

mzhangmzz commented Sep 21, 2022

matklad commented Sep 21, 2022

mzhangmzz commented Sep 21, 2022

mzhangmzz commented Sep 21, 2022

robin-near commented Sep 16, 2022 •

edited

Loading