remove vec allocation in CRDS deserialize #5143

alexpyattaev · 2025-03-04T22:53:28Z

Problem

Unnecessary Vec allocation for every CrdsValue deserialization

Summary of Changes

Use stack-allocated array instead

Fixes #
partially addresses #5034

gregcusack

looks mostly good. just wondering if you've checked into the tradeoffs here. thank you!

gregcusack · 2025-03-05T01:36:07Z

gossip/src/crds_value.rs

+        let mut buffer = [0u8; PACKET_DATA_SIZE];
+        let position = {
+            let mut cursor = std::io::Cursor::new(buffer.as_mut());
+            bincode::serialize_into(&mut cursor, &data).map_err(serde::de::Error::custom)?;
+            cursor.position() as usize
+        };
+        let hash = solana_sha256_hasher::hashv(&[signature.as_ref(), &buffer[0..position]]);


you have any numbers on the trade off between increased stack usage vs. reduced heap allocations?

Stack allocation of 1KB is essentially free and does not rely on any global locking. Heap allocation is VERY expensive in comparison, see numbers below (keep in mind that is microbenchmark data on 1 thread).

heap 1000 time: [529.70 ns 530.29 ns 530.92 ns] stack 1000 time: [293.75 ns 295.05 ns 296.31 ns] heap 100 time: [164.27 ns 164.97 ns 165.82 ns] stack 100 time: [25.971 ns 26.017 ns 26.073 ns]

https://github.com/alexpyattaev/heapstack

gossip/src/crds_value.rs

steviez · 2025-03-05T09:39:51Z

gossip/src/crds_value.rs

+        let mut buffer: MaybeUninit<[u8; PACKET_DATA_SIZE]> = MaybeUninit::uninit();
+        // SAFETY: we are only using this buffer to store serialize results
+        let buf_ref = unsafe { buffer.assume_init_mut() };


I don't think we should do this as we haven't actually initialized the buffer. We currently do something like this elsewhere in the codebase, but we shouldn't do it there either and PR's that have addressed that behavior have gotten hung up in review.

I think there are two approaches we can consider here:

Just do [0u8; PACKET_DATA_SIZE] like you previously had OR

Do something like I did in Introduce API to safely initialize Packets #3533 (which never merged 😆 )

This linked PR also has some conversation about why we shouldn't do assume_init() in one of the other places that we do

Not having to zero the buffer should seemingly have some gains. But, I think removing the alloc is the more significant gain so maybe it makes sense to keep this PR simpler for now

CC @alessandrod

@steviez I do not see a reason to zero-out this buffer. We are never actually reading any values from it for anything other than hashing. I agree the actual overhead is probably very small, but we are doing this up to 20K times per second. @alessandrod was the one who suggested that initializing it is not useful.

I don't want to speak for Alessandro, but here is a comment from him on a very similar item on that same PR I linked: #3533 (comment)

If I understand correctly, I think his suggestion is something like:

Create MaybeUninit<[u8; PACKET_DATA_SIZE]> like you have it now

Serialize data into the buffer

Given that a MaybeUninit is written to with pointers, you can't directly call bincode::serialize_into() (unless I missed something) which is why I wrote that thin wrapper in my code

Fill the remainder of the buffer with 0's

Finally, do .assume_init() to mark that we have now fully initialized the buffer

remove vec allocation in CRDS deserialize

95fc74a

alexpyattaev marked this pull request as ready for review March 4, 2025 23:02

alexpyattaev requested a review from gregcusack March 4, 2025 23:02

gregcusack reviewed Mar 5, 2025

View reviewed changes

comments from review

d8d544e

steviez reviewed Mar 5, 2025

View reviewed changes

gregcusack self-requested a review March 5, 2025 23:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remove vec allocation in CRDS deserialize #5143

remove vec allocation in CRDS deserialize #5143

alexpyattaev commented Mar 4, 2025

gregcusack left a comment

gregcusack Mar 5, 2025

alexpyattaev Mar 5, 2025

steviez Mar 5, 2025

alexpyattaev Mar 5, 2025

steviez Mar 5, 2025

remove vec allocation in CRDS deserialize #5143

Are you sure you want to change the base?

remove vec allocation in CRDS deserialize #5143

Conversation

alexpyattaev commented Mar 4, 2025

Problem

Summary of Changes

gregcusack left a comment

Choose a reason for hiding this comment

gregcusack Mar 5, 2025

Choose a reason for hiding this comment

alexpyattaev Mar 5, 2025

Choose a reason for hiding this comment

steviez Mar 5, 2025

Choose a reason for hiding this comment

alexpyattaev Mar 5, 2025

Choose a reason for hiding this comment

steviez Mar 5, 2025

Choose a reason for hiding this comment