
Support decoding multiplexed RRD streams #7091

Merged: 5 commits into main on Aug 8, 2024
Conversation

@teh-cmc (Member) commented on Aug 7, 2024

TL;DR: the following is now possible:

cat docs/snippets/all/archetypes/*_rust.rrd | rerun -

This will of course become more interesting as you build more and more complex CLI pipelines with rerun rrd.

(Also fixed some missing buffered I/O while I was at it.)

Checklist

  • I have read and agree to the Contributor Guide and the Code of Conduct
  • I've included a screenshot or gif (if applicable)
  • I have tested the web demo (if applicable):
  • The PR title and labels are set such as to maximize their usefulness for the next release's CHANGELOG
  • If applicable, add a new check to the release checklist!
  • I have noted any breaking changes to the log API in CHANGELOG.md and the migration guide

To run all checks from main, comment on the PR with @rerun-bot full-check.

@teh-cmc added labels enhancement (New feature or request), include in changelog, and CLI (Related to the Rerun CLI) on Aug 7, 2024
```diff
 pub struct Decoder<R: std::io::Read> {
     version: CrateVersion,
     compression: Compression,
-    read: R,
+    read: Reader<R>,
```
Member:
Do we ever want to support unbuffered reads? Aren't those just strictly slower in almost all cases?

Member Author:

Not all cases -- if you're reading from an array of bytes, as we do in a bunch of places, adding extra buffering is a pure waste of time and space.

```rust
/// This is particularly useful when working with stdio streams.
///
/// If you're not familiar with multiplexed RRD streams, then you probably want to use
/// [`Decoder::new`] instead.
```
Member:

Why do we need both constructors though? They look identical, except one requires a buffered reader.

Isn't it just better if we have one constructor that always handles concatenated streams?

Member Author:

Three reasons:

  1. As mentioned above, buffering is a net loss in some real-world cases that we depend on today.
  2. There's non-negligible overhead in constantly checking for unexpected FileHeaders, which there's really no reason to pay for in most cases.
  3. This is a very specific constructor that relies specifically on std::io::BufReader, as opposed to any type that implements std::io::BufRead.

@teh-cmc force-pushed the cmc/multiplexed_decodeer branch from 9533541 to 951ce6b on August 8, 2024 at 07:49
@teh-cmc teh-cmc merged commit 84f63a0 into main Aug 8, 2024
10 of 19 checks passed
@teh-cmc teh-cmc deleted the cmc/multiplexed_decodeer branch August 8, 2024 07:50
teh-cmc added a commit that referenced this pull request Aug 8, 2024
You can now do this:
```
cat docs/snippets/all/archetypes/*_rust.rrd | rerun rrd print
```

and this:
```
cat docs/snippets/all/archetypes/*_rust.rrd | rerun rrd merge -o /tmp/all_merged.rrd
```

and this:
```
cat docs/snippets/all/archetypes/*_rust.rrd | rerun rrd compact --max-rows 99999999 --max-bytes 999999999 -o /tmp/all_compacted_max.rrd
```

- Part of #7048 
- DNM: requires #7091