Add torchcodec cpu #798

jadechoghari · 2025-03-03T06:33:24Z

What this does

This PR replaces torchvision CPU decoding by torchcodec CPU decoding.
Also added a decode_video_frames function that wraps multiple backends, instead of calling decode_video_frames_BACKENDNAME separately. This makes it more efficient and allows us to add more decoders later on!

The decoder used is decided based on the dataset.video_backend key, but defaults to torchcodec.

How it was tested

Test and Benchmark the decoders on different datasets/policies.

How to checkout & try? (for the reviewer)

Just run the training script, with a dataset containing videos to decode.
example:

python lerobot/scripts/train.py \
    --output_dir=outputs/train/act_aloha_insertion \
    --policy.type=act \
    --dataset.repo_id=lerobot/aloha_sim_insertion_human \
    --env.type=aloha \
    --env.task=AlohaInsertion-v0 \

Benchmarks

Ran one benchmark on lerobot/aloha_sim_insertion_human_image dataset
Comparison: PyAV vs TorchCodec (CPU)

Metric	PyAV	TorchCodec-CPU
Video to Images Load Time Ratio	1.87	1.25
Avg MSE	5.14e-05	4.88e-05
Avg PSNR	43.17	43.37
Avg SSIM	0.995	0.995

What's left

~~Remove/suppress libdav1d logs (they're noisy) -> there's no env variable to disable those for now but they'll be deactivated in the next version of torchcodec.~~

PR is in a good state ✅

for more information, see https://pre-commit.ci

Cadene

Really nice work Jade! Thanks :)

Let's wait for the next version of torchcodec then!

In the meantime, could you try reproducing results on pusht and aloha transfer cube? and adding the commands that you use and the success rate in the README?

THanks!

lerobot/common/datasets/video_utils.py

lerobot/common/datasets/lerobot_dataset.py

Co-authored-by: Remi <re.cadene@gmail.com>

for more information, see https://pre-commit.ci

jadechoghari · 2025-03-08T08:17:52Z

Torchcodec consistently outperforms pyav across all datasets and video codecs (encoders), it achieves lower MSE (better accuracy), higher PSNR (better quality), and higher SSIM (better perceptual similarity). this trend is evident across libsvtav1, libx264, and libx265, and it makes torchcodec the superior choice for both efficiency and quality. To reproduce the full results, check this link

jadechoghari and others added 3 commits March 2, 2025 20:47

add torchcodec cpu

4e2dc91

[pre-commit.ci] auto fixes from pre-commit.com hooks

e8126dc

for more information, see https://pre-commit.ci

add dependency

2f9cbfb

jadechoghari marked this pull request as draft March 3, 2025 06:49

jadechoghari and others added 2 commits March 3, 2025 07:25

add dependency

a963dba

[pre-commit.ci] auto fixes from pre-commit.com hooks

a8fcd35

for more information, see https://pre-commit.ci

jadechoghari marked this pull request as ready for review March 3, 2025 07:32

Cadene self-requested a review March 4, 2025 08:31

Cadene reviewed Mar 4, 2025

View reviewed changes

jadechoghari and others added 5 commits March 4, 2025 13:27

Update lerobot/common/datasets/video_utils.py

c03b0db

Co-authored-by: Remi <re.cadene@gmail.com>

Update lerobot/common/datasets/video_utils.py

e1732b4

Co-authored-by: Remi <re.cadene@gmail.com>

Update lerobot/common/datasets/lerobot_dataset.py

7449060

Co-authored-by: Remi <re.cadene@gmail.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

9200792

for more information, see https://pre-commit.ci

fix arg

2295e6c

imstevenpmwork added enhancement Suggestions for new features or improvements performance Issues aimed at improving speed or resource usage labels Mar 4, 2025

update benchmark to new dataset format

0b379e9

jadechoghari added 2 commits March 12, 2025 21:02

update torchcodec version

1a1740d

Merge branch 'main' into torchcodec-cpu

bb7542d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add torchcodec cpu #798

Add torchcodec cpu #798

jadechoghari commented Mar 3, 2025 •

edited

Loading

Cadene left a comment

jadechoghari commented Mar 8, 2025

Add torchcodec cpu #798

Are you sure you want to change the base?

Add torchcodec cpu #798

Conversation

jadechoghari commented Mar 3, 2025 • edited Loading

What this does

How it was tested

How to checkout & try? (for the reviewer)

Benchmarks

What's left

Cadene left a comment

Choose a reason for hiding this comment

jadechoghari commented Mar 8, 2025

jadechoghari commented Mar 3, 2025 •

edited

Loading