aloha_hd5 to LerobotDataset v2 #586

Raziel90 · 2024-12-17T12:35:51Z

What this does

Adds Script for the conversion of aloha_hd5 dataset format to LerobotDataset (v2)

The scripts decouples the components dependant on the Original dataset format to those that create the dataset.
The script is easy to generalise to other datasets as long as an equivalent of the AlohaHD5Extractor (check code) is provided

NB: Automated testing will be provided in a separate PR.

How it was tested

Launched the script to upload the dataset in multiple ways (e.g. images as images or as videos, different episode descriptions)

These datasets are uploaded with this
https://huggingface.co/datasets/ccop/aloha-hd5-conversion-test
https://huggingface.co/datasets/ccop/test-again

Examples:

Added examples/port_datasets/aloha_hd5.py

How to checkout & try? (for the reviewer)

You can try on an hd5 dataset using the following. (change the arguments to fit your dataset).

python examples/port_datasets/aloha_hd5.py --fps=50 --raw-path=/home/ccop/code/aloha_data/aloha_stationary_pick_place_bottle/  --dataset-repo-id=ccop/test-again --video-encoding=false --push=True --description="Aloha HD5 dataset - Bottle Pick and Place"

Michael-Equi · 2024-12-19T22:11:02Z

Hi Raziel. Thanks for writing this! I think there is a bug in the implementation you are submitting that results in saving only one frame per episode. The fix should just be to tab in this line https://github.com/huggingface/lerobot/pull/586/files#diff-8a6e4cf809bbdbbde9c6d8d6887e9c3d1537f3dda6179c2bf819a04984bb9fd4R131

Raziel90 · 2025-01-09T14:12:20Z

Hi @Michael-Equi do you mean line 135?

frames.append(frame)

I'm fixing that

…scope

Raziel90 · 2025-01-27T18:45:56Z

Hi @Michael-Equi, I fixed it if you want to do a double check :)

pprett · 2025-02-05T12:58:43Z

examples/port_datasets/aloha_hd5.py

+                        "dtype": "video" if encode_as_video else "image",
+                        "shape": cv2.imdecode(hdf5_file[topic][0], 1).transpose(2, 0, 1).shape
+                        if image_compressed
+                        else sample.shape,


does aloha use shape (c, h, w) when uncompressed? I remember it to be (h, w, c)

I will give a check, but if I have transposed there, there is a reason that I don't recall right now.
The result seems good here

Takuzenn · 2025-02-22T17:56:01Z

Hi @Raziel90 I tried but it is very slow, do you know why?

Raziel90 · 2025-02-24T12:33:03Z

Hi @Takuzenn can you elaborate what you tried to run and why you think is slow?
In the script I am not doing any data heavy changes.

Have you tried to change set the number of workers? are you converting images to video or are you keeping images?

for more information, see https://pre-commit.ci

Raziel90 and others added 2 commits December 17, 2024 12:19

aloha_hd5 to LerobotDataset v2

899c76a

Merge branch 'main' into aloha_hd5_to_dataset_v2

b65fd44

Raziel90 and others added 4 commits December 20, 2024 09:09

observations -> observation, qpos -> state

72e0d24

Merge branch 'main' into aloha_hd5_to_dataset_v2

fe5ce5e

Merge branch 'main' into aloha_hd5_to_dataset_v2

3d4d02e

Merge branch 'main' into aloha_hd5_to_dataset_v2

0d9a0cd

Raziel90 and others added 3 commits January 27, 2025 17:00

Merge branch 'main' into aloha_hd5_to_dataset_v2

283545f

fix: aloha_hd5 to LerobotDataset v2 frame appending out of the right …

d8e4a2c

…scope

fix: aloha_hd5 to LerobotDataset v2 frame appending out of the right …

9a6ca75

…scope

Raziel90 mentioned this pull request Jan 28, 2025

Add port aloha hdf5 datasets #659

Draft

Raziel90 added 2 commits January 28, 2025 13:23

Merge branch 'main' into aloha_hd5_to_dataset_v2

82fefed

Merge branch 'main' into aloha_hd5_to_dataset_v2

be80ac1

pprett reviewed Feb 5, 2025

View reviewed changes

Merge branch 'main' into aloha_hd5_to_dataset_v2

f7b84fa

Merge branch 'main' into aloha_hd5_to_dataset_v2

dca5c22

Raziel90 and others added 4 commits March 4, 2025 13:53

Merge branch 'main' into aloha_hd5_to_dataset_v2

56b546b

[pre-commit.ci] auto fixes from pre-commit.com hooks

eadfd67

for more information, see https://pre-commit.ci

Merge branch 'main' into aloha_hd5_to_dataset_v2

7e41c0f

Merge branch 'main' into aloha_hd5_to_dataset_v2

e5d3ed4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

aloha_hd5 to LerobotDataset v2 #586

aloha_hd5 to LerobotDataset v2 #586

Raziel90 commented Dec 17, 2024

Michael-Equi commented Dec 19, 2024

Raziel90 commented Jan 9, 2025

Raziel90 commented Jan 27, 2025

pprett Feb 5, 2025

Raziel90 Feb 12, 2025

Takuzenn commented Feb 22, 2025 •

edited

Loading

Raziel90 commented Feb 24, 2025

aloha_hd5 to LerobotDataset v2 #586

Are you sure you want to change the base?

aloha_hd5 to LerobotDataset v2 #586

Conversation

Raziel90 commented Dec 17, 2024

What this does

How it was tested

How to checkout & try? (for the reviewer)

Michael-Equi commented Dec 19, 2024

Raziel90 commented Jan 9, 2025

Raziel90 commented Jan 27, 2025

pprett Feb 5, 2025

Choose a reason for hiding this comment

Raziel90 Feb 12, 2025

Choose a reason for hiding this comment

Takuzenn commented Feb 22, 2025 • edited Loading

Raziel90 commented Feb 24, 2025

Takuzenn commented Feb 22, 2025 •

edited

Loading