Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Exporting: Occurrences to Darwin Core Archive format #298

Open
mihow opened this issue Oct 31, 2023 · 1 comment
Open

Exporting: Occurrences to Darwin Core Archive format #298

mihow opened this issue Oct 31, 2023 · 1 comment
Assignees

Comments

@mihow
Copy link
Collaborator

mihow commented Oct 31, 2023

Start with management command for creating an export from the backend

Could use a serializer, pandas DataFrame or manual way of serializing to a flat CSV

Start with Darwin core format
Others:

  • COCO (for detections)

See existing PR as a start #634

@mihow mihow self-assigned this Jun 10, 2024
@mihow mihow changed the title Support for exporting Support for exporting: Occurrences in Darwin Core Feb 7, 2025
@mihow mihow changed the title Support for exporting: Occurrences in Darwin Core Support for exporting: Occurrences in Darwin Core Archive Feb 7, 2025
@mihow mihow changed the title Support for exporting: Occurrences in Darwin Core Archive Exporting: Occurrences to Darwin Core Archive format Feb 7, 2025
@mohamedelabbas1996 mohamedelabbas1996 linked a pull request Feb 18, 2025 that will close this issue
@mohamedelabbas1996 mohamedelabbas1996 self-assigned this Feb 18, 2025
@mihow
Copy link
Collaborator Author

mihow commented Mar 12, 2025

Here are all the formats available from Label Studio. You can see these are aimed at different use cases for research or compatibility with other common workflows.

Format Description Tags
JSON List of items in raw JSON format stored in one JSON file. Use to export both the data and the annotations for a dataset. It's Label Studio Common Format  
JSON_MIN List of items where only "from_name", "to_name" values from the raw JSON format are exported. Use to export only the annotations for a dataset.  
CSV Results are stored as comma-separated values with the column names specified by the values of the "from_name" and "to_name" fields.  
TSV Results are stored in tab-separated tabular file with column names specified by "from_name" "to_name" values  
COCO Popular machine learning format used by the COCO dataset for object detection and image segmentation tasks with polygons and rectangles. image segmentation, object detection
COCO_WITH_IMAGES COCO format with images downloaded. image segmentation, object detection
VOC Popular XML format used for object detection and polygon image segmentation tasks. image segmentation, object detection
YOLO Popular TXT format is created for each image file. Each txt file contains annotations for the corresponding image file, that is object class, object coordinates, height & width. image segmentation, object detection
YOLO_WITH_IMAGES YOLO format with images downloaded. image segmentation, object detection
YOLO_OBB Popular TXT format is created for each image file. Each txt file contains annotations for the corresponding image file. The YOLO OBB format designates bounding boxes by their four corner points with coordinates normalized between 0 and 1, so it is possible to export rotated objects. image segmentation, object detection
YOLO_OBB_WITH_IMAGES YOLOv8 OBB format with images downloaded. image segmentation, object detection
CONLL2003 Popular format used for the CoNLL-2003 named entity recognition challenge. sequence labeling, text tagging, named entity recognition
BRUSH_TO_NUMPY Export your brush labels as NumPy 2d arrays. Each label outputs as one image. image segmentation
BRUSH_TO_PNG Export your brush labels as PNG images. Each label outputs as one image. image segmentation
ASR_MANIFEST Export audio transcription labels for automatic speech recognition as the JSON manifest format expected by NVIDIA NeMo models. speech recognition

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants