Exporting the Dataset

You can export datasets in Data Labeling in various text and image formats, and snapshot JSONL files.

You can export datasets in Data Labeling to any Object Storage location in the tenancy. Thus, you can maintain versions, or use the dataset elsewhere, for example, as an input to machine learning model development. The output file location is included in the export panel. After export, the destination is available in the associated work request. The destination is also displayed in the Dataset Details page, but only while the work request exists.

For documents, you can export to JSONL files.

For images, you have the choice of exporting to the following file formats:
  • JSONL
  • YOLO V5
  • COCO
  • PASCAL VOC
For text, you have the choice of exporting to the following file formats:
  • JSONL
  • JSONL Compact Plus Content
  • spaCy
  • CoNLL V2003
    Note

    If you export text in the CoNLL format, recursive and overlapping entities are ignored.
Note

For CSV, the only option is to export to JSONL.

Examples of Exported Document, Image, and Text Datasets

Examples of the JSON files created when a dataset is exported in Data Labeling.