Current support format
Currently, the following formats are supported:
| from | to | implemented |
| :--------: | :--------------------: | ----------- |
| PASCAL VOC | YOLO(TXT files) | Yes |
| YOLO | PASCAL VOC (XML files) | Yes |
Upcoming support format
| from | to | Issue/PR(if any) |
| :--------: | :-----------------------: | ---------------- |
| PASCAL VOC | COCO (JSON files) | No |
| PASCAL VOC | TFRecord (TFRecord files) | No |
| COCO | PASCAL VOC (XML files) | No |
| COCO | YOLO (TXT files) | No |
| COCO | TFRecord (TFRecord files) | No |
| YOLO | COCO (JSON files) | No |
| YOLO | TFRecord (TFRecord files) | No |
Installation
Installation from source code
bash
git clone https://github.com/codePerfectPlus/dataset-convertor/
bash
cd dataset-convertor
python -m venv venv
bash
source venv/bin/activate
pip install requirements.txt
Installation from PyPI
bash
pip install dataset-convertor
Usage
convert annotations from one format to another format.
dataset formatting example:
- data/pascal_voc/JPEGImages/*.jpg
- data/pascal_voc/Annotations/*.xml
- data/yolo5/JPEGImages/*.jpg
- data/yolo5/labels/*.txt
Pascal VOC(xml) to yolo(txt)
bash
from convert import Convertor
con = Convertor(input_folder='/home/user/data/pascal_voc', output_folder='/home/user/data/yolo5')
con.voc2yolo()
from yolo(txt) to Pascal VOC(xml)
bash
from convert import Convertor
con = Convertor(input_folder='/home/user/data/yolo5', output_folder='/home/user/data/pascal_voc')
con.yolo2voc()