Computer Vision, Image Processing, Video Processing
See also:
- OpenCV
- Yolo
Image Labelling
- Roboflow (online)
- chaiwenda/ImgLabel (Python library)
- HumanSignal/label-studio (Python), which replaces HumanSignal/labelImg (archived 2024.02)
- MakeSense.ai
Common Formats Explained (esp. available on Roboflow)
Format | Used with/by |
---|---|
COCO JSON | EfficientDet PyTorch; Detectron2 |
Pascal VOC XML | Common XML annotation format for local data munging (pioneered by ImageNet). |
YOLO Darknet | Darknet TXT annotations used with YOLO Darknet (both v3 and v4) and YOLOv3 PyTorch. |
TFRecord | TFRecord binary format used for both TensorFlow 1.5 and TensorFlow 2.0 Object Detection models. |
PaliGemma | PaliGemma JSONL format used for fine-tuning PaliGemma, Google’s open multimodal vision model. |
CreateML JSON | CreateML JSON format is used with Apple’s CreateML and Turi Create tools. |
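
The practical difference between these formats is mostly how a bounding box is written down. A minimal sketch of the usual conventions, assuming COCO stores `[x_min, y_min, width, height]` in pixels, YOLO Darknet stores one `class x_center y_center width height` line per box normalised to the image size, and Pascal VOC stores the two corners. The function names are illustrative and do not come from any library.

```python
def coco_to_yolo(bbox, img_w, img_h):
    """COCO [x_min, y_min, width, height] in pixels ->
    YOLO (x_center, y_center, width, height), each normalised to 0..1."""
    x_min, y_min, w, h = bbox
    return (
        (x_min + w / 2) / img_w,
        (y_min + h / 2) / img_h,
        w / img_w,
        h / img_h,
    )


def coco_to_voc(bbox):
    """COCO [x_min, y_min, width, height] -> corner form (xmin, ymin, xmax, ymax)."""
    x_min, y_min, w, h = bbox
    return (x_min, y_min, x_min + w, y_min + h)


def yolo_line(class_id, bbox, img_w, img_h):
    """Format one box as a line for a Darknet-style .txt annotation file."""
    xc, yc, w, h = coco_to_yolo(bbox, img_w, img_h)
    return f"{class_id} {xc:.6f} {yc:.6f} {w:.6f} {h:.6f}"


if __name__ == "__main__":
    box = [100, 50, 200, 100]  # hypothetical COCO box in a 400x200 image
    print(yolo_line(0, box, img_w=400, img_h=200))
    print(coco_to_voc(box))
```

Exporting from Roboflow or a labelling tool normally handles this conversion, so a helper like this is mainly useful for sanity-checking exported annotations.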