Computer Vision, Image Processing, Video Processing

2024-12-04. Category & Tags: CV, Yolo, OpenCV, LLM, Large Language Model, Large Models

Image Labelling #

Common Formats Explained (esp. available on Roboflow) #

| Format | Used with/by |
| --- | --- |
| COCO JSON | EfficientDet PyTorch; Detectron2 |
| Pascal VOC XML | Common XML annotation format for local data munging (originated with the PASCAL VOC challenge and also used by ImageNet). |
| YOLO Darknet | Darknet TXT annotations used with YOLO Darknet (both v3 and v4) and YOLOv3 PyTorch. |
| TFRecord | TFRecord binary format used for both TensorFlow 1.5 and TensorFlow 2.0 Object Detection models. |
| PaliGemma | PaliGemma JSONL format used for fine-tuning PaliGemma, Google’s open multimodal vision model. |
| CreateML JSON | CreateML JSON format used with Apple’s CreateML and Turi Create tools. |
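
The three most common box conventions above differ mainly in how a bounding box is written down. Pascal VOC stores pixel corners (xmin, ymin, xmax, ymax), COCO stores [x, y, width, height] in pixels from the top-left corner, and YOLO Darknet stores a class id followed by the box center and size normalized by the image dimensions. The following is a minimal sketch of converting between them. The function names are hypothetical helpers made up for illustration, not part of any library, and the sketch assumes 0-based pixel coordinates (it ignores the 1-based offset found in some original VOC files).

```python
def voc_to_coco(xmin, ymin, xmax, ymax):
    """Pascal VOC corners -> COCO [x, y, width, height] in pixels."""
    return [xmin, ymin, xmax - xmin, ymax - ymin]


def voc_to_yolo(xmin, ymin, xmax, ymax, img_w, img_h):
    """Pascal VOC corners -> YOLO (cx, cy, w, h), each normalized to 0..1."""
    cx = (xmin + xmax) / 2 / img_w   # box center x, relative to image width
    cy = (ymin + ymax) / 2 / img_h   # box center y, relative to image height
    w = (xmax - xmin) / img_w        # box width, relative to image width
    h = (ymax - ymin) / img_h        # box height, relative to image height
    return cx, cy, w, h


def yolo_line(class_id, box):
    """Format one line of a YOLO Darknet .txt label file."""
    return f"{class_id} " + " ".join(f"{v:.6f}" for v in box)


if __name__ == "__main__":
    # Hypothetical box on a 640x480 image, class id 0.
    print(voc_to_coco(100, 50, 300, 250))
    print(yolo_line(0, voc_to_yolo(100, 50, 300, 250, 640, 480)))
```

Normalized YOLO coordinates stay valid when the image is resized, whereas COCO and Pascal VOC pixel boxes have to be rescaled together with the image.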