Computer Vision, Image Processing, Video Processing

Computer Vision, Image Processing, Video Processing

2024-12-04. Category & Tags: CV, Yolo, OpenCV, LLM, Large Language Model, 大模型

See also:

OpenCV
Yolo

Image Labelling #

roboflow online
python lib chaiwenda/ImgLabel
python HumanSignal/label-studio, replacing HumanSignal/labelImg (archived 2024.02)
MakeSense.ai

特性/工具	LabelImg	VoTT	Labelme	CVAT
支持的标注类型	边界框	边界框、多边形、点、区域等	边界框、点、线、多边形、自由形状等	边界框、多边形、点、轨迹、骨架、线等
自动化标注	无	支持（集成模型推理）	无	支持（集成 OpenVINO 等模型）
用户界面	简单、直观	现代化界面，但配置较复杂	基础的用户界面	功能强大但学习曲线陡峭
操作系统支持	Windows, Mac, Linux	Windows, Mac, Linux	Windows, Mac, Linux	Web 应用，跨平台
文件格式支持	Pascal VOC, YOLO, CSV 等	Pascal VOC, COCO, YOLO, TFRecord 等	JSON（可导出至多种格式）	COCO, Pascal VOC, YOLO, TFRecord, MOT 等
资源占用	低	较高（特别是集成模型时）	低	高（适合在服务器环境下运行）
团队协作	不支持	不支持	不支持	支持多人协作，任务分配，审核流程等
自动化功能	无	半自动化标注，依赖集成模型	无	半自动化标注，依赖集成模型
定制化能力	基本不支持	支持一定程度的集成	高度可定制（开源）	高度可定制（开源）
部署难度	低（安装简单）	中等（需配置模型推理）	低（安装简单）	高（需要在服务器上部署）
适用场景	小型项目，轻量级标注	复杂标注任务，自动化标注需求	需要灵活标注类型，支持自定义属性	大规模项目，多用户协作，自动化标注需求

ref: cnblogs (bak)

Common Formats Explained (esp. available on Roboflow) #

Format	Used with/by
COCO JSON	EfficientDet Pytorch; Detectron 2
Pascal VOC XML	Common XML annotation format for local data munging (pioneered by ImageNet).

Yolo Darknet	Darknet TXT annotations used with YOLO Darknet (both v3 and v4) and YOLOv3 PyTorch.
TFRecord	TFRecord binary format used for both Tensorflow 1.5 and Tensorflow 2.0 Object Detection models.
PaliGemma	PaliGemma JSONL format used for fine-tuning PaliGemma, Google’s open multimodal vision model.
CreateML JSON	CreateML JSON format is used with Apple’s CreateML and Turi Create tools.