A high-performance, zero-overhead, extensible Python compiler using LLVM
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
Official repository of "Visual-RFT: Visual Reinforcement Fine-Tuning"
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
A treasure chest for visual classification and recognition powered by PaddlePaddle
Ultralytics YOLO11 🚀
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.
OpenMMLab Computer Vision Foundation
OpenMMLab Foundational Library for Training Deep Learning Models
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
OpenMMLab Image Classification Toolbox and Benchmark
OpenMMLab Pre-training Toolbox and Benchmark
[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Real-time multi-camera multi-object tracker using YOLOv5 and StrongSORT with OSNet