[![PyPI](https://img.shields.io/pypi/v/pai-easycv)](https://pypi.org/project/pai-easycv/) [![Documentation Status](https://readthedocs.org/projects/easy-cv/badge/?version=latest)](https://easy-cv.readthedocs.io/en/latest/) [![license](https://img.shields.io/github/license/alibaba/EasyCV.svg)](https://github.com/open-mmlab/mmdetection/blob/master/LICENSE) [![open issues](https://isitmaintained.com/badge/open/alibaba/EasyCV.svg)](https://github.com/alibaba/EasyCV/issues) [![GitHub pull-requests](https://img.shields.io/github/issues-pr/alibaba/EasyCV.svg)](https://GitHub.com/alibaba/EasyCV/pull/) [![GitHub latest commit](https://badgen.net/github/last-commit/alibaba/EasyCV)](https://GitHub.com/alibaba/EasyCV/commit/)

# EasyCV [English](README.md) | 简体中文 ## 简介 EasyCV是一个涵盖多个领域的基于Pytorch的计算机视觉工具箱，聚焦自监督学习和视觉transformer关键技术，覆盖主流的视觉建模任务例如图像分类，度量学习，目标检测，关键点检测等。 ### 核心特性 - **SOTA 自监督算法** EasyCV提供了state-of-the-art的自监督算法，有基于对比学习的算法例如 SimCLR，MoCO V2，Swav， Moby，DINO，也有基于掩码图像建模的MAE算法，除此之外我们还提供了标准的benchmark工具用来进行自监督算法模型的效果评估。 - **视觉Transformers** EasyCV聚焦视觉transformer技术，希望通过一种简洁的方式让用户方便地使用各种SOTA的、基于自监督预训练和imagenet预训练的视觉transformer模型，例如ViT，Swin-Transformer，Shuffle Transformer，未来也会加入更多相关模型。此外，我们还支持所有[timm](https://github.com/rwightman/pytorch-image-models)仓库中的预训练模型. - **易用性和可扩展性** 除了自监督学习，EasyCV还支持图像分类、目标检测，度量学习，关键点检测等领域，同时未来也会支持更多任务领域。尽管横跨多个任务领域，EasyCV保持了统一的架构，整体框架划分为数据集、模型、回调模块，非常容易增加新的算法、功能，以及基于现有模块进行扩展。推理方面，EasyCV提供了端到端的简单易用的推理接口，支持上述多个领域。此外所有的模型都支持使用[PAI-EAS](https://help.aliyun.com/document_detail/113696.html)进行在线部署，支持自动伸缩和服务监控。 - **高性能** EasyCV支持多机多卡训练，同时支持[TorchAccelerator](docs/source/tutorials/torchacc.md)和fp16进行训练加速。在数据读取和预处理方面，EasyCV使用[DALI](https://github.com/NVIDIA/DALI)进行加速。对于模型推理优化，EasyCV支持使用jit script导出模型，使用[PAI-Blade](https://help.aliyun.com/document_detail/205134.html)进行模型优化。 ## 最新进展 [🔥 2023.05.09] * 09/05/2023 EasyCV v0.11.0 was released. - 支持EasyCV作为[modelscope](https://github.com/modelscope/modelscope)插件接入模型. [🔥 2023.03.06] * 06/03/2023 EasyCV v0.10.0 was released. - 增加分割模型STDC - 增加基于关键点的视频分类模型STGCN - 支持多镜头多目标跟踪 [🔥 2023.01.17] * 17/01/2023 EasyCV v0.9.0 was released. - 支持单镜头多目标跟踪。 - 增加视频识别算法 (X3D, SWIN-video) [🔥 2022.12.02] * 02/12/2022 EasyCV v0.8.0 版本发布. - bevformer-base NDS在nuscenes val上提升0.8，训练速度提升10%，推理速度提升40%。 - 支持Objects365预训练，加入DINO++模型在200M模型规模下可达到63.4mAP的精度（同等规模下精度最佳）。 [🔥 2022.08.31] 近期我们开源了YOLOX-PAI，在40-50mAP(推理速度小于1ms)范围内达到了业界的SOTA水平。同时EasyCV提供了一套简洁高效的模型导出和预测接口，供用户快速完成端到端的图像检测任务。如果你想快速了解YOLOX-PAI, 点击 [这里](docs/source/tutorials/yolox.md)! * 31/08/2022 EasyCV v0.6.0 版本发布。 - 发布YOLOX-PAI，在轻量级模型中取得SOTA效果 - 增加检测算法DINO， COCO mAP 58.5 - 增加Mask2Former算法 - Datahub新增imagenet1k, imagenet22k, coco, lvis, voc2012 数据的百度网盘链接，加速下载更多版本的详细信息请参考[变更记录](docs/source/change_log.md)。 ## 技术文章我们有一系列关于EasyCV功能的技术文章。 * [EasyCV开源｜开箱即用的视觉自监督+Transformer算法库](https://zhuanlan.zhihu.com/p/505219993) * [MAE自监督算法介绍和基于EasyCV的复现](https://zhuanlan.zhihu.com/p/515859470) * [基于EasyCV复现ViTDet：单层特征超越FPN](https://zhuanlan.zhihu.com/p/528733299) * [基于EasyCV复现DETR和DAB-DETR，Object Query的正确打开方式](https://zhuanlan.zhihu.com/p/543129581) * [YOLOX-PAI: 加速YOLOX, 比YOLOv6更快更强](https://zhuanlan.zhihu.com/p/560597953) * [EasyCV带你复现更好更快的自监督算法-FastConvMAE](https://zhuanlan.zhihu.com/p/566988235) * [EasyCV DataHub 提供多领域视觉数据集下载，助力模型生产](https://zhuanlan.zhihu.com/p/572593950) * [使用EasyCV Mask2Former轻松实现图像分割](https://zhuanlan.zhihu.com/p/583831421) ## 安装请参考[快速开始教程](docs/source/quick_start.md)中的安装章节。 ## 快速开始请参考[快速开始教程](docs/source/quick_start.md) 快速开始。我们也提供了更多的教程方便你的学习和使用。 * [自监督学习教程](docs/source/tutorials/ssl.md) * [图像分类教程](docs/source/tutorials/cls.md) * [度量学习教程](docs/source/tutorials/metric_learning.md) * [使用YOLOX-PAI进行物体检测教程](docs/source/tutorials/yolox.md) * [YOLOX模型压缩教程](docs/source/tutorials/compression.md) * [torchacc使用](docs/source/tutorials/torchacc.md) * [本地/oss文件读取](docs/source/tutorials/file.md) * [mmdetection模型使用](docs/source/tutorials/mmdet_models_usage_guide.md) * [批量推理工具](docs/source/tutorials/predict.md) ## 模型库

模型

自监督学习

图像分类

目标检测

分割

3D目标检测

实例分割

语义分割

全景分割

Mask2Former (CVPR'2022)

BEVFormer (ECCV'2022)

不同领域的模型仓库和benchmark指标如下 - [自监督模型库](docs/source/model_zoo_ssl.md) - [图像分类模型库](docs/source/model_zoo_cls.md) - [目标检测模型库](docs/source/model_zoo_det.md) - [3D目标检测模型库](docs/source/model_zoo_det3d.md) - [图像分割模型库](docs/source/model_zoo_seg.md) - [关键点模型库](docs/source/model_zoo_pose.md) ## 开源许可证本项目使用 [Apache 2.0 开源许可证](LICENSE). 项目内含有一些第三方依赖库源码，部分实现借鉴其他开源仓库，仓库名称和开源许可证说明请参考[NOTICE文件](NOTICE)。 ## Contact 本项目由阿里云机器学习平台PAI-CV团队维护，你可以通过如下方式联系我们：钉钉群号: 41783266 邮箱: easycv@list.alibaba-inc.com ### 企业级服务如果你需要针对EasyCV提供企业级服务，或者购买云产品服务，你可以通过加入钉钉群联系我们。 ![dingding_qrcode](https://user-images.githubusercontent.com/4771825/165244727-b5d69628-97a6-4e2a-a23f-0c38a8d29341.jpg)