PaddleClas/README_en.md

[简体中文](README_ch.md) | English

# PaddleClas

## Introduction

PaddleClas is an image classification and image recognition toolset for industry and academia, helping users train better computer vision models and apply them in real scenarios.

<div align="center">
<img src="./docs/images/class_simple.gif"  width = "600" />

PULC demo images
</div>
&nbsp;


<div align="center">
<img src="./docs/images/recognition.gif"  width = "400" />

PP-ShiTu demo images
</div>

**Recent updates**
- 2022.6.15 Release **P**ractical **U**ltra **L**ight-weight image **C**lassification solutions. PULC models inference within 3ms on CPU devices, with accuracy comparable with SwinTransformer. We also release 9 practical models covering pedestrian, vehicle and OCR.
- 2022.4.21 Added the related [code](https://github.com/PaddlePaddle/PaddleClas/pull/1820/files) of the CVPR2022 oral paper [MixFormer](https://arxiv.org/pdf/2204.02557.pdf).

- 2021.09.17 Add PP-LCNet series model developed by PaddleClas, these models show strong competitiveness on Intel CPUs.
For the introduction of PP-LCNet, please refer to [paper](https://arxiv.org/pdf/2109.15099.pdf) or [PP-LCNet model introduction](docs/en/models/PP-LCNet_en.md). The metrics and pretrained model are available [here](docs/en/ImageNet_models_en.md).

- 2021.06.29 Add Swin-transformer series model，Highest top1 acc on ImageNet1k dataset reaches 87.2%, training, evaluation and inference are all supported. Pretrained models can be downloaded [here](docs/en/models/models_intro_en.md).
- 2021.06.16 PaddleClas release/2.2. Add metric learning and vector search modules. Add product recognition, animation character recognition, vehicle recognition and logo recognition. Added 30 pretrained models of LeViT, Twins, TNT, DLA, HarDNet, and RedNet, and the accuracy is roughly the same as that of the paper.
- [more](./docs/en/update_history_en.md)

## Features

PaddleClas release PP-HGNet、PP-LCNetv2、 PP-LCNet and **S**imple **S**emi-supervised **L**abel **D**istillation algorithms, and support plenty of 
image classification and image recognition algorithms. 
Based on th algorithms above, PaddleClas release PP-ShiTu image recognition system and **P**ractical **U**ltra **L**ight-weight image **C**lassification solutions.


![](https://user-images.githubusercontent.com/19523330/173347904-f2998e00-7b86-4adf-b546-23c684fc67b9.png)

## Welcome to Join the Technical Exchange Group

* You can also scan the QR code below to join the PaddleClas QQ group and WeChat group (add and replay "C") to get more efficient answers to your questions and to communicate with developers from all walks of life. We look forward to hearing from you.

<div align="center">
<img src="https://user-images.githubusercontent.com/80816848/164383225-e375eb86-716e-41b4-a9e0-4b8a3976c1aa.jpg" width="200"/>
<img src="https://user-images.githubusercontent.com/48054808/160531099-9811bbe6-cfbb-47d5-8bdb-c2b40684d7dd.png" width="200"/>
</div>

## Quick Start
Quick experience of PP-ShiTu image recognition system：[Link](./docs/en/tutorials/quick_start_recognition_en.md)
Quick experience of **P**ractical **U**ltra **L**ight-weight image **C**lassification models：[Link](docs/zh_CN/PULC/PULC_quickstart.md)

## Tutorials

- [Quick Installation](./docs/en/tutorials/install_en.md)
- [Practical Ultra Light-weight image Classification solutions](./docs/en/)
- [Quick Start of Recognition](./docs/en/tutorials/quick_start_recognition_en.md)
- [Introduction to Image Recognition Systems](#Introduction_to_Image_Recognition_Systems)
- [Demo images](#Demo_images)
- Algorithms Introduction
    - [Backbone Network and Pre-trained Model Library](./docs/en/ImageNet_models_en.md)
    - [Mainbody Detection](./docs/en/application/mainbody_detection_en.md)
    - [Image Classification](./docs/en/tutorials/image_classification_en.md)
    - [Feature Learning](./docs/en/application/feature_learning_en.md)
        - [Product Recognition](./docs/en/application/product_recognition_en.md)
        - [Vehicle Recognition](./docs/en/application/vehicle_recognition_en.md)
        - [Logo Recognition](./docs/en/application/logo_recognition_en.md)
        - [Animation Character Recognition](./docs/en/application/cartoon_character_recognition_en.md)
    - [Vector Search](./deploy/vector_search/README.md)
- Models Training/Evaluation
    - [Image Classification](./docs/en/tutorials/getting_started_en.md)
    - [Feature Learning](./docs/en/tutorials/getting_started_retrieval_en.md)
- Inference Model Prediction
    - [Python Inference](./docs/en/inference.md)
    - [C++ Classfication Inference](./deploy/cpp/readme_en.md)， [C++ PP-ShiTu Inference](deploy/cpp_shitu/readme_en.md)
- Model Deploy (only support classification for now, recognition coming soon)
    - [Hub Serving Deployment](./deploy/hubserving/readme_en.md)
    - [Mobile Deployment](./deploy/lite/readme_en.md)
    - [Inference Using whl](./docs/en/whl_en.md)
- Advanced Tutorial
    - [Knowledge Distillation](./docs/en/advanced_tutorials/distillation/distillation_en.md)
    - [Model Quantization](./docs/en/extension/paddle_quantization_en.md)
    - [Data Augmentation](./docs/en/advanced_tutorials/image_augmentation/ImageAugment_en.md)
- [License](#License)
- [Contribution](#Contribution)

<a name="Introduction_to_PULC"></a>
## Introduction to Practical Ultra Light-weight image Classification solutions
<div align="center">
<img src="https://user-images.githubusercontent.com/19523330/173011854-b10fcd7a-b799-4dfd-a1cf-9504952a3c44.png"  width = "800" />
</div>
PULC solutions consists of PP-LCNet light-weight backbone, SSLD pretrained models, Ensemble of Data Augmentation strategy and SKL-UGI knowledge distillation.
PULC models inference within 3ms on CPU devices, with accuracy comparable with SwinTransformer. We also release 9 practical models covering pedestrian, vehicle and OCR.

<a name="Introduction_to_Image_Recognition_Systems"></a>
## Introduction to Image Recognition Systems

<div align="center">
<img src="./docs/images/structure.jpg"  width = "800" />
</div>

Image recognition can be divided into three steps:
- （1）Identify region proposal for target objects through a detection model；
- （2）Extract features for each region proposal;
- （3）Search features in the retrieval database and output results;

For a new unknown category, there is no need to retrain the model, just prepare images of new category, extract features and update retrieval database and the category can be recognised.

## PULC demo images
<div align="center">
<img src="docs/images/classification.gif">
</div>

<a name="Rec_Demo_images"></a>
## Image Recognition Demo images [more](https://github.com/PaddlePaddle/PaddleClas/tree/release/2.2/docs/images/recognition/more_demo_images)
- Product recognition
<div align="center">
<img src="https://user-images.githubusercontent.com/18028216/122769644-51604f80-d2d7-11eb-8290-c53b12a5c1f6.gif"  width = "400" />
</div>

- Cartoon character recognition
<div align="center">
<img src="https://user-images.githubusercontent.com/18028216/122769746-6b019700-d2d7-11eb-86df-f1d710999ba6.gif"  width = "400" />
</div>

- Logo recognition
<div align="center">
<img src="https://user-images.githubusercontent.com/18028216/122769837-7fde2a80-d2d7-11eb-9b69-04140e9d785f.gif"  width = "400" />
</div>

- Car recognition
<div align="center">
<img src="https://user-images.githubusercontent.com/18028216/122769916-8ec4dd00-d2d7-11eb-8c60-42d89e25030c.gif"  width = "400" />
</div>

<a name="License"></a>
## License
PaddleClas is released under the Apache 2.0 license <a href="https://github.com/PaddlePaddle/PaddleCLS/blob/master/LICENSE">Apache 2.0 license</a>


<a name="Contribution"></a>
## Contribution
Contributions are highly welcomed and we would really appreciate your feedback!!


- Thank [nblib](https://github.com/nblib) to fix bug of RandErasing.
- Thank [chenpy228](https://github.com/chenpy228) to fix some typos PaddleClas.
- Thank [jm12138](https://github.com/jm12138) to add ViT, DeiT models and RepVGG models into PaddleClas.
- Thank [FutureSI](https://aistudio.baidu.com/aistudio/personalcenter/thirdview/76563) to parse and summarize the PaddleClas code.
-												Update README_en.md
											
										
										
											2021-06-21 17:16:56 +08:00
+								[简体中文](README_ch.md) | English
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
 								# PaddleClas
 								## Introduction
-												update readme

											
										
										
											2022-06-14 10:42:27 +08:00
+								PaddleClas is an image classification and image recognition toolset for industry and academia, helping users train better computer vision models and apply them in real scenarios.
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
-												update readme

											
										
										
											2022-06-14 10:42:27 +08:00
+								<div align="center">
 								<img src="./docs/images/class_simple.gif"  width = "600" />
 								PULC demo images
 								</div>
 								&nbsp;
 								<div align="center">
 								<img src="./docs/images/recognition.gif"  width = "400" />
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
-												update readme

											
										
										
											2022-06-14 10:42:27 +08:00
+								PP-ShiTu demo images
 								</div>
 								**Recent updates**
 								- 2022.6.15 Release **P**ractical **U**ltra **L**ight-weight image **C**lassification solutions. PULC models inference within 3ms on CPU devices, with accuracy comparable with SwinTransformer. We also release 9 practical models covering pedestrian, vehicle and OCR.
-												fix typo in README

											
										
										
											2022-04-25 13:57:19 +08:00
+								- 2022.4.21 Added the related [code](https://github.com/PaddlePaddle/PaddleClas/pull/1820/files) of the CVPR2022 oral paper [MixFormer](https://arxiv.org/pdf/2204.02557.pdf).
-												Add MixFormer link to README

											
										
										
											2022-04-21 10:30:59 +08:00
-												docs: update wechat qr code

											
										
										
											2022-01-21 17:18:41 +08:00
+								- 2021.09.17 Add PP-LCNet series model developed by PaddleClas, these models show strong competitiveness on Intel CPUs.
-												Update PP-LCNet_en docs

											
										
										
											2021-10-18 16:21:10 +08:00
+								For the introduction of PP-LCNet, please refer to [paper](https://arxiv.org/pdf/2109.15099.pdf) or [PP-LCNet model introduction](docs/en/models/PP-LCNet_en.md). The metrics and pretrained model are available [here](docs/en/ImageNet_models_en.md).
-												Add LCNet docs

											
										
										
											2021-09-08 16:10:34 +08:00
-												add swin (#980)

* add swin transformer
											
										
										
											2021-06-29 12:27:57 +08:00
+								- 2021.06.29 Add Swin-transformer series model，Highest top1 acc on ImageNet1k dataset reaches 87.2%, training, evaluation and inference are all supported. Pretrained models can be downloaded [here](docs/en/models/models_intro_en.md).
-												Update ImageNet_models_en.md

											
										
										
											2021-07-07 19:39:26 +08:00
+								- 2021.06.16 PaddleClas release/2.2. Add metric learning and vector search modules. Add product recognition, animation character recognition, vehicle recognition and logo recognition. Added 30 pretrained models of LeViT, Twins, TNT, DLA, HarDNet, and RedNet, and the accuracy is roughly the same as that of the paper.
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
+								- [more](./docs/en/update_history_en.md)
 								## Features
-												update readme

											
										
										
											2022-06-14 10:42:27 +08:00
+								PaddleClas release PP-HGNet、PP-LCNetv2、 PP-LCNet and **S**imple **S**emi-supervised **L**abel **D**istillation algorithms, and support plenty of
 								image classification and image recognition algorithms.
 								Based on th algorithms above, PaddleClas release PP-ShiTu image recognition system and **P**ractical **U**ltra **L**ight-weight image **C**lassification solutions.
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
-												update readme

											
										
										
											2022-06-14 10:42:27 +08:00
+								![](https://user-images.githubusercontent.com/19523330/173347904-f2998e00-7b86-4adf-b546-23c684fc67b9.png)
-												Update README_en.md
											
										
										
											2021-06-21 17:05:30 +08:00
-												Update README_en.md

											
										
										
											2021-06-17 17:26:12 +08:00
+								## Welcome to Join the Technical Exchange Group
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
-												docs: update wechat qr code

											
										
										
											2022-04-21 14:07:39 +08:00
+								* You can also scan the QR code below to join the PaddleClas QQ group and WeChat group (add and replay "C") to get more efficient answers to your questions and to communicate with developers from all walks of life. We look forward to hearing from you.
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
 								<div align="center">
-												docs: update wechat qr code

											
										
										
											2022-04-21 14:07:39 +08:00
+								<img src="https://user-images.githubusercontent.com/80816848/164383225-e375eb86-716e-41b4-a9e0-4b8a3976c1aa.jpg" width="200"/>
 								<img src="https://user-images.githubusercontent.com/48054808/160531099-9811bbe6-cfbb-47d5-8bdb-c2b40684d7dd.png" width="200"/>
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
+								</div>
-												fix readme (#945)


											
										
										
											2021-06-21 17:12:49 +08:00
+								## Quick Start
-												update readme

											
										
										
											2022-06-14 10:42:27 +08:00
+								Quick experience of PP-ShiTu image recognition system：[Link](./docs/en/tutorials/quick_start_recognition_en.md)
 								Quick experience of **P**ractical **U**ltra **L**ight-weight image **C**lassification models：[Link](docs/zh_CN/PULC/PULC_quickstart.md)
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
 								## Tutorials
-												Update README_en.md
											
										
										
											2021-06-21 16:54:19 +08:00
+								- [Quick Installation](./docs/en/tutorials/install_en.md)
-												update readme

											
										
										
											2022-06-14 10:42:27 +08:00
+								- [Practical Ultra Light-weight image Classification solutions](./docs/en/)
-												update demo images

											
										
										
											2021-06-20 15:28:43 +08:00
+								- [Quick Start of Recognition](./docs/en/tutorials/quick_start_recognition_en.md)
-												update column

											
										
										
											2021-06-20 20:42:26 +08:00
+								- [Introduction to Image Recognition Systems](#Introduction_to_Image_Recognition_Systems)
 								- [Demo images](#Demo_images)
-												update column

											
										
										
											2021-06-20 17:44:01 +08:00
+								- Algorithms Introduction
-												Update ImageNet_models_en.md

											
										
										
											2021-07-07 19:39:26 +08:00
+								    - [Backbone Network and Pre-trained Model Library](./docs/en/ImageNet_models_en.md)
-												Update README_en.md
											
										
										
											2021-06-20 17:27:51 +08:00
+								    - [Mainbody Detection](./docs/en/application/mainbody_detection_en.md)
-												update english column

											
										
										
											2021-06-19 10:01:19 +08:00
+								    - [Image Classification](./docs/en/tutorials/image_classification_en.md)
 								    - [Feature Learning](./docs/en/application/feature_learning_en.md)
 								        - [Product Recognition](./docs/en/application/product_recognition_en.md)
 								        - [Vehicle Recognition](./docs/en/application/vehicle_recognition_en.md)
 								        - [Logo Recognition](./docs/en/application/logo_recognition_en.md)
 								        - [Animation Character Recognition](./docs/en/application/cartoon_character_recognition_en.md)
-												update column

											
										
										
											2021-06-20 17:44:01 +08:00
+								    - [Vector Search](./deploy/vector_search/README.md)
-												Update README_en.md

											
										
										
											2021-06-17 17:26:12 +08:00
+								- Models Training/Evaluation
-												update english column

											
										
										
											2021-06-19 10:01:19 +08:00
+								    - [Image Classification](./docs/en/tutorials/getting_started_en.md)
-												update column

											
										
										
											2021-06-20 20:42:26 +08:00
+								    - [Feature Learning](./docs/en/tutorials/getting_started_retrieval_en.md)
-												update column

											
										
										
											2021-06-20 17:44:01 +08:00
+								- Inference Model Prediction
-												add inference doc to column

											
										
										
											2021-06-20 17:35:07 +08:00
+								    - [Python Inference](./docs/en/inference.md)
-												add pp-shitu c++ link in readme

											
										
										
											2022-01-25 19:35:38 +08:00
+								    - [C++ Classfication Inference](./deploy/cpp/readme_en.md)， [C++ PP-ShiTu Inference](deploy/cpp_shitu/readme_en.md)
-												update english column

											
										
										
											2021-06-19 10:01:19 +08:00
+								- Model Deploy (only support classification for now, recognition coming soon)
 								    - [Hub Serving Deployment](./deploy/hubserving/readme_en.md)
 								    - [Mobile Deployment](./deploy/lite/readme_en.md)
 								    - [Inference Using whl](./docs/en/whl_en.md)
-												Update README_en.md

											
										
										
											2021-06-17 17:26:12 +08:00
+								- Advanced Tutorial
-												update english column

											
										
										
											2021-06-19 10:01:19 +08:00
+								    - [Knowledge Distillation](./docs/en/advanced_tutorials/distillation/distillation_en.md)
 								    - [Model Quantization](./docs/en/extension/paddle_quantization_en.md)
 								    - [Data Augmentation](./docs/en/advanced_tutorials/image_augmentation/ImageAugment_en.md)
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
+								- [License](#License)
 								- [Contribution](#Contribution)
-												update readme

											
										
										
											2022-06-14 10:42:27 +08:00
+								<a name="Introduction_to_PULC"></a>
 								## Introduction to Practical Ultra Light-weight image Classification solutions
 								<div align="center">
 								<img src="https://user-images.githubusercontent.com/19523330/173011854-b10fcd7a-b799-4dfd-a1cf-9504952a3c44.png"  width = "800" />
 								</div>
 								PULC solutions consists of PP-LCNet light-weight backbone, SSLD pretrained models, Ensemble of Data Augmentation strategy and SKL-UGI knowledge distillation.
 								PULC models inference within 3ms on CPU devices, with accuracy comparable with SwinTransformer. We also release 9 practical models covering pedestrian, vehicle and OCR.
-												update column

											
										
										
											2021-06-20 20:42:26 +08:00
+								<a name="Introduction_to_Image_Recognition_Systems"></a>
-												Update README_en.md

											
										
										
											2021-06-17 17:26:12 +08:00
+								## Introduction to Image Recognition Systems
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
-												Update README_en.md

											
										
										
											2021-06-17 17:26:12 +08:00
+								<div align="center">
-												add gif demo image

											
										
										
											2021-11-01 01:42:50 +08:00
+								<img src="./docs/images/structure.jpg"  width = "800" />
-												Update README_en.md

											
										
										
											2021-06-17 17:26:12 +08:00
+								</div>
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
-												Update README_en.md

											
										
										
											2021-06-17 17:26:12 +08:00
+								Image recognition can be divided into three steps:
 								- （1）Identify region proposal for target objects through a detection model；
 								- （2）Extract features for each region proposal;
 								- （3）Search features in the retrieval database and output results;
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
-												Update README_en.md

											
										
										
											2021-06-17 17:26:12 +08:00
+								For a new unknown category, there is no need to retrain the model, just prepare images of new category, extract features and update retrieval database and the category can be recognised.
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
-												update readme

											
										
										
											2022-06-14 10:42:27 +08:00
+								## PULC demo images
 								<div align="center">
 								<img src="docs/images/classification.gif">
 								</div>
 								<a name="Rec_Demo_images"></a>
 								## Image Recognition Demo images [more](https://github.com/PaddlePaddle/PaddleClas/tree/release/2.2/docs/images/recognition/more_demo_images)
-												update english readme, add demo images

											
										
										
											2021-06-20 17:53:14 +08:00
+								- Product recognition
 								<div align="center">
-												Update README_en.md
											
										
										
											2021-06-21 21:29:39 +08:00
+								<img src="https://user-images.githubusercontent.com/18028216/122769644-51604f80-d2d7-11eb-8290-c53b12a5c1f6.gif"  width = "400" />
-												update english readme, add demo images

											
										
										
											2021-06-20 17:53:14 +08:00
+								</div>
 								- Cartoon character recognition
 								<div align="center">
-												Update README_en.md
											
										
										
											2021-06-21 21:29:39 +08:00
+								<img src="https://user-images.githubusercontent.com/18028216/122769746-6b019700-d2d7-11eb-86df-f1d710999ba6.gif"  width = "400" />
-												update english readme, add demo images

											
										
										
											2021-06-20 17:53:14 +08:00
+								</div>
 								- Logo recognition
 								<div align="center">
-												Update README_en.md
											
										
										
											2021-06-21 21:29:39 +08:00
+								<img src="https://user-images.githubusercontent.com/18028216/122769837-7fde2a80-d2d7-11eb-9b69-04140e9d785f.gif"  width = "400" />
-												update english readme, add demo images

											
										
										
											2021-06-20 17:53:14 +08:00
+								</div>
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
-												update english readme, add demo images

											
										
										
											2021-06-20 17:53:14 +08:00
+								- Car recognition
 								<div align="center">
-												Update README_en.md
											
										
										
											2021-06-21 21:29:39 +08:00
+								<img src="https://user-images.githubusercontent.com/18028216/122769916-8ec4dd00-d2d7-11eb-8c60-42d89e25030c.gif"  width = "400" />
-												update english readme, add demo images

											
										
										
											2021-06-20 17:53:14 +08:00
+								</div>
 								<a name="License"></a>
-												Update README_en.md

											
										
										
											2021-06-17 17:26:12 +08:00
+								## License
 								PaddleClas is released under the Apache 2.0 license <a href="https://github.com/PaddlePaddle/PaddleCLS/blob/master/LICENSE">Apache 2.0 license</a>
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
 								<a name="Contribution"></a>
 								## Contribution
 								Contributions are highly welcomed and we would really appreciate your feedback!!
-												Update README_en.md

											
										
										
											2021-06-17 17:26:12 +08:00
-												update README

											
										
										
											2021-06-17 13:00:13 +08:00
+								- Thank [nblib](https://github.com/nblib) to fix bug of RandErasing.
 								- Thank [chenpy228](https://github.com/chenpy228) to fix some typos PaddleClas.
 								- Thank [jm12138](https://github.com/jm12138) to add ViT, DeiT models and RepVGG models into PaddleClas.
-												Update README_en.md

											
										
										
											2021-06-17 17:26:12 +08:00
+								- Thank [FutureSI](https://aistudio.baidu.com/aistudio/personalcenter/thirdview/76563) to parse and summarize the PaddleClas code.