<div align="center">
  <img src="resources/mmocr-logo.png" width="500px"/>
  <div>&nbsp;</div>
  <div align="center">
    <b><font size="5">OpenMMLab website</font></b>
    <sup>
      <a href="https://openmmlab.com">
        <i><font size="4">HOT</font></i>
      </a>
    </sup>
    &nbsp;&nbsp;&nbsp;&nbsp;
    <b><font size="5">OpenMMLab platform</font></b>
    <sup>
      <a href="https://platform.openmmlab.com">
        <i><font size="4">TRY IT OUT</font></i>
      </a>
    </sup>
  </div>
  <div>&nbsp;</div>

[![build](https://github.com/open-mmlab/mmocr/workflows/build/badge.svg)](https://github.com/open-mmlab/mmocr/actions)
[![docs](https://readthedocs.org/projects/mmocr/badge/?version=dev-1.x)](https://mmocr.readthedocs.io/en/dev-1.x/?badge=dev-1.x)
[![codecov](https://codecov.io/gh/open-mmlab/mmocr/branch/main/graph/badge.svg)](https://codecov.io/gh/open-mmlab/mmocr)
[![license](https://img.shields.io/github/license/open-mmlab/mmocr.svg)](https://github.com/open-mmlab/mmocr/blob/main/LICENSE)
[![PyPI](https://badge.fury.io/py/mmocr.svg)](https://pypi.org/project/mmocr/)
[![Average time to resolve an issue](https://isitmaintained.com/badge/resolution/open-mmlab/mmocr.svg)](https://github.com/open-mmlab/mmocr/issues)
[![Percentage of issues still open](https://isitmaintained.com/badge/open/open-mmlab/mmocr.svg)](https://github.com/open-mmlab/mmocr/issues)
<a href="https://console.tiyaro.ai/explore?q=mmocr&pub=mmocr"> <img src="https://tiyaro-public-docs.s3.us-west-2.amazonaws.com/assets/try_on_tiyaro_badge.svg"></a>

[📘Documentation](https://mmocr.readthedocs.io/en/dev-1.x/) |
[🛠️Installation](https://mmocr.readthedocs.io/en/dev-1.x/get_started/install.html) |
[👀Model Zoo](https://mmocr.readthedocs.io/en/dev-1.x/modelzoo.html) |
[🆕Update News](https://mmocr.readthedocs.io/en/dev-1.x/notes/changelog.html) |
[🤔Reporting Issues](https://github.com/open-mmlab/mmocr/issues/new/choose)

</div>

<div align="center">

English | [简体中文](README_zh-CN.md)

</div>

## Introduction

MMOCR is an open-source toolbox based on PyTorch and MMDetection for text detection, text recognition, and downstream tasks such as key information extraction. It is part of the [OpenMMLab](https://openmmlab.com/) project.

The main branch works with **PyTorch 1.6+**.

<div align="center">
  <img src="https://user-images.githubusercontent.com/24622904/187838618-1fdc61c0-2d46-49f9-8502-976ffdf01f28.png"/>
</div>

### Major Features

- **Comprehensive Pipeline**

  The toolbox supports not only text detection and text recognition, but also their downstream tasks such as key information extraction.

- **Multiple Models**

  The toolbox supports a wide variety of state-of-the-art models for text detection, text recognition and key information extraction.

- **Modular Design**

  The modular design of MMOCR enables users to define their own optimizers, data preprocessors, and model components such as backbones, necks and heads as well as losses. Please refer to [Overview](https://mmocr.readthedocs.io/en/dev-1.x/get_started/overview.html) for how to construct a customized model.

- **Numerous Utilities**

  The toolbox provides a comprehensive set of utilities that help users assess model performance. It includes visualizers for images, ground truths and predicted bounding boxes, and a validation tool for evaluating checkpoints during training. It also includes data converters that demonstrate how to convert your own data into the annotation formats the toolbox supports.
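
The modular design described above rests on a registry pattern: components are registered under a name and later built from a config dictionary. The snippet below is a minimal, self-contained sketch of that idea in plain Python. The `Registry` class, `BACKBONES`, and `TinyBackbone` are hypothetical names written for illustration; this is not MMEngine's or MMOCR's actual API, so please refer to the Overview for the real interfaces.

```python
class Registry:
    """Toy registry (illustrative only, not MMEngine's implementation)."""

    def __init__(self, name):
        self.name = name
        self._modules = {}

    def register(self, cls):
        # Store the class under its own name so configs can refer to it.
        self._modules[cls.__name__] = cls
        return cls

    def build(self, cfg):
        # Copy the config so the caller's dict is left untouched.
        cfg = dict(cfg)
        cls = self._modules[cfg.pop("type")]
        return cls(**cfg)


# Hypothetical registry for backbone components.
BACKBONES = Registry("backbone")


@BACKBONES.register
class TinyBackbone:
    """Hypothetical user-defined backbone."""

    def __init__(self, depth=18):
        self.depth = depth


# A config dict selects the component by name and passes its arguments.
backbone = BACKBONES.build({"type": "TinyBackbone", "depth": 34})
print(type(backbone).__name__, backbone.depth)
```

With this pattern, swapping a component means changing the `type` field of a config rather than editing model code.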

## Latest Updates

v1.0.0rc5 was released on 2023-01-06.

1. Two models, Aster and SVTR, are added to our model zoo. The full implementation of ABCNet is also available now.
2. Dataset Preparer supports 5 more datasets: CocoTextV2, FUNSD, TextOCR, NAF, SROIE.
3. Four more text recognition transforms and two more helper transforms have been added.
4. The `FixInvalidPolygon` transform handles invalid polygons better and can now cope with more irregular annotations. As a result, a complete training cycle on the TotalText dataset runs without errors. The weights of DBNet and FCENet pretrained on TotalText are also released.

Read [Changelog](https://mmocr.readthedocs.io/en/dev-1.x/notes/changelog.html) for more details!

## What's New in MMOCR 1.0

1. **New engines**. MMOCR 1.x is based on [MMEngine](https://github.com/open-mmlab/mmengine), which provides a general and powerful runner that allows more flexible customizations and significantly simplifies the entrypoints of high-level interfaces.

2. **Unified interfaces**. As part of the OpenMMLab 2.0 projects, MMOCR 1.x unifies and refactors the interfaces and internal logic of training, testing, datasets, models, evaluation, and visualization. All OpenMMLab 2.0 projects share the same design in these interfaces and logic, which allows multi-task and multi-modality algorithms to emerge.

3. **Cross project calling**. Benefiting from the unified design, you can use models implemented in other OpenMMLab projects, such as MMDet. We provide an example of how to use MMDetection's Mask R-CNN through `MMDetWrapper`. Check our documentation for more details. More wrappers will be released in the future.

4. **Stronger visualization**. We provide a series of useful tools which are mostly based on brand-new visualizers. As a result, it is more convenient for the users to explore the models and datasets now.

5. **More documentation and tutorials**. We have added more documentation and tutorials to help users get started more smoothly. Read them [here](https://mmocr.readthedocs.io/en/dev-1.x/).

6. **One-stop Dataset Preparation**. Multiple datasets are ready to use with a single command, via our [Dataset Preparer](https://mmocr.readthedocs.io/en/dev-1.x/user_guides/data_prepare/dataset_preparer.html).

7. **Embracing more `projects/`**: We have introduced a `projects/` folder for experimental features, frameworks and models, which only need to satisfy a minimum requirement on code quality. Everyone is welcome to post their implementation of any great idea in this folder! Learn more from our [example project](https://github.com/open-mmlab/mmocr/blob/dev-1.x/projects/example_project/).

8. **More models**. MMOCR 1.0 supports more tasks and more state-of-the-art models!

## Installation

MMOCR depends on [PyTorch](https://pytorch.org/), [MMEngine](https://github.com/open-mmlab/mmengine), [MMCV](https://github.com/open-mmlab/mmcv) and [MMDetection](https://github.com/open-mmlab/mmdetection).
Below are quick steps for installation.
Please refer to the [Install Guide](https://mmocr.readthedocs.io/en/dev-1.x/get_started/install.html) for more detailed instructions.

```shell
conda create -n open-mmlab python=3.8 pytorch=1.10 cudatoolkit=11.3 torchvision -c pytorch -y
conda activate open-mmlab
pip3 install openmim
mim install mmengine
mim install 'mmcv>=2.0.0rc1'
mim install 'mmdet>=3.0.0rc0'
git clone https://github.com/open-mmlab/mmocr.git
cd mmocr
git checkout 1.x
pip3 install -e .
```
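
After running the steps above, it can help to confirm that the dependencies were actually installed. The following is a minimal sketch using only the Python standard library. The helper name `installed_versions` is an assumption made for illustration, and the package names are assumed to match the distribution names used in the `mim install` commands above. It only reports what is installed and does not compare against the minimum versions.

```python
from importlib import metadata

# Packages named in the install commands above. The minimum versions shown
# there (mmcv>=2.0.0rc1, mmdet>=3.0.0rc0) are listed for reference only.
REQUIRED = {
    "mmengine": None,
    "mmcv": "2.0.0rc1",
    "mmdet": "3.0.0rc0",
    "mmocr": None,
}


def installed_versions(packages):
    """Return {package: installed version string, or None if missing}."""
    found = {}
    for name in packages:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    for name, version in installed_versions(REQUIRED).items():
        minimum = REQUIRED[name] or "any"
        status = version if version is not None else "NOT INSTALLED"
        print(f"{name}: {status} (minimum from install steps: {minimum})")
```

Any package reported as `NOT INSTALLED` points to the install step that should be repeated.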

## Get Started

Please see [Quick Run](https://mmocr.readthedocs.io/en/dev-1.x/get_started/quick_run.html) for the basic usage of MMOCR.

## [Model Zoo](https://mmocr.readthedocs.io/en/dev-1.x/modelzoo.html)

Supported algorithms:

<details open>
<summary>Backbone</summary>

- [x] [oCLIP](configs/backbone/oclip/README.md) (ECCV'2022)

</details>

<details open>
<summary>Text Detection</summary>

- [x] [DBNet](configs/textdet/dbnet/README.md) (AAAI'2020) / [DBNet++](configs/textdet/dbnetpp/README.md) (TPAMI'2022)
- [x] [Mask R-CNN](configs/textdet/maskrcnn/README.md) (ICCV'2017)
- [x] [PANet](configs/textdet/panet/README.md) (ICCV'2019)
- [x] [PSENet](configs/textdet/psenet/README.md) (CVPR'2019)
- [x] [TextSnake](configs/textdet/textsnake/README.md) (ECCV'2018)
- [x] [DRRG](configs/textdet/drrg/README.md) (CVPR'2020)
- [x] [FCENet](configs/textdet/fcenet/README.md) (CVPR'2021)

</details>

<details open>
<summary>Text Recognition</summary>

- [x] [ABINet](configs/textrecog/abinet/README.md) (CVPR'2021)
- [x] [ASTER](configs/textrecog/aster/README.md) (TPAMI'2018)
- [x] [CRNN](configs/textrecog/crnn/README.md) (TPAMI'2016)
- [x] [MASTER](configs/textrecog/master/README.md) (PR'2021)
- [x] [NRTR](configs/textrecog/nrtr/README.md) (ICDAR'2019)
- [x] [RobustScanner](configs/textrecog/robust_scanner/README.md) (ECCV'2020)
- [x] [SAR](configs/textrecog/sar/README.md) (AAAI'2019)
- [x] [SATRN](configs/textrecog/satrn/README.md) (CVPR'2020 Workshop on Text and Documents in the Deep Learning Era)
- [x] [SVTR](configs/textrecog/svtr/README.md) (IJCAI'2022)

</details>

<details open>
<summary>Key Information Extraction</summary>

- [x] [SDMG-R](configs/kie/sdmgr/README.md) (ArXiv'2021)

</details>

<details open>
<summary>Text Spotting</summary>

- [x] [ABCNet](projects/ABCNet/README.md) (CVPR'2020)

</details>

Please refer to [model_zoo](https://mmocr.readthedocs.io/en/dev-1.x/modelzoo.html) for more details.

## Contributing

We appreciate all contributions to improve MMOCR. Please refer to [CONTRIBUTING.md](.github/CONTRIBUTING.md) for the contributing guidelines.

## Acknowledgement

MMOCR is an open-source project contributed to by researchers and engineers from various colleges and companies. We appreciate all the contributors who implement their methods or add new features, as well as the users who give valuable feedback.
We hope the toolbox and benchmark can serve the growing research community by providing a flexible toolkit for reimplementing existing methods and developing new OCR methods.

## Citation

If you find this project useful in your research, please consider citing:

```bibtex
@article{mmocr2021,
    title={MMOCR:  A Comprehensive Toolbox for Text Detection, Recognition and Understanding},
    author={Kuang, Zhanghui and Sun, Hongbin and Li, Zhizhong and Yue, Xiaoyu and Lin, Tsui Hin and Chen, Jianyong and Wei, Huaqiang and Zhu, Yiqin and Gao, Tong and Zhang, Wenwei and Chen, Kai and Zhang, Wayne and Lin, Dahua},
    journal= {arXiv preprint arXiv:2108.06543},
    year={2021}
}
```

## License

This project is released under the [Apache 2.0 license](LICENSE).

## Projects in OpenMMLab

- [MMEngine](https://github.com/open-mmlab/mmengine): OpenMMLab foundational library for training deep learning models.
- [MMCV](https://github.com/open-mmlab/mmcv): OpenMMLab foundational library for computer vision.
- [MIM](https://github.com/open-mmlab/mim): MIM installs OpenMMLab packages.
- [MMClassification](https://github.com/open-mmlab/mmclassification): OpenMMLab image classification toolbox and benchmark.
- [MMDetection](https://github.com/open-mmlab/mmdetection): OpenMMLab detection toolbox and benchmark.
- [MMDetection3D](https://github.com/open-mmlab/mmdetection3d): OpenMMLab's next-generation platform for general 3D object detection.
- [MMRotate](https://github.com/open-mmlab/mmrotate): OpenMMLab rotated object detection toolbox and benchmark.
- [MMSegmentation](https://github.com/open-mmlab/mmsegmentation): OpenMMLab semantic segmentation toolbox and benchmark.
- [MMOCR](https://github.com/open-mmlab/mmocr): OpenMMLab text detection, recognition, and understanding toolbox.
- [MMPose](https://github.com/open-mmlab/mmpose): OpenMMLab pose estimation toolbox and benchmark.
- [MMHuman3D](https://github.com/open-mmlab/mmhuman3d): OpenMMLab 3D human parametric model toolbox and benchmark.
- [MMSelfSup](https://github.com/open-mmlab/mmselfsup): OpenMMLab self-supervised learning toolbox and benchmark.
- [MMRazor](https://github.com/open-mmlab/mmrazor): OpenMMLab model compression toolbox and benchmark.
- [MMFewShot](https://github.com/open-mmlab/mmfewshot): OpenMMLab fewshot learning toolbox and benchmark.
- [MMAction2](https://github.com/open-mmlab/mmaction2): OpenMMLab's next-generation action understanding toolbox and benchmark.
- [MMTracking](https://github.com/open-mmlab/mmtracking): OpenMMLab video perception toolbox and benchmark.
- [MMFlow](https://github.com/open-mmlab/mmflow): OpenMMLab optical flow toolbox and benchmark.
- [MMEditing](https://github.com/open-mmlab/mmediting): OpenMMLab image and video editing toolbox.
- [MMGeneration](https://github.com/open-mmlab/mmgeneration): OpenMMLab image and video generative models toolbox.
- [MMDeploy](https://github.com/open-mmlab/mmdeploy): OpenMMLab model deployment framework.

## Welcome to the OpenMMLab community

Scan the QR codes below to follow the OpenMMLab team's [**Zhihu Official Account**](https://www.zhihu.com/people/openmmlab), join the team's [**QQ Group**](https://jq.qq.com/?_wv=1027&k=aCvMxdr3), or add our WeChat account to join the official WeChat group. You can also join our [**Slack**](https://join.slack.com/t/mmocrworkspace/shared_invite/zt-1ifqhfla8-yKnLO_aKhVA2h71OrK8GZw).

<div align="center">
<img src="https://raw.githubusercontent.com/open-mmlab/mmcv/master/docs/en/_static/zhihu_qrcode.jpg" height="400" />  <img src="https://raw.githubusercontent.com/open-mmlab/mmcv/master/docs/en/_static/qq_group_qrcode.jpg" height="400" />  <img src="https://raw.githubusercontent.com/open-mmlab/mmcv/master/docs/en/_static/wechat_qrcode.jpg" height="400" />
</div>

In the OpenMMLab community, we will provide you with:

- 📢 The latest core technologies of AI frameworks
- 💻 Explanations of the source code of common PyTorch modules
- 📰 News related to OpenMMLab releases
- 🚀 Introductions to cutting-edge algorithms developed by OpenMMLab
- 🏃 More efficient answers and feedback
- 🔥 A platform for communication with developers from all walks of life

The OpenMMLab community looks forward to your participation! 👬
-												initial commit

											
										
										
											2021-04-02 15:55:42 +08:00
+								<div align="center">
-												fix #29: update logo (#30)


											
										
										
											2021-04-05 20:30:33 +08:00
+								  <img src="resources/mmocr-logo.png" width="500px"/>
-												Add website links to readme (#731)


											
										
										
											2022-01-13 20:11:13 +08:00
+								  <div>&nbsp;</div>
 								  <div align="center">
 								    <b><font size="5">OpenMMLab website</font></b>
 								    <sup>
 								      <a href="https://openmmlab.com">
 								        <i><font size="4">HOT</font></i>
 								      </a>
 								    </sup>
 								    &nbsp;&nbsp;&nbsp;&nbsp;
 								    <b><font size="5">OpenMMLab platform</font></b>
 								    <sup>
 								      <a href="https://platform.openmmlab.com">
 								        <i><font size="4">TRY IT OUT</font></i>
 								      </a>
 								    </sup>
 								  </div>
 								  <div>&nbsp;</div>
-												added link to readme.md (#80)


											
										
										
											2021-04-16 17:45:20 +08:00
-												Fix typos (#26)

* Fix typos

Signed-off-by: lizz <lizz@sensetime.com>

* Ohh

Signed-off-by: lizz <lizz@sensetime.com>
											
										
										
											2021-04-05 20:16:13 +08:00
+								[![build](https://github.com/open-mmlab/mmocr/workflows/build/badge.svg)](https://github.com/open-mmlab/mmocr/actions)
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+								[![docs](https://readthedocs.org/projects/mmocr/badge/?version=dev-1.x)](https://mmocr.readthedocs.io/en/dev-1.x/?badge=dev-1.x)
-												Update README.md
											
										
										
											2021-04-10 20:44:20 +08:00
+								[![codecov](https://codecov.io/gh/open-mmlab/mmocr/branch/main/graph/badge.svg)](https://codecov.io/gh/open-mmlab/mmocr)
 								[![license](https://img.shields.io/github/license/open-mmlab/mmocr.svg)](https://github.com/open-mmlab/mmocr/blob/main/LICENSE)
-												update badge

											
										
										
											2021-04-09 10:59:14 +08:00
+								[![PyPI](https://badge.fury.io/py/mmocr.svg)](https://pypi.org/project/mmocr/)
 								[![Average time to resolve an issue](https://isitmaintained.com/badge/resolution/open-mmlab/mmocr.svg)](https://github.com/open-mmlab/mmocr/issues)
 								[![Percentage of issues still open](https://isitmaintained.com/badge/open/open-mmlab/mmocr.svg)](https://github.com/open-mmlab/mmocr/issues)
-												cherry pick main (#1355)

* [Fix] Update owners (#1248)

* [Docs] Update installation guide (#1254)

* [Docs] Update installation guide

* add pic

* minor fix

* fix

* [Docs] Update image link (#1255)

* [Docs] demo, experiments and live inference API on Tiyaro (#1272)

* docs: added Try on Tiyaro Badge

* docs: fix mdformat

* docs: update tiyaro docs url

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
Co-authored-by: Venkat Raman <vraman2811@gmail.com>
											
										
										
											2022-08-31 09:32:55 +08:00
+								<a href="https://console.tiyaro.ai/explore?q=mmocr&pub=mmocr"> <img src="https://tiyaro-public-docs.s3.us-west-2.amazonaws.com/assets/try_on_tiyaro_badge.svg"></a>
-												initial commit

											
										
										
											2021-04-02 15:55:42 +08:00
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+								[📘Documentation](https://mmocr.readthedocs.io/en/dev-1.x/) |
 								[🛠️Installation](https://mmocr.readthedocs.io/en/dev-1.x/get_started/install.html) |
 								[👀Model Zoo](https://mmocr.readthedocs.io/en/dev-1.x/modelzoo.html) |
 								[🆕Update News](https://mmocr.readthedocs.io/en/dev-1.x/notes/changelog.html) |
-												[Enchance] add codespell ignore and use mdformat (#1022)

* update

* update contributing

* update ci

* fix md

* update pre-commit hook

* update mdformat

Co-authored-by: gaotongxiao <gaotongxiao@gmail.com>
											
										
										
											2022-06-09 14:58:44 +08:00
+								[🤔Reporting Issues](https://github.com/open-mmlab/mmocr/issues/new/choose)
-												[Docs] Update readme according to the guideline (#1047)

* [Docs] Update readme according to the guideline

* fix

* fix cn links
											
										
										
											2022-06-01 10:24:07 +08:00
 								</div>
 								<div align="center">
 								English | [简体中文](README_zh-CN.md)
 								</div>
 								## Introduction
-												Update README.md (#10)


											
										
										
											2021-04-08 12:07:28 +08:00
+								MMOCR is an open-source toolbox based on PyTorch and mmdetection for text detection, text recognition, and the corresponding downstream tasks including key information extraction. It is part of the [OpenMMLab](https://openmmlab.com/) project.
-												initial commit

											
										
										
											2021-04-02 15:55:42 +08:00
-												[Docs] update dependency version req, dockerfile and change logs for 0.2.1 (#331)

* update pytorch req and dockerfile

* Update dependency requirement

* update readme for 0.2.1

* update change log

* update release date
											
										
										
											2021-07-20 23:18:47 +08:00
+								The main branch works with **PyTorch 1.6+**.
-												initial commit

											
										
										
											2021-04-02 15:55:42 +08:00
-												[Docs] Update readme according to the guideline (#1047)

* [Docs] Update readme according to the guideline

* fix

* fix cn links
											
										
										
											2022-06-01 10:24:07 +08:00
+								<div align="center">
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+								  <img src="https://user-images.githubusercontent.com/24622904/187838618-1fdc61c0-2d46-49f9-8502-976ffdf01f28.png"/>
-												initial commit

											
										
										
											2021-04-02 15:55:42 +08:00
+								</div>
 								### Major Features
 								- **Comprehensive Pipeline**
-												[CI] Add CI (#1176)

* [CI] Add CI

* update init

* fix lint

* fix lint

* fix linting

* fix linting

* fix linting

* fix

* fix

* fix

* fix

* fix

* fix

* disable github ci

* fix

* Update .circleci/test.yml

Co-authored-by: Qing Jiang <mountchicken@outlook.com>

* fix

* fix

Co-authored-by: Qing Jiang <mountchicken@outlook.com>
											
										
										
											2022-07-21 14:28:57 +08:00
+								  The toolbox supports not only text detection and text recognition, but also their downstream tasks such as key information extraction.
-												initial commit

											
										
										
											2021-04-02 15:55:42 +08:00
 								- **Multiple Models**
 								  The toolbox supports a wide variety of state-of-the-art models for text detection, text recognition and key information extraction.
 								- **Modular Design**
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+								  The modular design of MMOCR enables users to define their own optimizers, data preprocessors, and model components such as backbones, necks and heads as well as losses. Please refer to [Overview](https://mmocr.readthedocs.io/en/dev-1.x/get_started/overview.html) for how to construct a customized model.
-												initial commit

											
										
										
											2021-04-02 15:55:42 +08:00
 								- **Numerous Utilities**
-												fix typo (#7)


											
										
										
											2021-04-08 10:23:03 +08:00
+								  The toolbox provides a comprehensive set of utilities which can help users assess the performance of models. It includes visualizers which allow visualization of images, ground truths as well as predicted bounding boxes, and a validation tool for evaluating checkpoints during training.  It also includes data converters to demonstrate how to convert your own data to the annotation files which the toolbox supports.
-												initial commit

											
										
										
											2021-04-02 15:55:42 +08:00
-												Bump version to 1.0.0rc5 (#1662)

* Bump version to 1.0.0rc5

* fix

* update
											
										
										
											2023-01-06 17:35:07 +08:00
+								## Latest Updates
 								v1.0.0rc5 was released in 2023-01-06.
 . Two models, Aster and SVTR, are added to our model zoo. The full implementation of ABCNet is also available now.
 . Dataset Preparer supports 5 more datasets: CocoTextV2, FUNSD, TextOCR, NAF, SROIE.
 . We have 4 more text recognition transforms, and two more helper transforms.
 . The transform, `FixInvalidPolygon`, is getting smarter at dealing with invalid polygons, and now capable of handling more weird annotations. As a result, a complete training cycle on TotalText dataset can be performed bug-free. The weights of DBNet and FCENet pretrained on TotalText are also released.
 								Read [Changelog](https://mmocr.readthedocs.io/en/dev-1.x/notes/changelog.html) for more details!
 								## What's New in MMOCR 1.0
-												[Docs] Update readme according to the guideline (#1047)

* [Docs] Update readme according to the guideline

* fix

* fix cn links
											
										
										
											2022-06-01 10:24:07 +08:00
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+. **New engines**. MMOCR 1.x is based on [MMEngine](https://github.com/open-mmlab/mmengine), which provides a general and powerful runner that allows more flexible customizations and significantly simplifies the entrypoints of high-level interfaces.
-												[Docs] Update readme according to the guideline (#1047)

* [Docs] Update readme according to the guideline

* fix

* fix cn links
											
										
										
											2022-06-01 10:24:07 +08:00
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+. **Unified interfaces**. As a part of the OpenMMLab 2.0 projects, MMOCR 1.x unifies and refactors the interfaces and internal logics of train, testing, datasets, models, evaluation, and visualization. All the OpenMMLab 2.0 projects share the same design in those interfaces and logics to allow the emergence of multi-task/modality algorithms.
-												[Docs] Update readme according to the guideline (#1047)

* [Docs] Update readme according to the guideline

* fix

* fix cn links
											
										
										
											2022-06-01 10:24:07 +08:00
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+. **Cross project calling**. Benefiting from the unified design, you can use the models implemented in other OpenMMLab projects, such as MMDet. We provide an example of how to use MMDetection's Mask R-CNN through `MMDetWrapper`. Check our documents for more details. More wrappers will be released in the future.
 . **Stronger visualization**. We provide a series of useful tools which are mostly based on brand-new visualizers. As a result, it is more convenient for the users to explore the models and datasets now.
 . **More documentation and tutorials**. We add a bunch of documentation and tutorials to help users get started more smoothly. Read it [here](https://mmocr.readthedocs.io/en/dev-1.x/).
-												Bump version to 1.0.0rc3 (#1510)

* Bump version to 1.0.0rc3

* update changelog

* fix
											
										
										
											2022-11-03 19:56:16 +08:00
+. **One-stop Dataset Preparaion**. Multiple datasets are instantly ready with only one line of command, via our [Dataset Preparer](https://mmocr.readthedocs.io/en/dev-1.x/user_guides/data_prepare/dataset_preparer.html).
-												Bump version to 1.0.0rc4 (#1600)

* Bump version to 1.0.0rc4

* update changelog

* fix

* update readme

* Update README.md

Co-authored-by: liukuikun <24622904+Harold-lkk@users.noreply.github.com>

Co-authored-by: liukuikun <24622904+Harold-lkk@users.noreply.github.com>
											
										
										
											2022-12-06 17:24:35 +08:00
+. **Embracing more `projects/`**: We now introduce `projects/` folder, where some experimental features, frameworks and models can be placed, only needed to satisfy the minimum requirement on the code quality. Everyone is welcome to post their implementation of any great ideas in this folder! Learn more from our [example project](https://github.com/open-mmlab/mmocr/blob/dev-1.x/projects/example_project/).
 . **More models**. MMOCR 1.0 supports more tasks and more state-of-the-art models!
-												[Docs] Update readme according to the guideline (#1047)

* [Docs] Update readme according to the guideline

* fix

* fix cn links
											
										
										
											2022-06-01 10:24:07 +08:00
+								## Installation
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+								MMOCR depends on [PyTorch](https://pytorch.org/), [MMEngine](https://github.com/open-mmlab/mmengine), [MMCV](https://github.com/open-mmlab/mmcv) and [MMDetection](https://github.com/open-mmlab/mmdetection).
-												[Docs] Update readme according to the guideline (#1047)

* [Docs] Update readme according to the guideline

* fix

* fix cn links
											
										
										
											2022-06-01 10:24:07 +08:00
+								Below are quick steps for installation.
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+								Please refer to [Install Guide](https://mmocr.readthedocs.io/en/dev-1.x/get_started/install.html) for more detailed instruction.
-												[Docs] Update readme according to the guideline (#1047)

* [Docs] Update readme according to the guideline

* fix

* fix cn links
											
										
										
											2022-06-01 10:24:07 +08:00
 								```shell
 								conda create -n open-mmlab python=3.8 pytorch=1.10 cudatoolkit=11.3 torchvision -c pytorch -y
 								conda activate open-mmlab
 								pip3 install openmim
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+								mim install mmengine
 								mim install 'mmcv>=2.0.0rc1'
 								mim install 'mmdet>=3.0.0rc0'
-												[Docs] Update readme according to the guideline (#1047)

* [Docs] Update readme according to the guideline

* fix

* fix cn links
											
										
										
											2022-06-01 10:24:07 +08:00
+								git clone https://github.com/open-mmlab/mmocr.git
 								cd mmocr
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+								git checkout 1.x
-												[Docs] Update readme according to the guideline (#1047)

* [Docs] Update readme according to the guideline

* fix

* fix cn links
											
										
										
											2022-06-01 10:24:07 +08:00
+								pip3 install -e .
 								```
 								## Get Started
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+								Please see [Quick Run](https://mmocr.readthedocs.io/en/dev-1.x/get_started/quick_run.html) for the basic usage of MMOCR.
-												[Docs] Update readme according to the guideline (#1047)

* [Docs] Update readme according to the guideline

* fix

* fix cn links
											
										
										
											2022-06-01 10:24:07 +08:00
-												[Docs] readme update (#1359)

* updata install

* update

* update link

* update pic link
											
										
										
											2022-09-01 14:08:10 +08:00
+								## [Model Zoo](https://mmocr.readthedocs.io/en/dev-1.x/modelzoo.html)
-												Add methods to readme

Signed-off-by: lizz <lizz@sensetime.com>

											
										
										
											2021-04-10 18:04:36 +08:00
 								Supported algorithms:
-												[Docs] oclip readme (#1505)

* [WIP] oclip docs

* oclip readthe docs

* rename oclip-resnet to resnet-oclip

* updata hemean

* updata link

* updata title
											
										
										
											2022-11-03 19:01:16 +08:00
+								<details open>
 								<summary>BackBone</summary>
 								- [x] [oCLIP](configs/backbone/oclip/README.md) (ECCV'2022)
 								</details>
-												Add methods to readme

<details open>
<summary>Text Detection</summary>

- [x] [DBNet](configs/textdet/dbnet/README.md) (AAAI'2020) / [DBNet++](configs/textdet/dbnetpp/README.md) (TPAMI'2022)
- [x] [Mask R-CNN](configs/textdet/maskrcnn/README.md) (ICCV'2017)
- [x] [PANet](configs/textdet/panet/README.md) (ICCV'2019)
- [x] [PSENet](configs/textdet/psenet/README.md) (CVPR'2019)
- [x] [TextSnake](configs/textdet/textsnake/README.md) (ECCV'2018)
- [x] [DRRG](configs/textdet/drrg/README.md) (CVPR'2020)
- [x] [FCENet](configs/textdet/fcenet/README.md) (CVPR'2021)

</details>

<details open>
<summary>Text Recognition</summary>

- [x] [ABINet](configs/textrecog/abinet/README.md) (CVPR'2021)
- [x] [ASTER](configs/textrecog/aster/README.md) (TPAMI'2018)
- [x] [CRNN](configs/textrecog/crnn/README.md) (TPAMI'2016)
- [x] [MASTER](configs/textrecog/master/README.md) (PR'2021)
- [x] [NRTR](configs/textrecog/nrtr/README.md) (ICDAR'2019)
- [x] [RobustScanner](configs/textrecog/robust_scanner/README.md) (ECCV'2020)
- [x] [SAR](configs/textrecog/sar/README.md) (AAAI'2019)
- [x] [SATRN](configs/textrecog/satrn/README.md) (CVPR'2020 Workshop on Text and Documents in the Deep Learning Era)
- [x] [SVTR](configs/textrecog/svtr/README.md) (IJCAI'2022)

</details>

<details open>
<summary>Key Information Extraction</summary>

- [x] [SDMG-R](configs/kie/sdmgr/README.md) (ArXiv'2021)

</details>

<details open>
<summary>Text Spotting</summary>

- [x] [ABCNet](projects/ABCNet/README.md) (CVPR'2020)

</details>

Please refer to [model_zoo](https://mmocr.readthedocs.io/en/dev-1.x/modelzoo.html) for more details.

## Contributing

We appreciate all contributions to improve MMOCR. Please refer to [CONTRIBUTING.md](.github/CONTRIBUTING.md) for the contributing guidelines.

## Acknowledgement

MMOCR is an open-source project with contributions from researchers and engineers at various colleges and companies. We appreciate all the contributors who implement their methods or add new features, as well as the users who give valuable feedback.
We hope the toolbox and benchmark can serve the growing research community by providing a flexible toolkit to reimplement existing methods and develop new OCR methods.

## Citation

If you find this project useful in your research, please consider citing:

```bibtex
@article{mmocr2021,
    title={MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding},
    author={Kuang, Zhanghui and Sun, Hongbin and Li, Zhizhong and Yue, Xiaoyu and Lin, Tsui Hin and Chen, Jianyong and Wei, Huaqiang and Zhu, Yiqin and Gao, Tong and Zhang, Wenwei and Chen, Kai and Zhang, Wayne and Lin, Dahua},
    journal={arXiv preprint arXiv:2108.06543},
    year={2021}
}
```

## License

This project is released under the [Apache 2.0 license](LICENSE).

## Projects in OpenMMLab

- [MMEngine](https://github.com/open-mmlab/mmengine): OpenMMLab foundational library for training deep learning models.
- [MMCV](https://github.com/open-mmlab/mmcv): OpenMMLab foundational library for computer vision.
- [MIM](https://github.com/open-mmlab/mim): MIM installs OpenMMLab packages.
- [MMClassification](https://github.com/open-mmlab/mmclassification): OpenMMLab image classification toolbox and benchmark.
- [MMDetection](https://github.com/open-mmlab/mmdetection): OpenMMLab detection toolbox and benchmark.
- [MMDetection3D](https://github.com/open-mmlab/mmdetection3d): OpenMMLab's next-generation platform for general 3D object detection.
- [MMRotate](https://github.com/open-mmlab/mmrotate): OpenMMLab rotated object detection toolbox and benchmark.
- [MMSegmentation](https://github.com/open-mmlab/mmsegmentation): OpenMMLab semantic segmentation toolbox and benchmark.
- [MMOCR](https://github.com/open-mmlab/mmocr): OpenMMLab text detection, recognition, and understanding toolbox.
- [MMPose](https://github.com/open-mmlab/mmpose): OpenMMLab pose estimation toolbox and benchmark.
- [MMHuman3D](https://github.com/open-mmlab/mmhuman3d): OpenMMLab 3D human parametric model toolbox and benchmark.
- [MMSelfSup](https://github.com/open-mmlab/mmselfsup): OpenMMLab self-supervised learning toolbox and benchmark.
- [MMRazor](https://github.com/open-mmlab/mmrazor): OpenMMLab model compression toolbox and benchmark.
- [MMFewShot](https://github.com/open-mmlab/mmfewshot): OpenMMLab fewshot learning toolbox and benchmark.
- [MMAction2](https://github.com/open-mmlab/mmaction2): OpenMMLab's next-generation action understanding toolbox and benchmark.
- [MMTracking](https://github.com/open-mmlab/mmtracking): OpenMMLab video perception toolbox and benchmark.
- [MMFlow](https://github.com/open-mmlab/mmflow): OpenMMLab optical flow toolbox and benchmark.
- [MMEditing](https://github.com/open-mmlab/mmediting): OpenMMLab image and video editing toolbox.
- [MMGeneration](https://github.com/open-mmlab/mmgeneration): OpenMMLab image and video generative models toolbox.
- [MMDeploy](https://github.com/open-mmlab/mmdeploy): OpenMMLab model deployment framework.

## Welcome to the OpenMMLab community

Scan the QR codes below to follow the OpenMMLab team's [**Zhihu Official Account**](https://www.zhihu.com/people/openmmlab) and join the OpenMMLab team's [**QQ Group**](https://jq.qq.com/?_wv=1027&k=aCvMxdr3), add us on WeChat to join the official WeChat communication group, or join our [**Slack**](https://join.slack.com/t/mmocrworkspace/shared_invite/zt-1ifqhfla8-yKnLO_aKhVA2h71OrK8GZw).

<div align="center">
<img src="https://raw.githubusercontent.com/open-mmlab/mmcv/master/docs/en/_static/zhihu_qrcode.jpg" height="400" />  <img src="https://raw.githubusercontent.com/open-mmlab/mmcv/master/docs/en/_static/qq_group_qrcode.jpg" height="400" />  <img src="https://raw.githubusercontent.com/open-mmlab/mmcv/master/docs/en/_static/wechat_qrcode.jpg" height="400" />
</div>

In the OpenMMLab community, we will:

- 📢 Share the latest core technologies of AI frameworks
- 💻 Explain the source code of common PyTorch modules
- 📰 Share news about OpenMMLab releases
- 🚀 Introduce cutting-edge algorithms developed by OpenMMLab
- 🏃 Provide more efficient answers and feedback
- 🔥 Provide a platform for communication with developers from all walks of life

The OpenMMLab community looks forward to your participation! 👬