29 Commits

Author SHA1 Message Date
Xinyu Wang
14c75da7bd
[Feature] Add FUNSD Converter (#808)
* Add FUNSD Converter

* Update tools/data/textrecog/funsd_converter.py

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* Update tools/data/textrecog/funsd_converter.py

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* Update tools/data/textdet/funsd_converter.py

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* blank line between sections

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* fix incorrect docstrings

* fix docstrings & fix timer

* add --preserve-vertical arg for preserving vertical texts

* fix --preserve-vertical

* [doc] fix recog.md incorrect description

* fix docstring style

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* fix docstring spaces

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
2022-03-04 12:25:54 +08:00
Tong Gao
ac4462f374
[Feature] Add CurvedSyntext150k Converter (#719)
* [Feature] Add bezier_to_polygon to box_util

* Add num_sample to parameter

* add sort_point util

* update docstring

* Add curvedsyntext converter
2022-03-02 11:02:14 +08:00
Tong Gao
3110ab7863
[Enhancement] Add windows CI (#790)
* [Enhancement] Add windows CI

* [Enhancement] Add windows CI

* update

* update

* update

* [Fix] using assert will keep lmdb file opend and fail to cleanup in test_loader.py

* [Fix] map size should be small on windows in lmdb_util.py

* [Fix] Fix some bugs

* [Fix] Fix some bugs

* [Fix] Fix some bugs

* remove comments & fix bugs

Co-authored-by: Mountchicken <mountchicken@outlook.com>
2022-03-02 10:34:15 +08:00
Tong Gao
91f98bc645
[Enhancement] Add open-mmlab precommit hook (#787) 2022-02-22 12:52:04 +08:00
Tong Gao
218f9f08d4
[Fix] Use yaml.safe_load instead of load (#753) 2022-01-26 14:29:30 +08:00
liukuikun
2f429d5e40
Extend totaltext converter to support text fields (#728)
* Extend totaltext converter to support text fieldols/

* fix bug

* fix comment typo

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
2022-01-14 16:00:53 +08:00
liukuikun
c736989615
[Feature] Extend ctw1500 converter to support text fields (#729)
* Extend ctw1500 converter to support text fieldols/

* remove args for debug
2022-01-14 15:30:48 +08:00
Tong Gao
bdbeb69076
[Fix] Remove depreciated image sanity check (#661) 2021-12-10 12:50:41 +08:00
Hongbin Sun
a50b0c9fb9
[Feature] Support openset kie (#498)
* add openset kie dataset

* updare readme

* add anno convert script

* update docstring

* update script

* add & update docstring

* fix typo

* update docstring format
2021-11-11 14:47:38 +08:00
Darwin Bautista
80741e1479
[Feature] Add converter for the Open Images v5 text annotations by Krylov et al. (#497)
* Add converter for the OpenVINO annotations for Open Images by Krylov et al.

Open Images V5 Text Annotation and Yet Another Mask Text Spotter
Paper: https://arxiv.org/abs/2106.12326

* docs fix & add chinese docs
2021-10-28 16:49:36 +08:00
Tong Gao
d683b14283
[Fix] Totaltext_converter: skip invalid annotations (#438)
* [Fix] Skip invalid annoataions
2021-08-20 11:23:05 +08:00
Tong Gao
b8f7ead74c
[Enhancement] Add copyright info (#439)
* add copyright info
2021-08-17 17:39:30 +08:00
Tong Gao
884755d05d
Fix #112: Remove the need of drop_orientation_info in data preprocessing steps (#375)
* ctw1500 ignore orientation

* restore maskrcnn config

* ignore_orientation support for icdar datasets

* update docs

* ignore orientation for total text

* Add LoadOCRImageFromFile

* Fix typo

* simplify design

* remove LoadOCRImageFromFile

* update chinese docs
2021-07-20 23:02:25 +08:00
Tong Gao
02e3b98684
fix syntext_converter (#361) 2021-07-12 02:07:50 +00:00
quincylin1
243f47dc03
add totaltext for recog and det (#357)
* add totaltext for recog and det

* add setup

* fix doc

* fix based on comments
2021-07-08 21:52:50 +08:00
Tong Gao
68df4fbe80
[Feature] Add synthtext converter and update docs (#351)
* Add synthtext converter and update docs

* minor docs fix
2021-07-07 15:54:29 +08:00
GT
e6cb750922
add TextOCR dataset converter (#293)
* textocr converter for text recog

* textocr converter for text detection

* update documentation

* remove unnecessary garbage collection lines

* multi-processing textocr converter

* json->mmcv, fix documentation
2021-06-21 03:06:10 +00:00
quincylin1
d7fa9544e6
added totaltext recog converter (#273)
* added totaltext recog converter

* modified datasets.md and totaltext_converter.py

* added Note to datasets.md

* deleted comments
2021-06-11 11:09:35 +08:00
quincylin1
271129f812
Feature/iss 262 (#266)
* fix issue#262

* fix #262: modified totaltext_converter and added totaltext for datasets.md

* fix issue#262: modified datasets.md

* fix issue#262: removed download json

* Update totaltext_converter.py

Co-authored-by: Hongbin Sun <hongbin306@gmail.com>
2021-06-08 13:13:22 +00:00
Hongbin Sun
4882c8a317
dataset preparation docs (#255) 2021-06-01 21:59:40 +08:00
lizz
b10b6408ef
Add list_from_file and list_to_file (#226)
* Add list_from_file and list_to_file

Signed-off-by: lizz <lizz@sensetime.com>

* Add test list_to_file and list_from_file

* more

* Fix tests
2021-05-24 06:01:42 +00:00
lizz
06b75780a0
Fix typos (#207)
Signed-off-by: lizz <lizz@sensetime.com>
2021-05-18 05:44:52 +00:00
Hongbin Sun
b058fdcb4e
mv data_convert_util to mmocr (#96)
* mv data_convert_util to mmocr

* update

* rm bracket
2021-04-19 21:03:52 +08:00
Hongbin Sun
1a129a1e98
add svt converter (#65)
* add svt converter

* fix str fmt

* fix str fmt

* update convert script
2021-04-14 18:33:14 +08:00
lizz
44ca9c2a61
Remove usage of \ (#49)
* Remove usage of \

Signed-off-by: lizz <lizz@sensetime.com>

* rebase

Signed-off-by: lizz <lizz@sensetime.com>

* typos

Signed-off-by: lizz <lizz@sensetime.com>

* Remove test dependency on tools/

Signed-off-by: lizz <lizz@sensetime.com>

* Remove usage of \

Signed-off-by: lizz <lizz@sensetime.com>

* rebase

Signed-off-by: lizz <lizz@sensetime.com>

* typos

Signed-off-by: lizz <lizz@sensetime.com>

* Remove test dependency on tools/

Signed-off-by: lizz <lizz@sensetime.com>

* typo

Signed-off-by: lizz <lizz@sensetime.com>

* KIE in keywords

Signed-off-by: lizz <lizz@sensetime.com>

* some renames

Signed-off-by: lizz <lizz@sensetime.com>

* kill isort skip

Signed-off-by: lizz <lizz@sensetime.com>

* aggregation discrimination

Signed-off-by: lizz <lizz@sensetime.com>

* aggregation discrimination

Signed-off-by: lizz <lizz@sensetime.com>

* tiny

Signed-off-by: lizz <lizz@sensetime.com>

* fix bug: model infer on cpu

Co-authored-by: Hongbin Sun <hongbin306@gmail.com>
2021-04-06 12:16:46 +00:00
lizz
09ffd284ee Remove test dependency on tools
Signed-off-by: lizz <lizz@sensetime.com>
2021-04-06 10:57:25 +08:00
Hongbin Sun
3727cab712 fix #21: update sdmgr readme and config 2021-04-05 16:09:13 +08:00
jeffreykuang
31f1c007c0 [feature]: add textdet 2021-04-03 01:03:52 +08:00
Hongbin Sun
4ecd0cea8a add sar, seg and other components 2021-04-02 23:54:57 +08:00