16 Commits

Author SHA1 Message Date
Xinyu Wang
14c75da7bd
[Feature] Add FUNSD Converter (#808)
* Add FUNSD Converter

* Update tools/data/textrecog/funsd_converter.py

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* Update tools/data/textrecog/funsd_converter.py

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* Update tools/data/textdet/funsd_converter.py

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* blank line between sections

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* fix incorrect docstrings

* fix docstrings & fix timer

* add --preserve-vertical arg for preserving vertical texts

* fix --preserve-vertical

* [doc] fix recog.md incorrect description

* fix docstring style

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* fix docstring spaces

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
2022-03-04 12:25:54 +08:00
Tong Gao
218f9f08d4
[Fix] Use yaml.safe_load instead of load (#753) 2022-01-26 14:29:30 +08:00
liukuikun
2f429d5e40
Extend totaltext converter to support text fields (#728)
* Extend totaltext converter to support text fields

* fix bug

* fix comment typo

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>
2022-01-14 16:00:53 +08:00
liukuikun
c736989615
[Feature] Extend ctw1500 converter to support text fields (#729)
* Extend ctw1500 converter to support text fields

* remove args for debug
2022-01-14 15:30:48 +08:00
Tong Gao
bdbeb69076
[Fix] Remove deprecated image sanity check (#661) 2021-12-10 12:50:41 +08:00
Tong Gao
b8f7ead74c
[Enhancement] Add copyright info (#439)
* add copyright info
2021-08-17 17:39:30 +08:00
Tong Gao
884755d05d
Fix #112: Remove the need of drop_orientation_info in data preprocessing steps (#375)
* ctw1500 ignore orientation

* restore maskrcnn config

* ignore_orientation support for icdar datasets

* update docs

* ignore orientation for total text

* Add LoadOCRImageFromFile

* Fix typo

* simplify design

* remove LoadOCRImageFromFile

* update chinese docs
2021-07-20 23:02:25 +08:00
quincylin1
243f47dc03
add totaltext for recog and det (#357)
* add totaltext for recog and det

* add setup

* fix doc

* fix based on comments
2021-07-08 21:52:50 +08:00
GT
e6cb750922
add TextOCR dataset converter (#293)
* textocr converter for text recog

* textocr converter for text detection

* update documentation

* remove unnecessary garbage collection lines

* multi-processing textocr converter

* json->mmcv, fix documentation
2021-06-21 03:06:10 +00:00
quincylin1
271129f812
Feature/iss 262 (#266)
* fix issue#262

* fix #262: modified totaltext_converter and added totaltext for datasets.md

* fix issue#262: modified datasets.md

* fix issue#262: removed download json

* Update totaltext_converter.py

Co-authored-by: Hongbin Sun <hongbin306@gmail.com>
2021-06-08 13:13:22 +00:00
Hongbin Sun
4882c8a317
dataset preparation docs (#255) 2021-06-01 21:59:40 +08:00
lizz
b10b6408ef
Add list_from_file and list_to_file (#226)
* Add list_from_file and list_to_file

Signed-off-by: lizz <lizz@sensetime.com>

* Add test list_to_file and list_from_file

* more

* Fix tests
2021-05-24 06:01:42 +00:00
lizz
06b75780a0
Fix typos (#207)
Signed-off-by: lizz <lizz@sensetime.com>
2021-05-18 05:44:52 +00:00
Hongbin Sun
b058fdcb4e
mv data_convert_util to mmocr (#96)
* mv data_convert_util to mmocr

* update

* rm bracket
2021-04-19 21:03:52 +08:00
lizz
44ca9c2a61
Remove usage of \ (#49)
* Remove usage of \

Signed-off-by: lizz <lizz@sensetime.com>

* rebase

Signed-off-by: lizz <lizz@sensetime.com>

* typos

Signed-off-by: lizz <lizz@sensetime.com>

* Remove test dependency on tools/

Signed-off-by: lizz <lizz@sensetime.com>

* Remove usage of \

Signed-off-by: lizz <lizz@sensetime.com>

* rebase

Signed-off-by: lizz <lizz@sensetime.com>

* typos

Signed-off-by: lizz <lizz@sensetime.com>

* Remove test dependency on tools/

Signed-off-by: lizz <lizz@sensetime.com>

* typo

Signed-off-by: lizz <lizz@sensetime.com>

* KIE in keywords

Signed-off-by: lizz <lizz@sensetime.com>

* some renames

Signed-off-by: lizz <lizz@sensetime.com>

* kill isort skip

Signed-off-by: lizz <lizz@sensetime.com>

* aggregation discrimination

Signed-off-by: lizz <lizz@sensetime.com>

* aggregation discrimination

Signed-off-by: lizz <lizz@sensetime.com>

* tiny

Signed-off-by: lizz <lizz@sensetime.com>

* fix bug: model infer on cpu

Co-authored-by: Hongbin Sun <hongbin306@gmail.com>
2021-04-06 12:16:46 +00:00
jeffreykuang
31f1c007c0
[feature]: add textdet 2021-04-03 01:03:52 +08:00