243 Commits (914c8af7bf9b48ccf904f126f7bd15e9ee07afac)

Author SHA1 Message Date
gaotongxiao cd4e520cb9 Add KIEDataSample 2022-07-21 10:51:03 +08:00
wangxinyu c0c0f4b565 [PSE] PSE Postprocessor 2022-07-21 10:51:03 +08:00
wangxinyu 4a04982806 [PSE] PSE Loss 2022-07-21 10:51:03 +08:00
gaotongxiao 0716c97cf6 Add BoundedScaleAspectJitter 2022-07-21 10:51:03 +08:00
wangxinyu 00ba46b5b9 [PSE] PSE Neck FPNF 2022-07-21 10:51:03 +08:00
wangxinyu 05990c58d9 [Refactor] PSE Head 2022-07-21 10:51:03 +08:00
gaotongxiao 490d6cd806 Fix MaskedCELoss 2022-07-21 10:51:03 +08:00
jiangqing.vendor fded755af2 [FCENet] Add FCENet loss 2022-07-21 10:51:03 +08:00
Mountchicken 17606c25fc add fce head 2022-07-21 10:51:02 +08:00
liukuikun 200899b2a0 [PAN Postprocessor]pan postprocessor 2022-07-21 10:51:02 +08:00
jiangqing.vendor 13920924ce [MASTER] Add master plugin 2022-07-21 10:51:02 +08:00
liukuikun b6e031666b [PAN] pan head 2022-07-21 10:51:02 +08:00
jiangqing.vendor 55c99dd0c1 [Update] Update TextDetRandomCropFlip 2022-07-21 10:51:02 +08:00
liukuikun 7c3789d64e [PANLoss] pan loss 2022-07-21 10:51:02 +08:00
liukuikun d636adeb1f fix ut 2022-07-21 10:51:02 +08:00
gaotongxiao 8f7c0e2977 Fix imgaug 2022-07-21 10:51:02 +08:00
gaotongxiao 21d0dd71dc [TextSnake] Refactor textsnake loss 2022-07-21 10:51:02 +08:00
wangxinyu bf7c738798 [TextSnake] TextSnake Neck 2022-07-21 10:51:02 +08:00
wangxinyu acd2bcc452 [TextSnake] TextSnake Postprocessor 2022-07-21 10:51:02 +08:00
wangxinyu f7731c43bd [TextSnake] TextSnake Head 2022-07-21 10:51:02 +08:00
liukuikun a353a28a1a fix test path 2022-07-21 10:51:02 +08:00
jiangqing.vendor a135580912 [MASTER] Add Master decoder 2022-07-21 10:51:01 +08:00
liukuikun f03ed3ce11 [Transform] RandomFlip 2022-07-21 10:51:01 +08:00
liukuikun d5a2d20574 [Enhance] ce ignore char 2022-07-21 10:51:01 +08:00
wangxinyu b44869059b [SATRN] SATRN Backbone 2022-07-21 10:51:01 +08:00
wangxinyu 401088913b [SATRN] SATRN Encoder 2022-07-21 10:51:01 +08:00
gaotongxiao 35e5138b5d Cleanup test tmp files in test crnn decoder 2022-07-21 10:51:01 +08:00
liukuikun dfe93dc7d2 [Transform] ScaleAspectJitter 2022-07-21 10:51:01 +08:00
gaotongxiao da175b44a4 [Fix] Fix RandomRotate 2022-07-21 10:51:01 +08:00
jiangqing.vendor 50f229d9fe [NRTR] NRTR Encoder 2022-07-21 10:51:01 +08:00
jiangqing.vendor 8614070e36 [NRTR] NRTR Decoder 2022-07-21 10:51:01 +08:00
jiangqing.vendor d41921f03d [NRTR] NRTR backbone 2022-07-21 10:51:01 +08:00
jiangqing.vendor 25e819f6bf [Fix] Fix TextDetRandomCrop 2022-07-21 10:51:01 +08:00
jiangqing.vendor b3b1ef146b [Fix] Check transform's unit test and visualize 2022-07-21 10:51:01 +08:00
jiangqing.vendor d859fcad1c [Update] Add toy dataset in 2.0 form for test 2022-07-21 10:51:00 +08:00
liukuikun 2a852f23b5 [Fix] fix hmean iou 2022-07-21 10:51:00 +08:00
gaotongxiao b585dbcdd7 Fix score field in DBPostprocessor 2022-07-21 10:51:00 +08:00
gaotongxiao a4952a6dd6 Fix RandomCrop 2022-07-21 10:51:00 +08:00
liukuikun 38eef984c2 [Refactor]CELoss 2022-07-21 10:51:00 +08:00
liukuikun f4a8e0f3a9 [Fix] fix metric and ut 2022-07-21 10:51:00 +08:00
gaotongxiao 8396b2014e Refactor FPNC 2022-07-21 10:51:00 +08:00
gaotongxiao 3a9f9e6b61 Handling a corner case in offset_polygon 2022-07-21 10:51:00 +08:00
gaotongxiao 32ef9cc3cf [DBNet] Add DBHead 2022-07-21 10:51:00 +08:00
gaotongxiao 7a66a84b64 [DBNet] Add DBPostProcessor 2022-07-21 10:51:00 +08:00
gaotongxiao cd3d173b18 [DBNet] Add DBLoss 2022-07-21 10:51:00 +08:00
gaotongxiao 747b2a14dc Refactor ResNet backbone 2022-07-21 10:51:00 +08:00
gaotongxiao 43c50eee82 fix loss 2022-07-21 10:51:00 +08:00
gaotongxiao 1e1da7b395 [DBNet] Add MaskedSmoothL1Loss, MaskedBalancedBCELoss and MaskedDiceLoss 2022-07-21 10:50:59 +08:00
gaotongxiao 0f0f68baf1 Fix crop_polygon, Resize and poly2shapely 2022-07-21 10:50:59 +08:00
liukuikun bbbefaeb31 build LabelData in baseprocessor 2022-07-21 10:50:59 +08:00
liukuikun 7582fdea41 [Refactor] CTCLoss 2022-07-21 10:50:59 +08:00
liukuikun 3aae157aec [Refactor] crnn decoder 2022-07-21 10:50:59 +08:00
jiangqing.vendor f173cd3543 [Refactor] Refactor WordMetric and CharMetric 2022-07-21 10:50:59 +08:00
liukuikun 4fd048aa24 [Fix] fix base recog loss 2022-07-21 10:50:59 +08:00
liukuikun fe43259a05 [Refactor] train and test 2022-07-21 10:50:59 +08:00
wangxinyu ee48713a89 [Refactor] TextDetLocalVisualizer 2022-07-21 10:50:59 +08:00
jiangqing.vendor c78be99f6b [Refactor] Refactor TextRecogVisualizer 2022-07-21 10:50:59 +08:00
gaotongxiao 7e7a526f37 New Hmean-iou metric 2022-07-21 10:50:59 +08:00
liukuikun f47f3eff03 fix polygon type 2022-07-21 10:50:58 +08:00
liukuikun 05e31e09bc [Fix] base decoder forget passing dictionary 2022-07-21 10:50:58 +08:00
jiangqing.vendor 4706cc7eca [Refactor] Refactor SquareResizePad 2022-07-21 10:50:58 +08:00
liukuikun 2f4679e908 LoadAnnotations 2022-07-21 10:50:58 +08:00
gaotongxiao be30df5d50 Rename TextRecog directories 2022-07-21 10:50:58 +08:00
gaotongxiao f820a50752 Rename TextDet directories 2022-07-21 10:50:58 +08:00
liukuikun 84a61ba816 RescaleToHeight and PadToWidth 2022-07-21 10:50:58 +08:00
liukuikun a05e3f19c5 [Feature] TextRecogPostprocessor 2022-07-21 10:50:58 +08:00
liukuikun e8f57d6540 [Feature] PackDet/RecogInput 2022-07-21 10:50:58 +08:00
jiangqing.vendor 5dc791adbb [Refactor] Refactor randomscaling 2022-07-21 10:50:58 +08:00
jiangqing.vendor d2808e6b84 [refactor] Refactor Crop-Related Operations 2022-07-21 10:50:58 +08:00
gaotongxiao 7b6778c5d8 add torchvisionwrapper 2022-07-21 10:50:58 +08:00
liukuikun 0b5d2df310 [Refactor] BaseDecoder 2022-07-21 10:50:58 +08:00
liukuikun 6cd38a038f [Refactor] base recog loss 2022-07-21 10:50:57 +08:00
wangxinyu ac4eb34843 [Refactor] RandomCropFlip 2022-07-21 10:50:57 +08:00
gaotongxiao 79186b61ec ImgAug: ignores->ignored 2022-07-21 10:50:57 +08:00
wangxinyu 178030bad6 [Refactor] Refactor transform.RandomRotate 2022-07-21 10:50:57 +08:00
jiangqing.vendor f29853d9cd [Feature] Add Resize 2022-07-21 10:50:57 +08:00
gaotongxiao 6478499073 add mode to rescale_polygon(s) 2022-07-21 10:50:57 +08:00
gaotongxiao df2f7b69db Add recognition data migrator 2022-07-21 10:50:57 +08:00
gaotongxiao 7b09da485b [Fix] Support textrecog, add more tests 2022-07-21 10:50:57 +08:00
liukuikun 0f041d4250 [Fix] update LabelData import 2022-07-21 10:50:57 +08:00
gaotongxiao f6b72b244b Add ImgAug and tests 2022-07-21 10:50:57 +08:00
gaotongxiao cb85f857aa Add BaseTextDetPostProcessor 2022-07-21 10:50:57 +08:00
gaotongxiao f0c6d44ce8 Add dump_ocr_data 2022-07-21 10:50:56 +08:00
liukuikun 98bc90bd1c [Feature] TextRecogSample 2022-07-21 10:50:56 +08:00
liukuikun c920edfb3a [Refactor] split labelconverter to Dictionary 2022-07-21 10:50:56 +08:00
liukuikun c47c5711c1 [Feature] TextDetSample 2022-07-21 10:50:56 +08:00
wangxinyu.vendor 41c1671e7b Refactor PyramidRescale 2022-07-21 10:50:56 +08:00
gaotongxiao b5fc589320 Add CI 2022-07-21 10:50:55 +08:00
gaotongxiao 69e6c80558 move tests to old_tests, add empty test folders 2022-07-21 10:50:55 +08:00
Tong Gao 1ad4caa7cb
[Enhancement] Allow choosing metrics to report in text recognition tasks (#989)
* [Enhancement] Allow choosing metrics to report in text recognition tasks

* add acc support

* refine docstr
2022-05-05 16:10:38 +08:00
Jianquan Ye 061d353b2a
[Model] Add MASTER (#807)
* fix #794: add MASTER

* fix conflict add MASTER

* fix conflict add MASTER

* fix conflict add MASTER

* fix conflict add MASTER

* fix conflict add MASTER

* fix conflict add MASTER

* fix conflict add MASTER

* Fix linting

* after git rebase main

* after git rebase main

* fix conflict add MASTER

* fix conflict add MASTER

* after git rebase main

* fix conflict add MASTER

* fix conflict add MASTER

* fix conflict add MASTER

* after git rebase main

* add GCAModule to plugins

* coexist master and master_old

* fix merge mmocr 0.5.0 conflict

* fix lint error

* update

* [fix] remove remains in __init__

* [update] update code in review

* update readme for master

* Add docstr to MasterDecoder, refined MasterDecoder, remove MASTERLoss

* Unify the output length of MasterDecoder in train and test mode; add test for it, remove MasterLoss

* update readme

* update

* update metafile,README,demo/README,config,ocr.py

* Update mmocr/utils/ocr.py

* update

Co-authored-by: gaotongxiao <gaotongxiao@gmail.com>
Co-authored-by: Mountchicken <mountchicken@outlook.com>
2022-05-05 16:06:15 +08:00
Xinyu Wang fbc138d2d3
[Feature] Add DBNet++ (#973)
* add dbnet++

* fix docstring

* fix some param names

* fix

* fix docstring

* add init

* add doc; remove configs

* add dbnet++ to readme

* fix readme

* update config

* update readme

* update readme

* update ocr.py

* update metafile.yml

* update readme

* update readme

* move to dbnetpp

* fix paths

* fix head level

* fix typo

* update demo.md

* Update configs/textdet/dbnetpp/README.md

Co-authored-by: Tong Gao <gaotongxiao@gmail.com>

* fix typo

* fix link
2022-05-05 14:39:20 +08:00
Tong Gao 6b180db93d
[Enhancement] Automatically report mean scores when applicable (#995) 2022-05-05 12:57:19 +08:00
Hongbin Sun a2d741b8a7
[Feature] Add labelme converter for textdet and textrecog (#972)
* add labelme converter

* move to common

* add labelme sample annos

* add doc

* remove useless field generated by labelme to reduce size

* add recog_format option; add skip ignored instances while cropping

* set warp as false by default

* update doc

* fix typo

Co-authored-by: xinke-wang <wangxinyu2017@gmail.com>
Co-authored-by: Xinyu Wang <45810070+xinke-wang@users.noreply.github.com>
2022-05-03 17:28:22 +08:00
Qing Jiang c91763ad09
[Fix] update label.lmdb (#991)
* loss

* fix

* hotfix
2022-04-29 23:26:32 +08:00
Qing Jiang 92ef554a82
[Feature] Add recog2lmdb and new toy dataset files (#979)
* loss

* fix

* add img2lmdb and test files

* update

* add reference

* fix lint

* fix typo

* use total_number instead to fit mmocr's lmdbloader

* reorganize and update

* fix lint

* update test file

* refactor and update

* fix test

* update doc in tools

* fix lint

* update old lmdb test file

* update

* mask the unittest for recog2lmdb and use json format for label_only

* remove if __name__

* fix case, doc, typo, formats

* fix typos

* fix docs and variable names

* Apply suggestions from code review

Co-authored-by: Xinyu Wang <45810070+xinke-wang@users.noreply.github.com>

* update test_loader.py and fix a bug

Co-authored-by: gaotongxiao <gaotongxiao@gmail.com>
Co-authored-by: Xinyu Wang <45810070+xinke-wang@users.noreply.github.com>
2022-04-29 22:30:36 +08:00
Qing Jiang d8549ade85
[Feature] Support loading images and labels from lmdb file (#982)
* loss

* fix

* add LoadImageFromLMDB

* update

* new version

* add json

* fix bug

* Compatible with older versions

* modify test_loader

* add docstring and update test_loader

* fix a bug

* fix docstring and update doc

* update

* Update mmocr/datasets/utils/backend.py

Co-authored-by: Xinyu Wang <45810070+xinke-wang@users.noreply.github.com>

* Update docs/en/tutorials/dataset_types.md

Co-authored-by: Xinyu Wang <45810070+xinke-wang@users.noreply.github.com>

* fix lint

Co-authored-by: Xinyu Wang <45810070+xinke-wang@users.noreply.github.com>
2022-04-29 15:46:36 +08:00
Tong Gao 064a2b8c45
[Feature] Support computing mean scores in UniformConcatDataset (#981)
* Get avg results in UniformConcatDataset

* add docstr

* Fix

* fix test

* fix typo
2022-04-29 14:48:21 +08:00
Tong Gao 0c8fa52b22
[Fix] Fix ImgAug transform (#949)
* [Fix] Fix bbox case in ImgAug

* Finalize the augmentation and add tests
2022-04-28 22:52:11 +08:00
Tong Gao 888f700134
[Enhancement] Support dynamic threshold range in eval_hmean (#962)
* [Enhancement] Support dynamic threshold range in eval_hmean

* upgrade textdetdataset, add deprecate warning
2022-04-22 17:07:36 +08:00