Commit Graph

118 Commits (3869582decffd172e4a2c0ccf94efb4ec1b79c36)

Author SHA1 Message Date
topduke 8a52c99ad8
[New] add rec CPPD model (#10990)
* fix grid_sample data type bug when using fp16

* fix grid_sample data type bug when using fp16

* fix v4rec batchsize

* fix bug of hang when multi gpus training(sampler)

* add rec algorithm cppd

* delete cppd useless code

* update cppd bug

* add rec algorithm cppd

* update cppd trainedmodel url

* add cppd en doc
2023-09-25 15:43:45 +08:00
ToddBear 75d16610f4
Add new recognition method "ParseQ" (#10836)
* Update PP-OCRv4_introduction.md

* Update PP-OCRv4_introduction.md (#10616)

* Update PP-OCRv4_introduction.md

* Update PP-OCRv4_introduction.md

* Update PP-OCRv4_introduction.md

* Update README.md

* Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:Release/2.7 (#10655)

* Don't break overall processing on a bad image

* Add preprocessing common to OCR tasks
Add preprocessing to options

* Update requirements.txt (#10656)

added missing pyyaml library

* [TIPC]update xpu tipc script (#10658)

* fix-typo (#10642)

Co-authored-by: Dennis <dvorst@users.noreply.github.com>
Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com>

* Fix DSR error caused by data augmentation (#10662) (#10681)

* Fix DSR error caused by data augmentation

* Roll back erroneous change

* Update algorithm_overview_en.md (#10670)

Fixed simple spelling errors.

* Implement recognition method ParseQ

* Document update for new recognition method ParseQ

* add prediction for parseq

* Update rec_vit_parseq.yml

* Update rec_r31_sar.yml

* Update rec_r31_sar.yml

* Update rec_r50_fpn_srn.yml

* Update rec_vit_parseq.py

* Update rec_vit_parseq.yml

* Update rec_parseq_head.py

* Update rec_img_aug.py

* Update rec_vit_parseq.yml

* Update __init__.py

* Update predict_rec.py

* Update paddleocr.py

* Update requirements.txt

* Update utility.py

* Update utility.py

---------

Co-authored-by: xiaoting <31891223+tink2123@users.noreply.github.com>
Co-authored-by: topduke <784990967@qq.com>
Co-authored-by: dyning <dyning.2003@163.com>
Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com>
Co-authored-by: itasli <ilyas.tasli@outlook.fr>
Co-authored-by: Kai Song <50285351+USTCKAY@users.noreply.github.com>
Co-authored-by: dvorst <87502756+dvorst@users.noreply.github.com>
Co-authored-by: Dennis <dvorst@users.noreply.github.com>
Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com>
Co-authored-by: Dec20B <1192152456@qq.com>
Co-authored-by: ncoffman <51147417+ncoffman@users.noreply.github.com>
2023-09-07 16:36:47 +08:00
topduke 43abe2fa4b
V4Rec code pr (#9725)
* v4rec code

* v4rec add nrtrloss

* Add V4rec backbone file

* Add V4Rec config file.

* Fix V4rec reparameters when export_model

* convert lvnetv3

* fix codestyle

* fix infer_rec v4rec
2023-04-19 16:04:23 +08:00
zhiminzhang0830 30201ef954
add satrn (#8433)
* add satrn

* Fix satrn export issue

* Standardize satrn config file

* Remove SATRNRecResizeImg

---------

Co-authored-by: zhiminzhang0830 <zhangzhimin04@baidu.com>
2023-02-08 11:25:50 +08:00
andyj 9059f747e6
support new version of numpy (#8921)
* add finetune en doc & test=document_fix

* fix dead link & test=document_fix

* fix dead link & test=document_fix

* update check img

* fix det res dtype

* update args default type & test=document_fix

* fix numpy version

* support numpy1.24.0
2023-01-30 16:05:14 +08:00
andyjpaddle 27f4e8a857 rm visionlan invalid params 2022-10-27 07:51:45 +00:00
zhoujun 3907c72a08
Merge pull request #7940 from dorren002/new_branch
add handwritten mathematical expression recognition algorithm CAN
2022-10-19 19:28:59 +08:00
andyj 38290a2853
update visionlan (#7989) 2022-10-19 17:12:12 +08:00
dorren f875556541 update can transform method and add copyright info for new file 2022-10-17 17:18:14 +08:00
andyjpaddle 9606bec16a fix visionlan default dict 2022-10-17 07:41:36 +00:00
dorren c57effb84f update can data loading method and tipc configs, revert precommit config 2022-10-17 15:04:42 +08:00
zhoujun 823a8391f1
Merge pull request #7741 from zhiminzhang0830/rfl_branch
add text recognition algorithm rflearning
2022-10-10 14:06:13 +08:00
littletomatodonkey 60559a7ec1
add ser+re hub serving (#7844)
* add ser+re hub serving

* fix
2022-10-09 13:28:12 +08:00
zhiminzhang0830 0002349df3 add text recognition algorithm rflearning 2022-09-27 10:54:31 +08:00
huangjun12 b5268dc3a0 add centripetal text model 2022-09-15 11:08:16 +00:00
WenmuZhou ec22e60cb4 fix boxlabel error 2022-08-16 03:34:40 +00:00
WenmuZhou 4369552ea2 add in and out box_format 2022-08-15 10:04:06 +00:00
WenmuZhou ba284a2b74 Merge branch 'dygraph' of https://github.com/PaddlePaddle/PaddleOCR into table_pr 2022-08-14 09:07:19 +00:00
xiaoting 7054013004
Submit SR model (#6933)
* add sr model

* update for eval

* submit sr

* polish code

* polish code

* polish code

* update sr model

* update doc

* update doc

* update doc

* fix typo

* format code

* update metric

* fix export
2022-08-12 10:49:54 +08:00
WenmuZhou d5ea6f21f8 Merge branch 'dygraph' of https://github.com/PaddlePaddle/PaddleOCR into table_pr 2022-08-09 11:36:49 +00:00
WenmuZhou 92f8552121 add "<td></td>" to dict when "<td></td>" not in file 2022-08-09 10:50:30 +00:00
andyjpaddle 3f65b360ef
Add rec algo VisionLAN (#6943)
* add vl

* add vl

* add vl

* add ref

* fix head out

* add visionlan doc

* fix vl infer

* update dict
2022-08-09 11:29:43 +08:00
WenmuZhou 6fe01d43dc Merge branch 'dygraph' of https://github.com/PaddlePaddle/PaddleOCR into table_pr 2022-08-08 06:58:15 +00:00
WenmuZhou ddaa2c2552 add SLANet 2022-08-08 06:50:27 +00:00
littletomatodonkey 9e4ae9dc12
add vqa code (#7096)
* add vqa code

* add order ocr info

* rename tb-yx order

* polish configs

* add trt offline-tuning

* fix seed and remove unused configs
2022-08-06 15:41:20 +08:00
WenmuZhou 160e5e384a rename 2.0 to 2_0 2022-08-02 19:41:29 +08:00
smilelite 484bf2f7dc modified SPINLabelEncode SPINLabelDecode 2022-07-14 22:26:10 +08:00
smilelite f614274672 modified label_ops 2022-07-12 22:15:00 +08:00
smilelite cb370419ec modified pr 2022-07-11 23:59:45 +08:00
smilelite c11ec3ce5b merge upstream 2022-07-10 10:21:18 +08:00
littletomatodonkey 7a99588dd8
add more dataset yamls and fix re exceptions (#6791)
* add more dataset yamls and fix re exceptions
2022-07-06 13:58:46 +08:00
WenmuZhou adda5ddce9 fix bug 2022-07-05 02:51:07 +00:00
WenmuZhou d1b31bf85c add ref 2022-07-04 10:22:25 +00:00
WenmuZhou 929ee4669f Merge branch 'dygraph' of https://github.com/PaddlePaddle/PaddleOCR into tablemaster 2022-07-04 09:21:43 +00:00
WenmuZhou 8d46a1fbbe add dygraph2static support of layoutlm series SER model 2022-07-01 08:52:08 +00:00
littletomatodonkey e13ec733a6
unify kie and ser for vqa data format (#6704)
* unify kie and ser for vqa data format

* fix config and label ops

* fix doc

* add distort bbox
2022-06-30 15:23:31 +08:00
topduke 671415bbd8
[New Rec]add rec ViTSTR & ABINet algorithm. (#6414)
* add rec vitstr algorithm.

* fix cpu_thread and precision

* fix svtr tipc

* modify vitstr name

* modify vitstr config batchsize

* [New Rec] add vitstr and ABINet

* add rec_resnet45

* svtr ch large model

* [application] svtr ch model

* [application] svtr ch model

* [application] svtr ch model

* add abinet_rec_aug and trained model

* aug p infe

* fix ci export bug

* fix abinet ci bug
2022-06-28 15:06:53 +08:00
WenmuZhou 5dfcc98306 fix bug 2022-06-27 09:37:07 +00:00
LDOUBLEV 715fc2377e fix 2022-06-22 12:29:06 +08:00
LDOUBLEV 114d3399b8 fix training 2022-06-21 17:59:44 +08:00
LDOUBLEV 4957a923fb fix training 2022-06-21 17:55:49 +08:00
WenmuZhou a0c33908d5 add TableMaster 2022-06-16 13:24:38 +00:00
smilelite bbca1e0d66 add pr 2022-06-12 13:53:29 +08:00
LDOUBLEV a6f5282973 fix 2022-05-12 16:14:18 +08:00
LDOUBLEV e94d674623 fix 2022-05-12 16:12:09 +08:00
tink2123 9f5d0d59ef fix mkldnn for ppocrv3, and fix some typo 2022-04-27 06:24:28 +00:00
andyjpaddle f6532a0e51
add ppocrv3 rec (#6033)
* add ppocrv3 rec
2022-04-26 16:19:31 +08:00
OneYearIsEnough 6e607a0fa1
[Feature] Add PREN Scene Text Recognition Model(Accepted in CVPR2021) (#5563)
* [Feature] add PREN scene text recognition model

* [Patch] Optimize yml File

* [Patch] Save Label/Pred Preprocess Time Cost

* [BugFix] Modify Shape Conversion to Fit Inference Model Export

* [Patch] ?

* [Patch] ?

* What's going on...
2022-02-28 21:48:00 +08:00
WenmuZhou 1bcbd31815 add layoutlmv2 2022-02-12 07:17:38 +00:00
WenmuZhou c703a5891a fix re infer bug 2022-01-12 02:29:07 +00:00