ficodex
42d280907a
Fix ValueError in text detection due to ambiguous array evaluation ( #11287 )
...
This commit resolves the issue where `dt_boxes` was evaluated as a boolean
in Python 3.11, leading to a ValueError. The condition `if not dt_boxes:` is
replaced with `if dt_boxes.size == 0:` to explicitly check for an empty array,
ensuring compatibility with Python 3.11 and maintaining the intended functionality.
2023-11-22 20:10:32 +08:00
shiyutang
e3fc6393e0
[Cherry-pick] Cherry-pick from release/2.6 ( #11092 )
...
* Update recognition_en.md (#10059 )
ic15_dict.txt only have 36 digits
* Update ocr_rec.h (#9469 )
It is enough to include preprocess_op.h, we do not need to include ocr_cls.h.
* 补充num_classes注释说明 (#10073 )
ser_vi_layoutxlm_xfund_zh.yml中的Architecture.Backbone.num_classes所赋值会设置给Loss.num_classes,
由于采用BIO标注,假设字典中包含n个字段(包含other)时,则类别数为2n-1;假设字典中包含n个字段(不含other)时,则类别数为2n+1。
* Update algorithm_overview_en.md (#9747 )
Fix links to super-resolution algorithm docs
* 改进文档`deploy/hubserving/readme.md`和`doc/doc_ch/models_list.md` (#9110 )
* Update readme.md
* Update readme.md
* Update readme.md
* Update models_list.md
* trim trailling spaces @ `deploy/hubserving/readme_en.md`
* `s/shell/bash/` @ `deploy/hubserving/readme_en.md`
* Update `deploy/hubserving/readme_en.md` to sync with `deploy/hubserving/readme.md`
* Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md`
* Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md`
* Update `doc/doc_en/models_list_en.md` to sync with `doc/doc_ch/models_list_en.md`
* using Grammarly to weak `deploy/hubserving/readme_en.md`
* using Grammarly to tweak `doc/doc_en/models_list_en.md`
* `ocr_system` module will return with values of field `confidence`
* Update README_CN.md
* 修复测试服务中图片转Base64的引用地址错误。 (#8334 )
* Update application.md
* [Doc] Fix 404 link. (#10318 )
* Update PP-OCRv3_det_train.md
* Update knowledge_distillation.md
* Update config.md
* Fix fitz camelCase deprecation and .PDF not being recognized as pdf file (#10181 )
* Fix fitz camelCase deprecation and .PDF not being recognized as pdf file
* refactor get_image_file_list function
* Update customize.md (#10325 )
* Update FAQ.md (#10345 )
* Update FAQ.md (#10349 )
* Don't break overall processing on a bad image (#10216 )
* Add preprocessing common to OCR tasks (#10217 )
Add preprocessing to options
* [MLU] add mlu device for infer (#10249 )
* Create newfeature.md
* Update newfeature.md
* remove unused imported module, so can avoid PyInstaller packaged binary's start-time not found module error. (#10502 )
* CV套件建设专项活动 - 文字识别返回单字识别坐标 (#10515 )
* modification of return word box
* update_implements
* Update rec_postprocess.py
* Update utility.py
* Update README_ch.md
* revert README_ch.md update
* Fixed Layout recovery README file (#10493 )
Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one>
* update_doc
* bugfix
---------
Co-authored-by: ChuongLoc <89434232+ChuongLoc@users.noreply.github.com>
Co-authored-by: Wang Xin <xinwang614@gmail.com>
Co-authored-by: tanjh <dtdhinjapan@gmail.com>
Co-authored-by: Louis Maddox <lmmx@users.noreply.github.com>
Co-authored-by: n0099 <n@n0099.net>
Co-authored-by: zhenliang li <37922155+shouyong@users.noreply.github.com>
Co-authored-by: itasli <ilyas.tasli@outlook.fr>
Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com>
Co-authored-by: PeiyuLau <135964669+PeiyuLau@users.noreply.github.com>
Co-authored-by: kerneltravel <kjpioo2006@gmail.com>
Co-authored-by: ToddBear <43341135+ToddBear@users.noreply.github.com>
Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>
Co-authored-by: Shubham Chambhare <59397280+Shubham654@users.noreply.github.com>
Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one>
Co-authored-by: andyj <87074272+andyjpaddle@users.noreply.github.com>
2023-10-18 17:37:23 +08:00
Gmgge
2fdac319b1
fix:修复通道数不匹配造成的PPOCRLabel启动失败问题#10748,根据更新日志发现#10655,由于paddleocr中增加了对透明色的需求,由于在check_img中以及存在对单通道图像转三通道图像的处理,因此将该四通道图像处理成三通道的图像也放入该check_img,并统一三通道图像输出逻辑。 ( #10847 )
...
add:由于函数复杂度增高,增加注释。
2023-09-21 14:51:32 +08:00
ToddBear
75d16610f4
Add new recognition method "ParseQ" ( #10836 )
...
* Update PP-OCRv4_introduction.md
* Update PP-OCRv4_introduction.md (#10616 )
* Update PP-OCRv4_introduction.md
* Update PP-OCRv4_introduction.md
* Update PP-OCRv4_introduction.md
* Update README.md
* Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:Release/2.7 (#10655 )
* Don't break overall processing on a bad image
* Add preprocessing common to OCR tasks
Add preprocessing to options
* Update requirements.txt (#10656 )
added missing pyyaml library
* [TIPC]update xpu tipc script (#10658 )
* fix-typo (#10642 )
Co-authored-by: Dennis <dvorst@users.noreply.github.com>
Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com>
* 修改数据增强导致的DSR报错 (#10662 ) (#10681 )
* 修改数据增强导致的DSR报错
* 错误修改回滚
* Update algorithm_overview_en.md (#10670 )
Fixed simple spelling errors.
* Implement recoginition method ParseQ
* Document update for new recognition method ParseQ
* add prediction for parseq
* Update rec_vit_parseq.yml
* Update rec_r31_sar.yml
* Update rec_r31_sar.yml
* Update rec_r50_fpn_srn.yml
* Update rec_vit_parseq.py
* Update rec_vit_parseq.yml
* Update rec_parseq_head.py
* Update rec_img_aug.py
* Update rec_vit_parseq.yml
* Update __init__.py
* Update predict_rec.py
* Update paddleocr.py
* Update requirements.txt
* Update utility.py
* Update utility.py
---------
Co-authored-by: xiaoting <31891223+tink2123@users.noreply.github.com>
Co-authored-by: topduke <784990967@qq.com>
Co-authored-by: dyning <dyning.2003@163.com>
Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com>
Co-authored-by: itasli <ilyas.tasli@outlook.fr>
Co-authored-by: Kai Song <50285351+USTCKAY@users.noreply.github.com>
Co-authored-by: dvorst <87502756+dvorst@users.noreply.github.com>
Co-authored-by: Dennis <dvorst@users.noreply.github.com>
Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com>
Co-authored-by: Dec20B <1192152456@qq.com>
Co-authored-by: ncoffman <51147417+ncoffman@users.noreply.github.com>
2023-09-07 16:36:47 +08:00
Gmgge
3a6fe48afd
根据推理对三通道的图像需求,以及opencv中imread参数说明IMREAD_COLOR(If set, always convert image to the 3 channel BGR color image.),因此修改该读取参数,以解决后续通道不匹配问题。 ( #10777 )
2023-08-31 14:31:28 +08:00
UserUnknownFactor
b3912fcf7a
Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:dygraph ( #10654 )
...
* Don't break overall processing on a bad image
* Add preprocessing common to OCR tasks
Add preprocessing to options
2023-08-21 16:33:03 +08:00
WilliamQf
2bd552c801
实现功能:当--savefile为true时,在--output下以当前图片名称后接“.txt”为文件名保存ocr推理结果,解决了issues: ( #10628 )
...
* 实现功能:当--savefile为true时,在--output下以当前图片名称后接“.txt”为文件名保存ocr推理结果,解决了issues:
https://github.com/PaddlePaddle/PaddleOCR/issues/10533
* 删除了乱码字符
* 1.删除了乱码字符
* 1.删除了乱码字符
2023-08-21 10:25:47 +08:00
andyj
681467d4ea
[bug fix] fix none res in recovery ( #10603 )
...
* add finetune en doc & test=document_fix
* fix dead link & test=document_fix
* fix dead link & test=document_fix
* update check img
* fix det res dtype
* update args default type & test=document_fix
* fix numpy version
* support numpy1.24.0
* fix doc & test=document_fix
* update doc
* update doc, test=document_fix
* fix pdf2word in whl, test=document_fix
* fix none res in recovery
* update version
* format code
2023-08-10 16:55:26 +08:00
andyj
254786752a
[BUG FIX] Fix pdf2word in whl ( #10584 )
...
* add finetune en doc & test=document_fix
* fix dead link & test=document_fix
* fix dead link & test=document_fix
* update check img
* fix det res dtype
* update args default type & test=document_fix
* fix numpy version
* support numpy1.24.0
* fix doc & test=document_fix
* update doc
* update doc, test=document_fix
* fix pdf2word in whl, test=document_fix
2023-08-09 19:15:49 +08:00
xiaoting
2f70e4b7f6
upload paddleocr whl to pypi ( #10524 )
...
* upload paddleocr whl to pypi
* Update README_ch.md
* Update README_ch.md
* Update quickstart.md
* Update README_ch.md
* Update README_ch.md
2023-08-06 11:17:13 +08:00
Bryan YW
1556922669
Update paddleocr.py ( #10066 )
...
cherry-pick package compatibility PR
2023-07-18 11:36:49 +08:00
andyj
83beede6b5
Update check img ( #8558 )
...
* add finetune en doc & test=document_fix
* fix dead link & test=document_fix
* fix dead link & test=document_fix
* update check img
2022-12-07 09:51:54 +08:00
zhoujun
59b3eade31
Merge pull request #8066 from WenmuZhou/doc2
...
update PP-Structurev to PP-StructureV
2022-10-25 14:20:10 +08:00
WenmuZhou
cad701d411
fix benckmark error when benckmark=false
2022-10-24 17:10:05 +08:00
WenmuZhou
b2bc6b746c
update doc
2022-10-24 05:36:34 +00:00
WenmuZhou
2145d8c4ec
add recovery requirements to whl
2022-10-20 17:03:47 +08:00
an1018
99698aed54
add_pdf2docx_api
2022-10-12 21:28:48 +08:00
andyjpaddle
b947dc3fb3
fix whl for layout recovery
2022-10-11 08:42:47 +00:00
WenmuZhou
c647a6da28
add re to ppstructure system
2022-10-10 05:39:41 +00:00
andyj
059349ab74
Merge pull request #7834 from andyjpaddle/update_whl
...
update doc for whl
2022-10-08 19:21:53 +08:00
andyjpaddle
a5df6c34e0
update doc for whl
2022-10-08 08:28:30 +00:00
Xie Xuyang
f699d787ff
fix #7534 ( #7535 )
2022-09-29 11:19:44 +08:00
vivien
f9baf0eabc
update bytes support
2022-08-31 19:59:29 +08:00
Leif
975975d30e
Update PPOCRLabel
...
Update PPOCRLabel
2022-08-25 16:32:44 +08:00
an1018
8756253800
update doc
2022-08-24 10:07:30 +08:00
an1018
2976dab987
Merge branch 'dygraph' of https://github.com/PaddlePaddle/PaddleOCR into add_layout_hub
2022-08-24 09:04:59 +08:00
an1018
9c424ff164
update doc
2022-08-23 23:28:49 +08:00
an1018
d5d78b486b
update doc
2022-08-23 16:11:18 +08:00
WenmuZhou
f7b35bbbc4
do not print table result
2022-08-22 08:00:49 +00:00
littletomatodonkey
8c7c45420f
polish kie doc and code ( #7255 )
...
* add fapiao kie
* fix readme
* fix fanli
* add readme
* add how to do kie en
* add algo kie
* add algo overview en
* rename vqa to kie
* fix read gif
2022-08-21 10:55:49 +08:00
zhoujun
aec5dbf8f7
update model size ( #7269 )
...
* update model size
* update layout dict in whl
2022-08-20 16:10:45 +08:00
WenmuZhou
604f531b9a
add layout model
2022-08-17 04:40:07 +00:00
WenmuZhou
b26ce23774
rm unused code
2022-08-16 10:55:24 +00:00
WenmuZhou
bb53c8d100
add table model link
2022-08-16 10:46:09 +00:00
WenmuZhou
731688c2dd
add PP-Structurev2 to whl
2022-08-10 14:51:57 +00:00
andyjpaddle
e2b2f33092
update some note
2022-05-30 09:25:21 +00:00
WenmuZhou
f31980266a
merge upstream
2022-05-10 14:28:02 +08:00
andyjpaddle
565abdb632
update version
2022-05-10 02:46:13 +00:00
andyjpaddle
b16c3d4234
update v3 rec name
2022-05-10 02:41:54 +00:00
andyjpaddle
4d406f8825
update whl for rec image shape
2022-05-07 09:06:13 +00:00
andyjpaddle
cedf8b5669
update whl for multi lang
2022-05-07 06:05:47 +00:00
andyjpaddle
bd147c61a9
update whl for multi lang rec
2022-05-07 03:52:06 +00:00
andyjpaddle
738ff190dd
update doc for whl and python cpp infer quick
2022-04-29 03:38:59 +00:00
WenmuZhou
bbff7c55c1
The whl package supports separate table recognition and layout analysis
2022-04-22 13:24:45 +08:00
tink2123
3a28266735
update paddleocr version no
2022-04-02 09:36:48 +00:00
WenmuZhou
b3fbfc3109
rename OCRSystem to StructureSystem
2022-03-30 09:24:24 +00:00
WenmuZhou
463622b1f9
rm unused code
2022-03-23 14:26:40 +00:00
WenmuZhou
e693619888
add relative import
2022-03-23 14:22:06 +00:00
WenmuZhou
f85b3ea8eb
fix bug in whl import fce
2022-03-18 10:08:58 +00:00
WenmuZhou
496887f83f
change print to logger.debug
2022-03-17 14:20:41 +00:00