Empathy
d3e362a3a0
1、PPOCRLabel现在支持从中文路径导入图片,原本导入含中文路径的图片会导致崩溃。 ( #11236 )
...
2、PPOCRLabel现在支持移动被其他框覆盖的锚点,原本无法移动被覆盖的锚点。
3、修复utility.py中误输入字符导致的语法错误。
4、修复setValue()应输入int,实际输入float导致的类型错误。
5、修复paddleocr中未import predict_system的错误。
6、修复canvas.py中部分输入参数类型错误
7、修复了LabelList不兼容搜狗输入法或win11输入法的问题。原本使用搜狗输入法修改标注数据时,仅输入一个字母就会失去焦点并提交数据变更,导致无法输入完整的汉字。现在将处理逻辑改为失去焦点时仍不提交数据变更,直到切换item或按下enter键才提交。
8、新增扩大选框的功能
1、PPOCRLabel now supports importing images from Chinese paths, originally importing images containing Chinese paths would cause a crash.
2、PPOCRLabel now supports moving anchor points that are covered by other boxes, originally it could not move the covered anchor points.
3、Fix the syntax error caused by mistakenly inputting characters in utility.py.
4、Repair the type error caused by inputting int but float in setValue().
5、Repair the error of not import predict_system in paddleocr.
6、Fix some input parameter type errors in canvas.py.
7、LabelList can't use Sogou Input Method or Win11 Input Method to input text.
8、Add function of expand box.
2023-12-13 10:16:39 +08:00
ficodex
42d280907a
Fix ValueError in text detection due to ambiguous array evaluation ( #11287 )
...
This commit resolves the issue where `dt_boxes` was evaluated as a boolean
in Python 3.11, leading to a ValueError. The condition `if not dt_boxes:` is
replaced with `if dt_boxes.size == 0:` to explicitly check for an empty array,
ensuring compatibility with Python 3.11 and maintaining the intended functionality.
2023-11-22 20:10:32 +08:00
shiyutang
e3fc6393e0
[Cherry-pick] Cherry-pick from release/2.6 ( #11092 )
...
* Update recognition_en.md (#10059 )
ic15_dict.txt only have 36 digits
* Update ocr_rec.h (#9469 )
It is enough to include preprocess_op.h, we do not need to include ocr_cls.h.
* 补充num_classes注释说明 (#10073 )
ser_vi_layoutxlm_xfund_zh.yml中的Architecture.Backbone.num_classes所赋值会设置给Loss.num_classes,
由于采用BIO标注,假设字典中包含n个字段(包含other)时,则类别数为2n-1;假设字典中包含n个字段(不含other)时,则类别数为2n+1。
* Update algorithm_overview_en.md (#9747 )
Fix links to super-resolution algorithm docs
* 改进文档`deploy/hubserving/readme.md`和`doc/doc_ch/models_list.md` (#9110 )
* Update readme.md
* Update readme.md
* Update readme.md
* Update models_list.md
* trim trailling spaces @ `deploy/hubserving/readme_en.md`
* `s/shell/bash/` @ `deploy/hubserving/readme_en.md`
* Update `deploy/hubserving/readme_en.md` to sync with `deploy/hubserving/readme.md`
* Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md`
* Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md`
* Update `doc/doc_en/models_list_en.md` to sync with `doc/doc_ch/models_list_en.md`
* using Grammarly to weak `deploy/hubserving/readme_en.md`
* using Grammarly to tweak `doc/doc_en/models_list_en.md`
* `ocr_system` module will return with values of field `confidence`
* Update README_CN.md
* 修复测试服务中图片转Base64的引用地址错误。 (#8334 )
* Update application.md
* [Doc] Fix 404 link. (#10318 )
* Update PP-OCRv3_det_train.md
* Update knowledge_distillation.md
* Update config.md
* Fix fitz camelCase deprecation and .PDF not being recognized as pdf file (#10181 )
* Fix fitz camelCase deprecation and .PDF not being recognized as pdf file
* refactor get_image_file_list function
* Update customize.md (#10325 )
* Update FAQ.md (#10345 )
* Update FAQ.md (#10349 )
* Don't break overall processing on a bad image (#10216 )
* Add preprocessing common to OCR tasks (#10217 )
Add preprocessing to options
* [MLU] add mlu device for infer (#10249 )
* Create newfeature.md
* Update newfeature.md
* remove unused imported module, so can avoid PyInstaller packaged binary's start-time not found module error. (#10502 )
* CV套件建设专项活动 - 文字识别返回单字识别坐标 (#10515 )
* modification of return word box
* update_implements
* Update rec_postprocess.py
* Update utility.py
* Update README_ch.md
* revert README_ch.md update
* Fixed Layout recovery README file (#10493 )
Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one>
* update_doc
* bugfix
---------
Co-authored-by: ChuongLoc <89434232+ChuongLoc@users.noreply.github.com>
Co-authored-by: Wang Xin <xinwang614@gmail.com>
Co-authored-by: tanjh <dtdhinjapan@gmail.com>
Co-authored-by: Louis Maddox <lmmx@users.noreply.github.com>
Co-authored-by: n0099 <n@n0099.net>
Co-authored-by: zhenliang li <37922155+shouyong@users.noreply.github.com>
Co-authored-by: itasli <ilyas.tasli@outlook.fr>
Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com>
Co-authored-by: PeiyuLau <135964669+PeiyuLau@users.noreply.github.com>
Co-authored-by: kerneltravel <kjpioo2006@gmail.com>
Co-authored-by: ToddBear <43341135+ToddBear@users.noreply.github.com>
Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>
Co-authored-by: Shubham Chambhare <59397280+Shubham654@users.noreply.github.com>
Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one>
Co-authored-by: andyj <87074272+andyjpaddle@users.noreply.github.com>
2023-10-18 17:37:23 +08:00
Gmgge
2fdac319b1
fix:修复通道数不匹配造成的PPOCRLabel启动失败问题#10748,根据更新日志发现#10655,由于paddleocr中增加了对透明色的需求,由于在check_img中以及存在对单通道图像转三通道图像的处理,因此将该四通道图像处理成三通道的图像也放入该check_img,并统一三通道图像输出逻辑。 ( #10847 )
...
add:由于函数复杂度增高,增加注释。
2023-09-21 14:51:32 +08:00
ToddBear
75d16610f4
Add new recognition method "ParseQ" ( #10836 )
...
* Update PP-OCRv4_introduction.md
* Update PP-OCRv4_introduction.md (#10616 )
* Update PP-OCRv4_introduction.md
* Update PP-OCRv4_introduction.md
* Update PP-OCRv4_introduction.md
* Update README.md
* Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:Release/2.7 (#10655 )
* Don't break overall processing on a bad image
* Add preprocessing common to OCR tasks
Add preprocessing to options
* Update requirements.txt (#10656 )
added missing pyyaml library
* [TIPC]update xpu tipc script (#10658 )
* fix-typo (#10642 )
Co-authored-by: Dennis <dvorst@users.noreply.github.com>
Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com>
* 修改数据增强导致的DSR报错 (#10662 ) (#10681 )
* 修改数据增强导致的DSR报错
* 错误修改回滚
* Update algorithm_overview_en.md (#10670 )
Fixed simple spelling errors.
* Implement recoginition method ParseQ
* Document update for new recognition method ParseQ
* add prediction for parseq
* Update rec_vit_parseq.yml
* Update rec_r31_sar.yml
* Update rec_r31_sar.yml
* Update rec_r50_fpn_srn.yml
* Update rec_vit_parseq.py
* Update rec_vit_parseq.yml
* Update rec_parseq_head.py
* Update rec_img_aug.py
* Update rec_vit_parseq.yml
* Update __init__.py
* Update predict_rec.py
* Update paddleocr.py
* Update requirements.txt
* Update utility.py
* Update utility.py
---------
Co-authored-by: xiaoting <31891223+tink2123@users.noreply.github.com>
Co-authored-by: topduke <784990967@qq.com>
Co-authored-by: dyning <dyning.2003@163.com>
Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com>
Co-authored-by: itasli <ilyas.tasli@outlook.fr>
Co-authored-by: Kai Song <50285351+USTCKAY@users.noreply.github.com>
Co-authored-by: dvorst <87502756+dvorst@users.noreply.github.com>
Co-authored-by: Dennis <dvorst@users.noreply.github.com>
Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com>
Co-authored-by: Dec20B <1192152456@qq.com>
Co-authored-by: ncoffman <51147417+ncoffman@users.noreply.github.com>
2023-09-07 16:36:47 +08:00
Gmgge
3a6fe48afd
根据推理对三通道的图像需求,以及opencv中imread参数说明IMREAD_COLOR(If set, always convert image to the 3 channel BGR color image.),因此修改该读取参数,以解决后续通道不匹配问题。 ( #10777 )
2023-08-31 14:31:28 +08:00
UserUnknownFactor
b3912fcf7a
Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:dygraph ( #10654 )
...
* Don't break overall processing on a bad image
* Add preprocessing common to OCR tasks
Add preprocessing to options
2023-08-21 16:33:03 +08:00
WilliamQf
2bd552c801
实现功能:当--savefile为true时,在--output下以当前图片名称后接“.txt”为文件名保存ocr推理结果,解决了issues: ( #10628 )
...
* 实现功能:当--savefile为true时,在--output下以当前图片名称后接“.txt”为文件名保存ocr推理结果,解决了issues:
https://github.com/PaddlePaddle/PaddleOCR/issues/10533
* 删除了乱码字符
* 1.删除了乱码字符
* 1.删除了乱码字符
2023-08-21 10:25:47 +08:00
andyj
681467d4ea
[bug fix] fix none res in recovery ( #10603 )
...
* add finetune en doc & test=document_fix
* fix dead link & test=document_fix
* fix dead link & test=document_fix
* update check img
* fix det res dtype
* update args default type & test=document_fix
* fix numpy version
* support numpy1.24.0
* fix doc & test=document_fix
* update doc
* update doc, test=document_fix
* fix pdf2word in whl, test=document_fix
* fix none res in recovery
* update version
* format code
2023-08-10 16:55:26 +08:00
andyj
254786752a
[BUG FIX] Fix pdf2word in whl ( #10584 )
...
* add finetune en doc & test=document_fix
* fix dead link & test=document_fix
* fix dead link & test=document_fix
* update check img
* fix det res dtype
* update args default type & test=document_fix
* fix numpy version
* support numpy1.24.0
* fix doc & test=document_fix
* update doc
* update doc, test=document_fix
* fix pdf2word in whl, test=document_fix
2023-08-09 19:15:49 +08:00
xiaoting
2f70e4b7f6
upload paddleocr whl to pypi ( #10524 )
...
* upload paddleocr whl to pypi
* Update README_ch.md
* Update README_ch.md
* Update quickstart.md
* Update README_ch.md
* Update README_ch.md
2023-08-06 11:17:13 +08:00
Bryan YW
1556922669
Update paddleocr.py ( #10066 )
...
cherry-pick package compatibility PR
2023-07-18 11:36:49 +08:00
andyj
83beede6b5
Update check img ( #8558 )
...
* add finetune en doc & test=document_fix
* fix dead link & test=document_fix
* fix dead link & test=document_fix
* update check img
2022-12-07 09:51:54 +08:00
zhoujun
59b3eade31
Merge pull request #8066 from WenmuZhou/doc2
...
update PP-Structurev to PP-StructureV
2022-10-25 14:20:10 +08:00
WenmuZhou
cad701d411
fix benckmark error when benckmark=false
2022-10-24 17:10:05 +08:00
WenmuZhou
b2bc6b746c
update doc
2022-10-24 05:36:34 +00:00
WenmuZhou
2145d8c4ec
add recovery requirements to whl
2022-10-20 17:03:47 +08:00
an1018
99698aed54
add_pdf2docx_api
2022-10-12 21:28:48 +08:00
andyjpaddle
b947dc3fb3
fix whl for layout recovery
2022-10-11 08:42:47 +00:00
WenmuZhou
c647a6da28
add re to ppstructure system
2022-10-10 05:39:41 +00:00
andyj
059349ab74
Merge pull request #7834 from andyjpaddle/update_whl
...
update doc for whl
2022-10-08 19:21:53 +08:00
andyjpaddle
a5df6c34e0
update doc for whl
2022-10-08 08:28:30 +00:00
Xie Xuyang
f699d787ff
fix #7534 ( #7535 )
2022-09-29 11:19:44 +08:00
vivien
f9baf0eabc
update bytes support
2022-08-31 19:59:29 +08:00
Leif
975975d30e
Update PPOCRLabel
...
Update PPOCRLabel
2022-08-25 16:32:44 +08:00
an1018
8756253800
update doc
2022-08-24 10:07:30 +08:00
an1018
2976dab987
Merge branch 'dygraph' of https://github.com/PaddlePaddle/PaddleOCR into add_layout_hub
2022-08-24 09:04:59 +08:00
an1018
9c424ff164
update doc
2022-08-23 23:28:49 +08:00
an1018
d5d78b486b
update doc
2022-08-23 16:11:18 +08:00
WenmuZhou
f7b35bbbc4
do not print table result
2022-08-22 08:00:49 +00:00
littletomatodonkey
8c7c45420f
polish kie doc and code ( #7255 )
...
* add fapiao kie
* fix readme
* fix fanli
* add readme
* add how to do kie en
* add algo kie
* add algo overview en
* rename vqa to kie
* fix read gif
2022-08-21 10:55:49 +08:00
zhoujun
aec5dbf8f7
update model size ( #7269 )
...
* update model size
* update layout dict in whl
2022-08-20 16:10:45 +08:00
WenmuZhou
604f531b9a
add layout model
2022-08-17 04:40:07 +00:00
WenmuZhou
b26ce23774
rm unused code
2022-08-16 10:55:24 +00:00
WenmuZhou
bb53c8d100
add table model link
2022-08-16 10:46:09 +00:00
WenmuZhou
731688c2dd
add PP-Structurev2 to whl
2022-08-10 14:51:57 +00:00
andyjpaddle
e2b2f33092
update some note
2022-05-30 09:25:21 +00:00
WenmuZhou
f31980266a
merge upstream
2022-05-10 14:28:02 +08:00
andyjpaddle
565abdb632
update version
2022-05-10 02:46:13 +00:00
andyjpaddle
b16c3d4234
update v3 rec name
2022-05-10 02:41:54 +00:00
andyjpaddle
4d406f8825
update whl for rec image shape
2022-05-07 09:06:13 +00:00
andyjpaddle
cedf8b5669
update whl for multi lang
2022-05-07 06:05:47 +00:00
andyjpaddle
bd147c61a9
update whl for multi lang rec
2022-05-07 03:52:06 +00:00
andyjpaddle
738ff190dd
update doc for whl and python cpp infer quick
2022-04-29 03:38:59 +00:00
WenmuZhou
bbff7c55c1
The whl package supports separate table recognition and layout analysis
2022-04-22 13:24:45 +08:00
tink2123
3a28266735
update paddleocr version no
2022-04-02 09:36:48 +00:00
WenmuZhou
b3fbfc3109
rename OCRSystem to StructureSystem
2022-03-30 09:24:24 +00:00
WenmuZhou
463622b1f9
rm unused code
2022-03-23 14:26:40 +00:00
WenmuZhou
e693619888
add relative import
2022-03-23 14:22:06 +00:00
WenmuZhou
f85b3ea8eb
fix bug in whl import fce
2022-03-18 10:08:58 +00:00