^_^
d28cb46061
skip unnecessary method calls ( #14900 )
...
* skip unnecessary method calls in PaddleOCR.ocr
pre-check meaningless args for PaddleOCR.ocr
* style: make CI happy
2025-03-20 10:36:10 +08:00
zhangyubo0722
e8399f267b
add cli rec model ( #14740 )
2025-03-06 17:05:51 +08:00
mauryaland
44e17bae0a
allow environment variable for BASE_DIR models ( #14678 )
...
* allow environment variable for BASE_DIR models
* add doc for environment variable PADDLE_OCR_BASE_DIR
* fix code style error
2025-02-14 21:58:00 +08:00
cuicheng01
a28196c002
update SLANet inference weights for adapt to paddle3.0b2 ( #14467 )
2024-12-30 18:45:49 +08:00
TongZ
d8a20b16ad
fix: replace `rec_image_shape` when manually set ( #14371 )
2024-12-16 08:14:37 +08:00
Wang Xin
1eeca9c72f
image without any text will show a warning ( #14132 )
2024-10-31 14:08:06 +08:00
ztyf
8728b47046
pdf to markdown document ( #13942 )
2024-10-07 09:25:21 +08:00
Wang Xin
cda3e1201e
fix hubserving run error ( #13918 )
...
* fix hubserving run error
* Update paddleocr.py
2024-09-27 15:58:49 +08:00
Kayzwer
78ddaa44a0
remove unused enumerate ( #13760 )
2024-08-28 09:10:33 +08:00
Wang Xin
c556b9083e
fix layout recovery import error ( #13434 )
2024-07-20 21:19:09 +08:00
jzhang533
603b3728ac
add missing docstring in paddleocr.py using copilot ( #13344 )
...
* add missing docstring in paddleocr.py using copilot
Signed-off-by: Zhang Jun <jzhang533@gmail.com>
* Update paddleocr.py
Co-authored-by: Wang Xin <xinwang614@gmail.com>
---------
Signed-off-by: Zhang Jun <jzhang533@gmail.com>
Co-authored-by: Wang Xin <xinwang614@gmail.com>
2024-07-13 10:32:55 +08:00
myhloli
26283901a7
Solve ModuleNotFoundError: No module named 'tools.infer' ( #13348 )
...
* add layout score return
* fix(paddleocr): correct import path for predict_system
to fix ModuleNotFoundError: No module named 'tools.infer'
* fix(paddleocr): move from tools.infer import predict_system under "tools" define to slove ModuleNotFoundError: No module named 'tools.infer'
2024-07-11 19:28:16 +08:00
Wang Xin
153de46b67
add url in pyproject, and update version number ( #13274 )
2024-07-06 21:20:59 +08:00
Wang Xin
b3954d38ba
fix wrong output when --det=False ( #12589 )
...
* fix wrong output when --det=False
* using pprint to format the output
* using pprint to format the output
2024-06-05 09:55:50 +08:00
aspaul20
965f569e81
added sliding window for large image inference ( #12152 )
...
added sliding window for large image inference
2024-05-24 16:16:37 +08:00
张春乔
3a66efc7bf
【OCR Issue No.12】Modify the setuptools configuration from SETUP.py into PYPROJECT.toml ( #12013 )
...
Modify the setuptools configuration from SETUP.py into PYPROJECT.toml
2024-05-24 11:45:15 +08:00
NOEXIST
58181962dc
layout recognition refinement onnx support ( #12068 )
...
* layout recognition refinement onnx support
* fix codestyle
2024-05-09 09:35:44 +08:00
张春乔
b5eedf727e
【OCR Issue No.9】移除明确不适合放在ppocr依赖中的依赖项 ( #11946 )
...
* modify requestions
* Update requirements.txt
* Update requirements.txt
* try import pdfconvert
* try import lxml
* try import lxml
* try import premailer
* try import openpyxl
* Apply suggestions from code review
2024-04-26 16:54:49 +08:00
S M
f7117efd44
Fix the bug where Python scripts fail to execute PDF text recognition… ( #11994 )
...
* Fix the bug where Python scripts fail to execute PDF text recognition tasks, optimize the logic of judging PDF files, and add cases to the quickstart document for layout analysis.
* Add two examples of PDF layout analysis to the quickstart file of ppstructure.
* Add a return comment for the check_img function
2024-04-25 16:52:09 +08:00
Wang Xin
045e5f6ac7
add pre-commit workflow ( #11973 )
...
* add pre-commit workflow
* run 'pre-commit run --all-files'
* setup python version
2024-04-21 21:46:20 +08:00
jzhang533
5e40f85ef3
setup a workflow for publishing package to pypi ( #11804 )
2024-03-27 10:41:55 +08:00
Empathy
d3e362a3a0
1、PPOCRLabel现在支持从中文路径导入图片,原本导入含中文路径的图片会导致崩溃。 ( #11236 )
...
2、PPOCRLabel现在支持移动被其他框覆盖的锚点,原本无法移动被覆盖的锚点。
3、修复utility.py中误输入字符导致的语法错误。
4、修复setValue()应输入int,实际输入float导致的类型错误。
5、修复paddleocr中未import predict_system的错误。
6、修复canvas.py中部分输入参数类型错误
7、修复了LabelList不兼容搜狗输入法或win11输入法的问题。原本使用搜狗输入法修改标注数据时,仅输入一个字母就会失去焦点并提交数据变更,导致无法输入完整的汉字。现在将处理逻辑改为失去焦点时仍不提交数据变更,直到切换item或按下enter键才提交。
8、新增扩大选框的功能
1、PPOCRLabel now supports importing images from Chinese paths, originally importing images containing Chinese paths would cause a crash.
2、PPOCRLabel now supports moving anchor points that are covered by other boxes, originally it could not move the covered anchor points.
3、Fix the syntax error caused by mistakenly inputting characters in utility.py.
4、Repair the type error caused by inputting int but float in setValue().
5、Repair the error of not import predict_system in paddleocr.
6、Fix some input parameter type errors in canvas.py.
7、LabelList can't use Sogou Input Method or Win11 Input Method to input text.
8、Add function of expand box.
2023-12-13 10:16:39 +08:00
ficodex
42d280907a
Fix ValueError in text detection due to ambiguous array evaluation ( #11287 )
...
This commit resolves the issue where `dt_boxes` was evaluated as a boolean
in Python 3.11, leading to a ValueError. The condition `if not dt_boxes:` is
replaced with `if dt_boxes.size == 0:` to explicitly check for an empty array,
ensuring compatibility with Python 3.11 and maintaining the intended functionality.
2023-11-22 20:10:32 +08:00
shiyutang
e3fc6393e0
[Cherry-pick] Cherry-pick from release/2.6 ( #11092 )
...
* Update recognition_en.md (#10059 )
ic15_dict.txt only have 36 digits
* Update ocr_rec.h (#9469 )
It is enough to include preprocess_op.h, we do not need to include ocr_cls.h.
* 补充num_classes注释说明 (#10073 )
ser_vi_layoutxlm_xfund_zh.yml中的Architecture.Backbone.num_classes所赋值会设置给Loss.num_classes,
由于采用BIO标注,假设字典中包含n个字段(包含other)时,则类别数为2n-1;假设字典中包含n个字段(不含other)时,则类别数为2n+1。
* Update algorithm_overview_en.md (#9747 )
Fix links to super-resolution algorithm docs
* 改进文档`deploy/hubserving/readme.md`和`doc/doc_ch/models_list.md` (#9110 )
* Update readme.md
* Update readme.md
* Update readme.md
* Update models_list.md
* trim trailling spaces @ `deploy/hubserving/readme_en.md`
* `s/shell/bash/` @ `deploy/hubserving/readme_en.md`
* Update `deploy/hubserving/readme_en.md` to sync with `deploy/hubserving/readme.md`
* Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md`
* Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md`
* Update `doc/doc_en/models_list_en.md` to sync with `doc/doc_ch/models_list_en.md`
* using Grammarly to weak `deploy/hubserving/readme_en.md`
* using Grammarly to tweak `doc/doc_en/models_list_en.md`
* `ocr_system` module will return with values of field `confidence`
* Update README_CN.md
* 修复测试服务中图片转Base64的引用地址错误。 (#8334 )
* Update application.md
* [Doc] Fix 404 link. (#10318 )
* Update PP-OCRv3_det_train.md
* Update knowledge_distillation.md
* Update config.md
* Fix fitz camelCase deprecation and .PDF not being recognized as pdf file (#10181 )
* Fix fitz camelCase deprecation and .PDF not being recognized as pdf file
* refactor get_image_file_list function
* Update customize.md (#10325 )
* Update FAQ.md (#10345 )
* Update FAQ.md (#10349 )
* Don't break overall processing on a bad image (#10216 )
* Add preprocessing common to OCR tasks (#10217 )
Add preprocessing to options
* [MLU] add mlu device for infer (#10249 )
* Create newfeature.md
* Update newfeature.md
* remove unused imported module, so can avoid PyInstaller packaged binary's start-time not found module error. (#10502 )
* CV套件建设专项活动 - 文字识别返回单字识别坐标 (#10515 )
* modification of return word box
* update_implements
* Update rec_postprocess.py
* Update utility.py
* Update README_ch.md
* revert README_ch.md update
* Fixed Layout recovery README file (#10493 )
Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one>
* update_doc
* bugfix
---------
Co-authored-by: ChuongLoc <89434232+ChuongLoc@users.noreply.github.com>
Co-authored-by: Wang Xin <xinwang614@gmail.com>
Co-authored-by: tanjh <dtdhinjapan@gmail.com>
Co-authored-by: Louis Maddox <lmmx@users.noreply.github.com>
Co-authored-by: n0099 <n@n0099.net>
Co-authored-by: zhenliang li <37922155+shouyong@users.noreply.github.com>
Co-authored-by: itasli <ilyas.tasli@outlook.fr>
Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com>
Co-authored-by: PeiyuLau <135964669+PeiyuLau@users.noreply.github.com>
Co-authored-by: kerneltravel <kjpioo2006@gmail.com>
Co-authored-by: ToddBear <43341135+ToddBear@users.noreply.github.com>
Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>
Co-authored-by: Shubham Chambhare <59397280+Shubham654@users.noreply.github.com>
Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one>
Co-authored-by: andyj <87074272+andyjpaddle@users.noreply.github.com>
2023-10-18 17:37:23 +08:00
Gmgge
2fdac319b1
fix:修复通道数不匹配造成的PPOCRLabel启动失败问题#10748,根据更新日志发现#10655,由于paddleocr中增加了对透明色的需求,由于在check_img中以及存在对单通道图像转三通道图像的处理,因此将该四通道图像处理成三通道的图像也放入该check_img,并统一三通道图像输出逻辑。 ( #10847 )
...
add:由于函数复杂度增高,增加注释。
2023-09-21 14:51:32 +08:00
ToddBear
75d16610f4
Add new recognition method "ParseQ" ( #10836 )
...
* Update PP-OCRv4_introduction.md
* Update PP-OCRv4_introduction.md (#10616 )
* Update PP-OCRv4_introduction.md
* Update PP-OCRv4_introduction.md
* Update PP-OCRv4_introduction.md
* Update README.md
* Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:Release/2.7 (#10655 )
* Don't break overall processing on a bad image
* Add preprocessing common to OCR tasks
Add preprocessing to options
* Update requirements.txt (#10656 )
added missing pyyaml library
* [TIPC]update xpu tipc script (#10658 )
* fix-typo (#10642 )
Co-authored-by: Dennis <dvorst@users.noreply.github.com>
Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com>
* 修改数据增强导致的DSR报错 (#10662 ) (#10681 )
* 修改数据增强导致的DSR报错
* 错误修改回滚
* Update algorithm_overview_en.md (#10670 )
Fixed simple spelling errors.
* Implement recoginition method ParseQ
* Document update for new recognition method ParseQ
* add prediction for parseq
* Update rec_vit_parseq.yml
* Update rec_r31_sar.yml
* Update rec_r31_sar.yml
* Update rec_r50_fpn_srn.yml
* Update rec_vit_parseq.py
* Update rec_vit_parseq.yml
* Update rec_parseq_head.py
* Update rec_img_aug.py
* Update rec_vit_parseq.yml
* Update __init__.py
* Update predict_rec.py
* Update paddleocr.py
* Update requirements.txt
* Update utility.py
* Update utility.py
---------
Co-authored-by: xiaoting <31891223+tink2123@users.noreply.github.com>
Co-authored-by: topduke <784990967@qq.com>
Co-authored-by: dyning <dyning.2003@163.com>
Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com>
Co-authored-by: itasli <ilyas.tasli@outlook.fr>
Co-authored-by: Kai Song <50285351+USTCKAY@users.noreply.github.com>
Co-authored-by: dvorst <87502756+dvorst@users.noreply.github.com>
Co-authored-by: Dennis <dvorst@users.noreply.github.com>
Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com>
Co-authored-by: Dec20B <1192152456@qq.com>
Co-authored-by: ncoffman <51147417+ncoffman@users.noreply.github.com>
2023-09-07 16:36:47 +08:00
Gmgge
3a6fe48afd
根据推理对三通道的图像需求,以及opencv中imread参数说明IMREAD_COLOR(If set, always convert image to the 3 channel BGR color image.),因此修改该读取参数,以解决后续通道不匹配问题。 ( #10777 )
2023-08-31 14:31:28 +08:00
UserUnknownFactor
b3912fcf7a
Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:dygraph ( #10654 )
...
* Don't break overall processing on a bad image
* Add preprocessing common to OCR tasks
Add preprocessing to options
2023-08-21 16:33:03 +08:00
WilliamQf
2bd552c801
实现功能:当--savefile为true时,在--output下以当前图片名称后接“.txt”为文件名保存ocr推理结果,解决了issues: ( #10628 )
...
* 实现功能:当--savefile为true时,在--output下以当前图片名称后接“.txt”为文件名保存ocr推理结果,解决了issues:
https://github.com/PaddlePaddle/PaddleOCR/issues/10533
* 删除了乱码字符
* 1.删除了乱码字符
* 1.删除了乱码字符
2023-08-21 10:25:47 +08:00
andyj
681467d4ea
[bug fix] fix none res in recovery ( #10603 )
...
* add finetune en doc & test=document_fix
* fix dead link & test=document_fix
* fix dead link & test=document_fix
* update check img
* fix det res dtype
* update args default type & test=document_fix
* fix numpy version
* support numpy1.24.0
* fix doc & test=document_fix
* update doc
* update doc, test=document_fix
* fix pdf2word in whl, test=document_fix
* fix none res in recovery
* update version
* format code
2023-08-10 16:55:26 +08:00
andyj
254786752a
[BUG FIX] Fix pdf2word in whl ( #10584 )
...
* add finetune en doc & test=document_fix
* fix dead link & test=document_fix
* fix dead link & test=document_fix
* update check img
* fix det res dtype
* update args default type & test=document_fix
* fix numpy version
* support numpy1.24.0
* fix doc & test=document_fix
* update doc
* update doc, test=document_fix
* fix pdf2word in whl, test=document_fix
2023-08-09 19:15:49 +08:00
xiaoting
2f70e4b7f6
upload paddleocr whl to pypi ( #10524 )
...
* upload paddleocr whl to pypi
* Update README_ch.md
* Update README_ch.md
* Update quickstart.md
* Update README_ch.md
* Update README_ch.md
2023-08-06 11:17:13 +08:00
Bryan YW
1556922669
Update paddleocr.py ( #10066 )
...
cherry-pick package compatibility PR
2023-07-18 11:36:49 +08:00
andyj
83beede6b5
Update check img ( #8558 )
...
* add finetune en doc & test=document_fix
* fix dead link & test=document_fix
* fix dead link & test=document_fix
* update check img
2022-12-07 09:51:54 +08:00
zhoujun
59b3eade31
Merge pull request #8066 from WenmuZhou/doc2
...
update PP-Structurev to PP-StructureV
2022-10-25 14:20:10 +08:00
WenmuZhou
cad701d411
fix benckmark error when benckmark=false
2022-10-24 17:10:05 +08:00
WenmuZhou
b2bc6b746c
update doc
2022-10-24 05:36:34 +00:00
WenmuZhou
2145d8c4ec
add recovery requirements to whl
2022-10-20 17:03:47 +08:00
an1018
99698aed54
add_pdf2docx_api
2022-10-12 21:28:48 +08:00
andyjpaddle
b947dc3fb3
fix whl for layout recovery
2022-10-11 08:42:47 +00:00
WenmuZhou
c647a6da28
add re to ppstructure system
2022-10-10 05:39:41 +00:00
andyj
059349ab74
Merge pull request #7834 from andyjpaddle/update_whl
...
update doc for whl
2022-10-08 19:21:53 +08:00
andyjpaddle
a5df6c34e0
update doc for whl
2022-10-08 08:28:30 +00:00
Xie Xuyang
f699d787ff
fix #7534 ( #7535 )
2022-09-29 11:19:44 +08:00
vivien
f9baf0eabc
update bytes support
2022-08-31 19:59:29 +08:00
Leif
975975d30e
Update PPOCRLabel
...
Update PPOCRLabel
2022-08-25 16:32:44 +08:00
an1018
8756253800
update doc
2022-08-24 10:07:30 +08:00
an1018
2976dab987
Merge branch 'dygraph' of https://github.com/PaddlePaddle/PaddleOCR into add_layout_hub
2022-08-24 09:04:59 +08:00
an1018
9c424ff164
update doc
2022-08-23 23:28:49 +08:00
an1018
d5d78b486b
update doc
2022-08-23 16:11:18 +08:00