PaddleOCR

Commit Graph

Author	SHA1	Message	Date
^_^	d28cb46061	skip unnecessary method calls (#14900 ) * skip unnecessary method calls in PaddleOCR.ocr pre-check meaningless args for PaddleOCR.ocr * style: make CI happy	2025-03-20 10:36:10 +08:00
zhangyubo0722	e8399f267b	add cli rec model (#14740 )	2025-03-06 17:05:51 +08:00
mauryaland	44e17bae0a	allow environment variable for BASE_DIR models (#14678 ) * allow environment variable for BASE_DIR models * add doc for environment variable PADDLE_OCR_BASE_DIR * fix code style error	2025-02-14 21:58:00 +08:00
cuicheng01	a28196c002	update SLANet inference weights for adapt to paddle3.0b2 (#14467 )	2024-12-30 18:45:49 +08:00
TongZ	d8a20b16ad	fix: replace `rec_image_shape` when manually set (#14371 )	2024-12-16 08:14:37 +08:00
Wang Xin	1eeca9c72f	image without any text will show a warning (#14132 )	2024-10-31 14:08:06 +08:00
ztyf	8728b47046	pdf to markdown document (#13942 )	2024-10-07 09:25:21 +08:00
Wang Xin	cda3e1201e	fix hubserving run error (#13918 ) * fix hubserving run error * Update paddleocr.py	2024-09-27 15:58:49 +08:00
Kayzwer	78ddaa44a0	remove unused enumerate (#13760 )	2024-08-28 09:10:33 +08:00
Wang Xin	c556b9083e	fix layout recovery import error (#13434 )	2024-07-20 21:19:09 +08:00
jzhang533	603b3728ac	add missing docstring in paddleocr.py using copilot (#13344 ) * add missing docstring in paddleocr.py using copilot Signed-off-by: Zhang Jun <jzhang533@gmail.com> * Update paddleocr.py Co-authored-by: Wang Xin <xinwang614@gmail.com> --------- Signed-off-by: Zhang Jun <jzhang533@gmail.com> Co-authored-by: Wang Xin <xinwang614@gmail.com>	2024-07-13 10:32:55 +08:00
myhloli	26283901a7	Solve ModuleNotFoundError: No module named 'tools.infer' (#13348 ) * add layout score return * fix(paddleocr): correct import path for predict_system to fix ModuleNotFoundError: No module named 'tools.infer' * fix(paddleocr): move from tools.infer import predict_system under "tools" define to slove ModuleNotFoundError: No module named 'tools.infer'	2024-07-11 19:28:16 +08:00
Wang Xin	153de46b67	add url in pyproject, and update version number (#13274 )	2024-07-06 21:20:59 +08:00
Wang Xin	b3954d38ba	fix wrong output when --det=False (#12589 ) * fix wrong output when --det=False * using pprint to format the output * using pprint to format the output	2024-06-05 09:55:50 +08:00
aspaul20	965f569e81	added sliding window for large image inference (#12152 ) added sliding window for large image inference	2024-05-24 16:16:37 +08:00
张春乔	3a66efc7bf	【OCR Issue No.12】Modify the setuptools configuration from SETUP.py into PYPROJECT.toml (#12013 ) Modify the setuptools configuration from SETUP.py into PYPROJECT.toml	2024-05-24 11:45:15 +08:00
NOEXIST	58181962dc	layout recognition refinement onnx support (#12068 ) * layout recognition refinement onnx support * fix codestyle	2024-05-09 09:35:44 +08:00
张春乔	b5eedf727e	【OCR Issue No.9】Remove dependencies that clearly do not belong in the ppocr dependencies (#11946 ) * modify requestions * Update requirements.txt * Update requirements.txt * try import pdfconvert * try import lxml * try import lxml * try import premailer * try import openpyxl * Apply suggestions from code review	2024-04-26 16:54:49 +08:00
S M	f7117efd44	Fix the bug where Python scripts fail to execute PDF text recognition… (#11994 ) * Fix the bug where Python scripts fail to execute PDF text recognition tasks, optimize the logic of judging PDF files, and add cases to the quickstart document for layout analysis. * Add two examples of PDF layout analysis to the quickstart file of ppstructure. * Add a return comment for the check_img function	2024-04-25 16:52:09 +08:00
Wang Xin	045e5f6ac7	add pre-commit workflow (#11973 ) * add pre-commit workflow * run 'pre-commit run --all-files' * setup python version	2024-04-21 21:46:20 +08:00
jzhang533	5e40f85ef3	setup a workflow for publishing package to pypi (#11804 )	2024-03-27 10:41:55 +08:00
Empathy	d3e362a3a0	1. PPOCRLabel now supports importing images from Chinese paths; previously, importing an image whose path contained Chinese characters caused a crash. (#11236 ) 2. PPOCRLabel now supports moving anchor points that are covered by other boxes; previously, covered anchor points could not be moved. 3. Fix the syntax error caused by mistyped characters in utility.py. 4. Fix the type error caused by passing a float to setValue(), which expects an int. 5. Fix the missing import of predict_system in paddleocr. 6. Fix some input parameter type errors in canvas.py. 7. Fix LabelList's incompatibility with the Sogou and Win11 input methods. Previously, when editing annotation data with the Sogou input method, typing a single letter caused a loss of focus and committed the change, so a complete Chinese character could not be entered. Changes are now not committed on loss of focus, only when switching items or pressing Enter. 8. Add a feature to expand the selection box.	2023-12-13 10:16:39 +08:00
ficodex	42d280907a	Fix ValueError in text detection due to ambiguous array evaluation (#11287 ) This commit resolves the issue where `dt_boxes` was evaluated as a boolean in Python 3.11, leading to a ValueError. The condition `if not dt_boxes:` is replaced with `if dt_boxes.size == 0:` to explicitly check for an empty array, ensuring compatibility with Python 3.11 and maintaining the intended functionality.	2023-11-22 20:10:32 +08:00
shiyutang	e3fc6393e0	[Cherry-pick] Cherry-pick from release/2.6 (#11092 ) * Update recognition_en.md (#10059) ic15_dict.txt only have 36 digits * Update ocr_rec.h (#9469) It is enough to include preprocess_op.h, we do not need to include ocr_cls.h. * Add explanatory comment for num_classes (#10073) The value assigned to Architecture.Backbone.num_classes in ser_vi_layoutxlm_xfund_zh.yml is also set on Loss.num_classes. Because BIO tagging is used, if the dictionary contains n fields (including other), the number of classes is 2n-1; if the dictionary contains n fields (excluding other), the number of classes is 2n+1. * Update algorithm_overview_en.md (#9747) Fix links to super-resolution algorithm docs * Improve the docs `deploy/hubserving/readme.md` and `doc/doc_ch/models_list.md` (#9110) * Update readme.md * Update readme.md * Update readme.md * Update models_list.md * trim trailling spaces @ `deploy/hubserving/readme_en.md` * `s/shell/bash/` @ `deploy/hubserving/readme_en.md` * Update `deploy/hubserving/readme_en.md` to sync with `deploy/hubserving/readme.md` * Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md` * Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md` * Update `doc/doc_en/models_list_en.md` to sync with `doc/doc_ch/models_list_en.md` * using Grammarly to weak `deploy/hubserving/readme_en.md` * using Grammarly to tweak `doc/doc_en/models_list_en.md` * `ocr_system` module will return with values of field `confidence` * Update README_CN.md * Fix the incorrect reference path for image-to-Base64 conversion in the test service. (#8334) * Update application.md * [Doc] Fix 404 link. (#10318) * Update PP-OCRv3_det_train.md * Update knowledge_distillation.md * Update config.md * Fix fitz camelCase deprecation and .PDF not being recognized as pdf file (#10181) * Fix fitz camelCase deprecation and .PDF not being recognized as pdf file * refactor get_image_file_list function * Update customize.md (#10325) * Update FAQ.md (#10345) * Update FAQ.md (#10349) * Don't break overall processing on a bad image (#10216) * Add preprocessing common to OCR tasks (#10217) Add preprocessing to options * [MLU] add mlu device for infer (#10249) * Create newfeature.md * Update newfeature.md * remove unused imported module, so can avoid PyInstaller packaged binary's start-time not found module error. (#10502) * CV suite development special campaign - text recognition returns single-character recognition coordinates (#10515) * modification of return word box * update_implements * Update rec_postprocess.py * Update utility.py * Update README_ch.md * revert README_ch.md update * Fixed Layout recovery README file (#10493) Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one> * update_doc * bugfix --------- Co-authored-by: ChuongLoc <89434232+ChuongLoc@users.noreply.github.com> Co-authored-by: Wang Xin <xinwang614@gmail.com> Co-authored-by: tanjh <dtdhinjapan@gmail.com> Co-authored-by: Louis Maddox <lmmx@users.noreply.github.com> Co-authored-by: n0099 <n@n0099.net> Co-authored-by: zhenliang li <37922155+shouyong@users.noreply.github.com> Co-authored-by: itasli <ilyas.tasli@outlook.fr> Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com> Co-authored-by: PeiyuLau <135964669+PeiyuLau@users.noreply.github.com> Co-authored-by: kerneltravel <kjpioo2006@gmail.com> Co-authored-by: ToddBear <43341135+ToddBear@users.noreply.github.com> Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: Shubham Chambhare <59397280+Shubham654@users.noreply.github.com> Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one> Co-authored-by: andyj <87074272+andyjpaddle@users.noreply.github.com>	2023-10-18 17:37:23 +08:00
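Gmgge	2fdac319b1	fix: resolve the PPOCRLabel startup failure caused by a channel-count mismatch (#10748). The changelog points to #10655, which added handling for transparency in paddleocr. Because check_img already converts single-channel images to three channels, the conversion of four-channel images to three channels is also placed in check_img, and the three-channel output logic is unified. (#10847 ) add: comments, since the function has become more complex.	2023-09-21 14:51:32 +08:00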
ToddBear	75d16610f4	Add new recognition method "ParseQ" (#10836 ) * Update PP-OCRv4_introduction.md * Update PP-OCRv4_introduction.md (#10616) * Update PP-OCRv4_introduction.md * Update PP-OCRv4_introduction.md * Update PP-OCRv4_introduction.md * Update README.md * Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:Release/2.7 (#10655) * Don't break overall processing on a bad image * Add preprocessing common to OCR tasks Add preprocessing to options * Update requirements.txt (#10656) added missing pyyaml library * [TIPC]update xpu tipc script (#10658) * fix-typo (#10642) Co-authored-by: Dennis <dvorst@users.noreply.github.com> Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com> * Fix the DSR error caused by data augmentation (#10662) (#10681) * Fix the DSR error caused by data augmentation * Roll back the erroneous change * Update algorithm_overview_en.md (#10670) Fixed simple spelling errors. * Implement recoginition method ParseQ * Document update for new recognition method ParseQ * add prediction for parseq * Update rec_vit_parseq.yml * Update rec_r31_sar.yml * Update rec_r31_sar.yml * Update rec_r50_fpn_srn.yml * Update rec_vit_parseq.py * Update rec_vit_parseq.yml * Update rec_parseq_head.py * Update rec_img_aug.py * Update rec_vit_parseq.yml * Update __init__.py * Update predict_rec.py * Update paddleocr.py * Update requirements.txt * Update utility.py * Update utility.py --------- Co-authored-by: xiaoting <31891223+tink2123@users.noreply.github.com> Co-authored-by: topduke <784990967@qq.com> Co-authored-by: dyning <dyning.2003@163.com> Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com> Co-authored-by: itasli <ilyas.tasli@outlook.fr> Co-authored-by: Kai Song <50285351+USTCKAY@users.noreply.github.com> Co-authored-by: dvorst <87502756+dvorst@users.noreply.github.com> Co-authored-by: Dennis <dvorst@users.noreply.github.com> Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com> Co-authored-by: Dec20B <1192152456@qq.com> Co-authored-by: ncoffman <51147417+ncoffman@users.noreply.github.com>	2023-09-07 16:36:47 +08:00
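Gmgge	3a6fe48afd	Inference requires three-channel images, and OpenCV documents the imread flag IMREAD_COLOR as "If set, always convert image to the 3 channel BGR color image."; the read flag is therefore changed to resolve the downstream channel mismatch. (#10777 )	2023-08-31 14:31:28 +08:00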
UserUnknownFactor	b3912fcf7a	Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:dygraph (#10654 ) * Don't break overall processing on a bad image * Add preprocessing common to OCR tasks Add preprocessing to options	2023-08-21 16:33:03 +08:00
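WilliamQf	2bd552c801	Implement feature: when --savefile is true, save the OCR inference result under --output in a file named after the current image with ".txt" appended; resolves issues: (#10628 ) * Implement feature: when --savefile is true, save the OCR inference result under --output in a file named after the current image with ".txt" appended; resolves issues: https://github.com/PaddlePaddle/PaddleOCR/issues/10533 * Removed garbled characters * 1. Removed garbled characters * 1. Removed garbled characters	2023-08-21 10:25:47 +08:00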
andyj	681467d4ea	[bug fix] fix none res in recovery (#10603 ) * add finetune en doc & test=document_fix * fix dead link & test=document_fix * fix dead link & test=document_fix * update check img * fix det res dtype * update args default type & test=document_fix * fix numpy version * support numpy1.24.0 * fix doc & test=document_fix * update doc * update doc, test=document_fix * fix pdf2word in whl, test=document_fix * fix none res in recovery * update version * format code	2023-08-10 16:55:26 +08:00
andyj	254786752a	[BUG FIX] Fix pdf2word in whl (#10584 ) * add finetune en doc & test=document_fix * fix dead link & test=document_fix * fix dead link & test=document_fix * update check img * fix det res dtype * update args default type & test=document_fix * fix numpy version * support numpy1.24.0 * fix doc & test=document_fix * update doc * update doc, test=document_fix * fix pdf2word in whl, test=document_fix	2023-08-09 19:15:49 +08:00
xiaoting	2f70e4b7f6	upload paddleocr whl to pypi (#10524 ) * upload paddleocr whl to pypi * Update README_ch.md * Update README_ch.md * Update quickstart.md * Update README_ch.md * Update README_ch.md	2023-08-06 11:17:13 +08:00
Bryan YW	1556922669	Update paddleocr.py (#10066 ) cherry-pick package compatibility PR	2023-07-18 11:36:49 +08:00
andyj	83beede6b5	Update check img (#8558 ) * add finetune en doc & test=document_fix * fix dead link & test=document_fix * fix dead link & test=document_fix * update check img	2022-12-07 09:51:54 +08:00
zhoujun	59b3eade31	Merge pull request #8066 from WenmuZhou/doc2 update PP-Structurev to PP-StructureV	2022-10-25 14:20:10 +08:00
WenmuZhou	cad701d411	fix benckmark error when benckmark=false	2022-10-24 17:10:05 +08:00
WenmuZhou	b2bc6b746c	update doc	2022-10-24 05:36:34 +00:00
WenmuZhou	2145d8c4ec	add recovery requirements to whl	2022-10-20 17:03:47 +08:00
an1018	99698aed54	add_pdf2docx_api	2022-10-12 21:28:48 +08:00
andyjpaddle	b947dc3fb3	fix whl for layout recovery	2022-10-11 08:42:47 +00:00
WenmuZhou	c647a6da28	add re to ppstructure system	2022-10-10 05:39:41 +00:00
andyj	059349ab74	Merge pull request #7834 from andyjpaddle/update_whl update doc for whl	2022-10-08 19:21:53 +08:00
andyjpaddle	a5df6c34e0	update doc for whl	2022-10-08 08:28:30 +00:00
Xie Xuyang	f699d787ff	fix #7534 (#7535 )	2022-09-29 11:19:44 +08:00
vivien	f9baf0eabc	update bytes support	2022-08-31 19:59:29 +08:00
Leif	975975d30e	Update PPOCRLabel Update PPOCRLabel	2022-08-25 16:32:44 +08:00
an1018	8756253800	update doc	2022-08-24 10:07:30 +08:00
an1018	2976dab987	Merge branch 'dygraph' of https://github.com/PaddlePaddle/PaddleOCR into add_layout_hub	2022-08-24 09:04:59 +08:00
an1018	9c424ff164	update doc	2022-08-23 23:28:49 +08:00
an1018	d5d78b486b	update doc	2022-08-23 16:11:18 +08:00

117 Commits (d28cb46061503a4bd2d1308ecdb5e63b9923b8bc)
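
The change described in commit 42d280907a (replacing `if not dt_boxes:` with `if dt_boxes.size == 0:`) can be illustrated with a minimal sketch. It assumes `dt_boxes` is a NumPy array of detected boxes; the helper name `has_no_boxes` and the array shapes are hypothetical and are not taken from paddleocr.py.

```python
import numpy as np


def has_no_boxes(dt_boxes):
    """Hypothetical helper: True when detection produced no boxes."""
    # An explicit size check avoids evaluating the array as a boolean.
    return dt_boxes is None or dt_boxes.size == 0


# Assumed shapes for illustration: N quadrilaterals with 4 (x, y) points each.
boxes = np.zeros((2, 4, 2), dtype=np.float32)
empty = np.zeros((0, 4, 2), dtype=np.float32)

# Evaluating a multi-element array as a boolean raises ValueError in NumPy,
# which is the failure mode the commit message describes.
try:
    if not boxes:
        pass
    ambiguous = False
except ValueError:
    ambiguous = True

print(ambiguous, has_no_boxes(boxes), has_no_boxes(empty))
```

The explicit `.size == 0` test states the intent (no detections) directly, whereas the truthiness of an array depends on how many elements it holds.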