PaddleOCR

Commit Graph

Author	SHA1	Message	Date
Empathy	d3e362a3a0	1. PPOCRLabel now supports importing images from paths that contain Chinese characters; previously such paths caused a crash. (#11236) 2. PPOCRLabel now supports moving anchor points that are covered by other boxes; previously covered anchor points could not be moved. 3. Fix the syntax error caused by a mistyped character in utility.py. 4. Fix the type error caused by passing a float to setValue(), which expects an int. 5. Fix the missing import of predict_system in paddleocr. 6. Fix some input parameter type errors in canvas.py. 7. Fix LabelList being incompatible with the Sogou input method and the Win11 input method. Previously, when editing annotation data with the Sogou input method, typing a single letter caused the field to lose focus and commit the change, so a complete Chinese character could not be entered. The logic now does not commit on focus loss; the change is committed only when switching items or pressing Enter. 8. Add a feature to expand the selection box.	2023-12-13 10:16:39 +08:00
ficodex	42d280907a	Fix ValueError in text detection due to ambiguous array evaluation (#11287 ) This commit resolves the issue where `dt_boxes` was evaluated as a boolean in Python 3.11, leading to a ValueError. The condition `if not dt_boxes:` is replaced with `if dt_boxes.size == 0:` to explicitly check for an empty array, ensuring compatibility with Python 3.11 and maintaining the intended functionality.	2023-11-22 20:10:32 +08:00
shiyutang	e3fc6393e0	[Cherry-pick] Cherry-pick from release/2.6 (#11092) * Update recognition_en.md (#10059) ic15_dict.txt only has 36 digits * Update ocr_rec.h (#9469) It is enough to include preprocess_op.h, we do not need to include ocr_cls.h. * Add explanatory comments for num_classes (#10073) The value assigned to Architecture.Backbone.num_classes in ser_vi_layoutxlm_xfund_zh.yml is also set on Loss.num_classes. Because BIO tagging is used, if the dictionary contains n fields (including other), the number of classes is 2n-1; if the dictionary contains n fields (not including other), the number of classes is 2n+1. * Update algorithm_overview_en.md (#9747) Fix links to super-resolution algorithm docs * Improve the docs `deploy/hubserving/readme.md` and `doc/doc_ch/models_list.md` (#9110) * Update readme.md * Update readme.md * Update readme.md * Update models_list.md * trim trailing spaces @ `deploy/hubserving/readme_en.md` * `s/shell/bash/` @ `deploy/hubserving/readme_en.md` * Update `deploy/hubserving/readme_en.md` to sync with `deploy/hubserving/readme.md` * Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md` * Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md` * Update `doc/doc_en/models_list_en.md` to sync with `doc/doc_ch/models_list_en.md` * using Grammarly to tweak `deploy/hubserving/readme_en.md` * using Grammarly to tweak `doc/doc_en/models_list_en.md` * `ocr_system` module will return with values of field `confidence` * Update README_CN.md * Fix the incorrect reference path for image-to-Base64 conversion in the test service. (#8334) * Update application.md * [Doc] Fix 404 link. (#10318) * Update PP-OCRv3_det_train.md * Update knowledge_distillation.md * Update config.md * Fix fitz camelCase deprecation and .PDF not being recognized as pdf file (#10181) * Fix fitz camelCase deprecation and .PDF not being recognized as pdf file * refactor get_image_file_list function * Update customize.md (#10325) * Update FAQ.md (#10345) * Update FAQ.md (#10349) * Don't break overall processing on a bad image (#10216) * Add preprocessing common to OCR tasks (#10217) Add preprocessing to options * [MLU] add mlu device for infer (#10249) * Create newfeature.md * Update newfeature.md * remove unused imported module to avoid a module-not-found error at start-up of PyInstaller-packaged binaries. (#10502) * CV toolkit development campaign - text recognition returns per-character coordinates (#10515) * modification of return word box * update_implements * Update rec_postprocess.py * Update utility.py * Update README_ch.md * revert README_ch.md update * Fixed Layout recovery README file (#10493) Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one> * update_doc * bugfix --------- Co-authored-by: ChuongLoc <89434232+ChuongLoc@users.noreply.github.com> Co-authored-by: Wang Xin <xinwang614@gmail.com> Co-authored-by: tanjh <dtdhinjapan@gmail.com> Co-authored-by: Louis Maddox <lmmx@users.noreply.github.com> Co-authored-by: n0099 <n@n0099.net> Co-authored-by: zhenliang li <37922155+shouyong@users.noreply.github.com> Co-authored-by: itasli <ilyas.tasli@outlook.fr> Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com> Co-authored-by: PeiyuLau <135964669+PeiyuLau@users.noreply.github.com> Co-authored-by: kerneltravel <kjpioo2006@gmail.com> Co-authored-by: ToddBear <43341135+ToddBear@users.noreply.github.com> Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: Shubham Chambhare <59397280+Shubham654@users.noreply.github.com> Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one> Co-authored-by: andyj <87074272+andyjpaddle@users.noreply.github.com>	2023-10-18 17:37:23 +08:00
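Gmgge	2fdac319b1	fix: resolve the PPOCRLabel startup failure caused by a channel-count mismatch (#10748). The changelog traces the cause to #10655, which added handling of transparent colors to paddleocr. Since check_img already converts single-channel images to three-channel images, the conversion of four-channel images to three-channel images is also placed in check_img, and the three-channel image output logic is unified. (#10847) add: comments, because the function has become more complex.	2023-09-21 14:51:32 +08:00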
ToddBear	75d16610f4	Add new recognition method "ParseQ" (#10836) * Update PP-OCRv4_introduction.md * Update PP-OCRv4_introduction.md (#10616) * Update PP-OCRv4_introduction.md * Update PP-OCRv4_introduction.md * Update PP-OCRv4_introduction.md * Update README.md * Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:Release/2.7 (#10655) * Don't break overall processing on a bad image * Add preprocessing common to OCR tasks Add preprocessing to options * Update requirements.txt (#10656) added missing pyyaml library * [TIPC]update xpu tipc script (#10658) * fix-typo (#10642) Co-authored-by: Dennis <dvorst@users.noreply.github.com> Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com> * Fix the DSR error caused by data augmentation (#10662) (#10681) * Fix the DSR error caused by data augmentation * Roll back the erroneous change * Update algorithm_overview_en.md (#10670) Fixed simple spelling errors. * Implement recognition method ParseQ * Document update for new recognition method ParseQ * add prediction for parseq * Update rec_vit_parseq.yml * Update rec_r31_sar.yml * Update rec_r31_sar.yml * Update rec_r50_fpn_srn.yml * Update rec_vit_parseq.py * Update rec_vit_parseq.yml * Update rec_parseq_head.py * Update rec_img_aug.py * Update rec_vit_parseq.yml * Update __init__.py * Update predict_rec.py * Update paddleocr.py * Update requirements.txt * Update utility.py * Update utility.py --------- Co-authored-by: xiaoting <31891223+tink2123@users.noreply.github.com> Co-authored-by: topduke <784990967@qq.com> Co-authored-by: dyning <dyning.2003@163.com> Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com> Co-authored-by: itasli <ilyas.tasli@outlook.fr> Co-authored-by: Kai Song <50285351+USTCKAY@users.noreply.github.com> Co-authored-by: dvorst <87502756+dvorst@users.noreply.github.com> Co-authored-by: Dennis <dvorst@users.noreply.github.com> Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com> Co-authored-by: Dec20B <1192152456@qq.com> Co-authored-by: ncoffman <51147417+ncoffman@users.noreply.github.com>	2023-09-07 16:36:47 +08:00
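Gmgge	3a6fe48afd	Inference requires three-channel images, and the OpenCV imread documentation describes IMREAD_COLOR as "If set, always convert image to the 3 channel BGR color image."; the read flag is therefore changed to this value to resolve the subsequent channel mismatch. (#10777)	2023-08-31 14:31:28 +08:00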
UserUnknownFactor	b3912fcf7a	Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:dygraph (#10654 ) * Don't break overall processing on a bad image * Add preprocessing common to OCR tasks Add preprocessing to options	2023-08-21 16:33:03 +08:00
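WilliamQf	2bd552c801	Feature: when --savefile is true, save the OCR inference result under --output in a file named after the current image with ".txt" appended; resolves issues: (#10628) * Feature: when --savefile is true, save the OCR inference result under --output in a file named after the current image with ".txt" appended; resolves issues: https://github.com/PaddlePaddle/PaddleOCR/issues/10533 * Remove garbled characters * 1. Remove garbled characters * 1. Remove garbled characters	2023-08-21 10:25:47 +08:00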
andyj	681467d4ea	[bug fix] fix none res in recovery (#10603 ) * add finetune en doc & test=document_fix * fix dead link & test=document_fix * fix dead link & test=document_fix * update check img * fix det res dtype * update args default type & test=document_fix * fix numpy version * support numpy1.24.0 * fix doc & test=document_fix * update doc * update doc, test=document_fix * fix pdf2word in whl, test=document_fix * fix none res in recovery * update version * format code	2023-08-10 16:55:26 +08:00
andyj	254786752a	[BUG FIX] Fix pdf2word in whl (#10584 ) * add finetune en doc & test=document_fix * fix dead link & test=document_fix * fix dead link & test=document_fix * update check img * fix det res dtype * update args default type & test=document_fix * fix numpy version * support numpy1.24.0 * fix doc & test=document_fix * update doc * update doc, test=document_fix * fix pdf2word in whl, test=document_fix	2023-08-09 19:15:49 +08:00
xiaoting	2f70e4b7f6	upload paddleocr whl to pypi (#10524 ) * upload paddleocr whl to pypi * Update README_ch.md * Update README_ch.md * Update quickstart.md * Update README_ch.md * Update README_ch.md	2023-08-06 11:17:13 +08:00
Bryan YW	1556922669	Update paddleocr.py (#10066 ) cherry-pick package compatibility PR	2023-07-18 11:36:49 +08:00
andyj	83beede6b5	Update check img (#8558 ) * add finetune en doc & test=document_fix * fix dead link & test=document_fix * fix dead link & test=document_fix * update check img	2022-12-07 09:51:54 +08:00
zhoujun	59b3eade31	Merge pull request #8066 from WenmuZhou/doc2 update PP-Structurev to PP-StructureV	2022-10-25 14:20:10 +08:00
WenmuZhou	cad701d411	fix benchmark error when benchmark=false	2022-10-24 17:10:05 +08:00
WenmuZhou	b2bc6b746c	update doc	2022-10-24 05:36:34 +00:00
WenmuZhou	2145d8c4ec	add recovery requirements to whl	2022-10-20 17:03:47 +08:00
an1018	99698aed54	add_pdf2docx_api	2022-10-12 21:28:48 +08:00
andyjpaddle	b947dc3fb3	fix whl for layout recovery	2022-10-11 08:42:47 +00:00
WenmuZhou	c647a6da28	add re to ppstructure system	2022-10-10 05:39:41 +00:00
andyj	059349ab74	Merge pull request #7834 from andyjpaddle/update_whl update doc for whl	2022-10-08 19:21:53 +08:00
andyjpaddle	a5df6c34e0	update doc for whl	2022-10-08 08:28:30 +00:00
Xie Xuyang	f699d787ff	fix #7534 (#7535 )	2022-09-29 11:19:44 +08:00
vivien	f9baf0eabc	update bytes support	2022-08-31 19:59:29 +08:00
Leif	975975d30e	Update PPOCRLabel	2022-08-25 16:32:44 +08:00
an1018	8756253800	update doc	2022-08-24 10:07:30 +08:00
an1018	2976dab987	Merge branch 'dygraph' of https://github.com/PaddlePaddle/PaddleOCR into add_layout_hub	2022-08-24 09:04:59 +08:00
an1018	9c424ff164	update doc	2022-08-23 23:28:49 +08:00
an1018	d5d78b486b	update doc	2022-08-23 16:11:18 +08:00
WenmuZhou	f7b35bbbc4	do not print table result	2022-08-22 08:00:49 +00:00
littletomatodonkey	8c7c45420f	polish kie doc and code (#7255 ) * add fapiao kie * fix readme * fix fanli * add readme * add how to do kie en * add algo kie * add algo overview en * rename vqa to kie * fix read gif	2022-08-21 10:55:49 +08:00
zhoujun	aec5dbf8f7	update model size (#7269 ) * update model size * update layout dict in whl	2022-08-20 16:10:45 +08:00
WenmuZhou	604f531b9a	add layout model	2022-08-17 04:40:07 +00:00
WenmuZhou	b26ce23774	rm unused code	2022-08-16 10:55:24 +00:00
WenmuZhou	bb53c8d100	add table model link	2022-08-16 10:46:09 +00:00
WenmuZhou	731688c2dd	add PP-Structurev2 to whl	2022-08-10 14:51:57 +00:00
andyjpaddle	e2b2f33092	update some note	2022-05-30 09:25:21 +00:00
WenmuZhou	f31980266a	merge upstream	2022-05-10 14:28:02 +08:00
andyjpaddle	565abdb632	update version	2022-05-10 02:46:13 +00:00
andyjpaddle	b16c3d4234	update v3 rec name	2022-05-10 02:41:54 +00:00
andyjpaddle	4d406f8825	update whl for rec image shape	2022-05-07 09:06:13 +00:00
andyjpaddle	cedf8b5669	update whl for multi lang	2022-05-07 06:05:47 +00:00
andyjpaddle	bd147c61a9	update whl for multi lang rec	2022-05-07 03:52:06 +00:00
andyjpaddle	738ff190dd	update doc for whl and python cpp infer quick	2022-04-29 03:38:59 +00:00
WenmuZhou	bbff7c55c1	The whl package supports separate table recognition and layout analysis	2022-04-22 13:24:45 +08:00
tink2123	3a28266735	update paddleocr version no	2022-04-02 09:36:48 +00:00
WenmuZhou	b3fbfc3109	rename OCRSystem to StructureSystem	2022-03-30 09:24:24 +00:00
WenmuZhou	463622b1f9	rm unused code	2022-03-23 14:26:40 +00:00
WenmuZhou	e693619888	add relative import	2022-03-23 14:22:06 +00:00
WenmuZhou	f85b3ea8eb	fix bug in whl import fce	2022-03-18 10:08:58 +00:00

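Commit 42d280907a above replaces `if not dt_boxes:` with `if dt_boxes.size == 0:`. The following is a minimal sketch of why the explicit size check is safer for NumPy arrays. The helper name `has_no_boxes`, the `(N, 4, 2)` box shape, and the `None` handling are assumptions made for illustration and are not taken from the repository.

```python
import numpy as np


def has_no_boxes(dt_boxes):
    """Return True when no detection boxes are present.

    Hypothetical helper: `dt_boxes` is assumed to be None or a NumPy
    array of shape (N, 4, 2), one quadrilateral per detected text region.
    """
    return dt_boxes is None or dt_boxes.size == 0


empty = np.zeros((0, 4, 2), dtype=np.float32)
two_boxes = np.zeros((2, 4, 2), dtype=np.float32)

# Truth-testing an array with more than one element is ambiguous,
# so NumPy raises ValueError instead of guessing.
try:
    if not two_boxes:
        pass
    ambiguous = False
except ValueError:
    ambiguous = True

print(has_no_boxes(empty), has_no_boxes(two_boxes), ambiguous)
```

Checking `.size` asks the question the code actually means ("are there zero elements?") and never triggers the ambiguous truth-value path.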

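The num_classes note in commit e3fc6393e0 gives two class counts for BIO tagging: 2n-1 when the n dictionary fields include other, and 2n+1 when they do not. The arithmetic can be sketched as below: every field except other contributes a B- tag and an I- tag, and there is a single O tag. The function name `bio_num_classes` is hypothetical and only illustrates the formula.

```python
def bio_num_classes(n_fields, includes_other):
    """Number of BIO tag classes for a dictionary of `n_fields` fields.

    Hypothetical helper illustrating the formula in the commit note.
    Each non-"other" field yields two tags (B-field, I-field); the
    "other" class maps to the single O tag.
    """
    if n_fields < 1:
        raise ValueError("n_fields must be at least 1")
    if includes_other:
        # (n - 1) real fields * 2 tags + 1 O tag = 2n - 1
        return 2 * (n_fields - 1) + 1
    # n real fields * 2 tags + 1 O tag = 2n + 1
    return 2 * n_fields + 1


print(bio_num_classes(4, includes_other=True))
print(bio_num_classes(3, includes_other=False))
```

Both calls describe the same three real fields, once counted with other and once without, so they yield the same class count.
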
96 Commits (49ef54ee3cf6a3f52501dce670ea2e751c5d75d3)
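
Commits 2fdac319b1 and 3a6fe48afd both deal with making sure images reach inference with three channels. The sketch below shows one way such normalization could look using NumPy only. It is not the repository's `check_img`: the function name `to_three_channels` is hypothetical, and dropping the alpha plane (rather than blending it against a background) is a simplifying assumption.

```python
import numpy as np


def to_three_channels(img):
    """Return an (H, W, 3) array from a 1-, 3-, or 4-channel image.

    Hypothetical helper, not the repository's check_img.
    """
    if img.ndim == 2:
        # Grayscale (H, W): repeat the plane across three channels.
        return np.stack([img, img, img], axis=-1)
    if img.ndim == 3 and img.shape[2] == 1:
        # Grayscale stored as (H, W, 1).
        return np.repeat(img, 3, axis=2)
    if img.ndim == 3 and img.shape[2] == 4:
        # Four channels: this sketch simply drops the alpha plane.
        return img[:, :, :3].copy()
    if img.ndim == 3 and img.shape[2] == 3:
        return img
    raise ValueError(f"unsupported image shape: {img.shape}")


gray = np.zeros((2, 3), dtype=np.uint8)
with_alpha = np.zeros((2, 3, 4), dtype=np.uint8)
print(to_three_channels(gray).shape, to_three_channels(with_alpha).shape)
```

Keeping every conversion in one function means all callers receive the same three-channel layout, which is the unification that commit 2fdac319b1 describes.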