PaddleOCR

Commit Graph

Author	SHA1	Message	Date
ficodex	42d280907a	Fix ValueError in text detection due to ambiguous array evaluation (#11287 ) This commit resolves the issue where `dt_boxes` was evaluated as a boolean in Python 3.11, leading to a ValueError. The condition `if not dt_boxes:` is replaced with `if dt_boxes.size == 0:` to explicitly check for an empty array, ensuring compatibility with Python 3.11 and maintaining the intended functionality.	2023-11-22 20:10:32 +08:00
shiyutang	e3fc6393e0	[Cherry-pick] Cherry-pick from release/2.6 (#11092 ) * Update recognition_en.md (#10059) ic15_dict.txt only have 36 digits * Update ocr_rec.h (#9469) It is enough to include preprocess_op.h, we do not need to include ocr_cls.h. * Add explanatory comment for num_classes (#10073) The value assigned to Architecture.Backbone.num_classes in ser_vi_layoutxlm_xfund_zh.yml is also applied to Loss.num_classes. Because BIO tagging is used, if the dictionary contains n fields (including other), the number of classes is 2n-1; if the dictionary contains n fields (excluding other), the number of classes is 2n+1. * Update algorithm_overview_en.md (#9747) Fix links to super-resolution algorithm docs * Improve docs `deploy/hubserving/readme.md` and `doc/doc_ch/models_list.md` (#9110) * Update readme.md * Update readme.md * Update readme.md * Update models_list.md * trim trailling spaces @ `deploy/hubserving/readme_en.md` * `s/shell/bash/` @ `deploy/hubserving/readme_en.md` * Update `deploy/hubserving/readme_en.md` to sync with `deploy/hubserving/readme.md` * Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md` * Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md` * Update `doc/doc_en/models_list_en.md` to sync with `doc/doc_ch/models_list_en.md` * using Grammarly to weak `deploy/hubserving/readme_en.md` * using Grammarly to tweak `doc/doc_en/models_list_en.md` * `ocr_system` module will return with values of field `confidence` * Update README_CN.md * Fix the incorrect reference path for image-to-Base64 conversion in the test service. (#8334) * Update application.md * [Doc] Fix 404 link. (#10318) * Update PP-OCRv3_det_train.md * Update knowledge_distillation.md * Update config.md * Fix fitz camelCase deprecation and .PDF not being recognized as pdf file (#10181) * Fix fitz camelCase deprecation and .PDF not being recognized as pdf file * refactor get_image_file_list function * Update customize.md (#10325) * Update FAQ.md (#10345) * Update FAQ.md (#10349) * Don't break overall processing on a bad image (#10216) * Add preprocessing common to OCR tasks (#10217) Add preprocessing to options * [MLU] add mlu device for infer (#10249) * Create newfeature.md * Update newfeature.md * remove unused imported module, so can avoid PyInstaller packaged binary's start-time not found module error. (#10502) * CV suite building campaign - text recognition returns per-character box coordinates (#10515) * modification of return word box * update_implements * Update rec_postprocess.py * Update utility.py * Update README_ch.md * revert README_ch.md update * Fixed Layout recovery README file (#10493) Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one> * update_doc * bugfix --------- Co-authored-by: ChuongLoc <89434232+ChuongLoc@users.noreply.github.com> Co-authored-by: Wang Xin <xinwang614@gmail.com> Co-authored-by: tanjh <dtdhinjapan@gmail.com> Co-authored-by: Louis Maddox <lmmx@users.noreply.github.com> Co-authored-by: n0099 <n@n0099.net> Co-authored-by: zhenliang li <37922155+shouyong@users.noreply.github.com> Co-authored-by: itasli <ilyas.tasli@outlook.fr> Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com> Co-authored-by: PeiyuLau <135964669+PeiyuLau@users.noreply.github.com> Co-authored-by: kerneltravel <kjpioo2006@gmail.com> Co-authored-by: ToddBear <43341135+ToddBear@users.noreply.github.com> Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com> Co-authored-by: Shubham Chambhare <59397280+Shubham654@users.noreply.github.com> Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one> Co-authored-by: andyj <87074272+andyjpaddle@users.noreply.github.com>	2023-10-18 17:37:23 +08:00
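Gmgge	2fdac319b1	fix: Fix the PPOCRLabel startup failure caused by a channel-count mismatch (#10748). The changelog points to #10655, where paddleocr added handling for transparency. Because check_img already converts single-channel images to three channels, the conversion of four-channel images to three channels is also placed in check_img, and the three-channel image output logic is unified. (#10847 ) add: Add comments because the function's complexity has increased.	2023-09-21 14:51:32 +08:00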
ToddBear	75d16610f4	Add new recognition method "ParseQ" (#10836 ) * Update PP-OCRv4_introduction.md * Update PP-OCRv4_introduction.md (#10616) * Update PP-OCRv4_introduction.md * Update PP-OCRv4_introduction.md * Update PP-OCRv4_introduction.md * Update README.md * Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:Release/2.7 (#10655) * Don't break overall processing on a bad image * Add preprocessing common to OCR tasks Add preprocessing to options * Update requirements.txt (#10656) added missing pyyaml library * [TIPC]update xpu tipc script (#10658) * fix-typo (#10642) Co-authored-by: Dennis <dvorst@users.noreply.github.com> Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com> * Fix DSR error caused by data augmentation (#10662) (#10681) * Fix DSR error caused by data augmentation * Roll back erroneous change * Update algorithm_overview_en.md (#10670) Fixed simple spelling errors. * Implement recoginition method ParseQ * Document update for new recognition method ParseQ * add prediction for parseq * Update rec_vit_parseq.yml * Update rec_r31_sar.yml * Update rec_r31_sar.yml * Update rec_r50_fpn_srn.yml * Update rec_vit_parseq.py * Update rec_vit_parseq.yml * Update rec_parseq_head.py * Update rec_img_aug.py * Update rec_vit_parseq.yml * Update __init__.py * Update predict_rec.py * Update paddleocr.py * Update requirements.txt * Update utility.py * Update utility.py --------- Co-authored-by: xiaoting <31891223+tink2123@users.noreply.github.com> Co-authored-by: topduke <784990967@qq.com> Co-authored-by: dyning <dyning.2003@163.com> Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com> Co-authored-by: itasli <ilyas.tasli@outlook.fr> Co-authored-by: Kai Song <50285351+USTCKAY@users.noreply.github.com> Co-authored-by: dvorst <87502756+dvorst@users.noreply.github.com> Co-authored-by: Dennis <dvorst@users.noreply.github.com> Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com> Co-authored-by: Dec20B <1192152456@qq.com> Co-authored-by: ncoffman <51147417+ncoffman@users.noreply.github.com>	2023-09-07 16:36:47 +08:00
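Gmgge	3a6fe48afd	Inference requires three-channel images, and OpenCV documents the imread flag IMREAD_COLOR as "If set, always convert image to the 3 channel BGR color image.", so the read flag is changed accordingly to resolve the downstream channel mismatch. (#10777 )	2023-08-31 14:31:28 +08:00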
UserUnknownFactor	b3912fcf7a	Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:dygraph (#10654 ) * Don't break overall processing on a bad image * Add preprocessing common to OCR tasks Add preprocessing to options	2023-08-21 16:33:03 +08:00
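WilliamQf	2bd552c801	Feature: when --savefile is true, save the OCR inference result under --output in a file named after the current image with ".txt" appended; resolves issue: (#10628 ) * Feature: when --savefile is true, save the OCR inference result under --output in a file named after the current image with ".txt" appended; resolves issue: https://github.com/PaddlePaddle/PaddleOCR/issues/10533 * Removed garbled characters * 1. Removed garbled characters * 1. Removed garbled characters	2023-08-21 10:25:47 +08:00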
andyj	681467d4ea	[bug fix] fix none res in recovery (#10603 ) * add finetune en doc & test=document_fix * fix dead link & test=document_fix * fix dead link & test=document_fix * update check img * fix det res dtype * update args default type & test=document_fix * fix numpy version * support numpy1.24.0 * fix doc & test=document_fix * update doc * update doc, test=document_fix * fix pdf2word in whl, test=document_fix * fix none res in recovery * update version * format code	2023-08-10 16:55:26 +08:00
andyj	254786752a	[BUG FIX] Fix pdf2word in whl (#10584 ) * add finetune en doc & test=document_fix * fix dead link & test=document_fix * fix dead link & test=document_fix * update check img * fix det res dtype * update args default type & test=document_fix * fix numpy version * support numpy1.24.0 * fix doc & test=document_fix * update doc * update doc, test=document_fix * fix pdf2word in whl, test=document_fix	2023-08-09 19:15:49 +08:00
xiaoting	2f70e4b7f6	upload paddleocr whl to pypi (#10524 ) * upload paddleocr whl to pypi * Update README_ch.md * Update README_ch.md * Update quickstart.md * Update README_ch.md * Update README_ch.md	2023-08-06 11:17:13 +08:00
Bryan YW	1556922669	Update paddleocr.py (#10066 ) cherry-pick package compatibility PR	2023-07-18 11:36:49 +08:00
andyj	83beede6b5	Update check img (#8558 ) * add finetune en doc & test=document_fix * fix dead link & test=document_fix * fix dead link & test=document_fix * update check img	2022-12-07 09:51:54 +08:00
zhoujun	59b3eade31	Merge pull request #8066 from WenmuZhou/doc2 update PP-Structurev to PP-StructureV	2022-10-25 14:20:10 +08:00
WenmuZhou	cad701d411	fix benckmark error when benckmark=false	2022-10-24 17:10:05 +08:00
WenmuZhou	b2bc6b746c	update doc	2022-10-24 05:36:34 +00:00
WenmuZhou	2145d8c4ec	add recovery requirements to whl	2022-10-20 17:03:47 +08:00
an1018	99698aed54	add_pdf2docx_api	2022-10-12 21:28:48 +08:00
andyjpaddle	b947dc3fb3	fix whl for layout recovery	2022-10-11 08:42:47 +00:00
WenmuZhou	c647a6da28	add re to ppstructure system	2022-10-10 05:39:41 +00:00
andyj	059349ab74	Merge pull request #7834 from andyjpaddle/update_whl update doc for whl	2022-10-08 19:21:53 +08:00
andyjpaddle	a5df6c34e0	update doc for whl	2022-10-08 08:28:30 +00:00
Xie Xuyang	f699d787ff	fix #7534 (#7535 )	2022-09-29 11:19:44 +08:00
vivien	f9baf0eabc	update bytes support	2022-08-31 19:59:29 +08:00
Leif	975975d30e	Update PPOCRLabel Update PPOCRLabel	2022-08-25 16:32:44 +08:00
an1018	8756253800	update doc	2022-08-24 10:07:30 +08:00
an1018	2976dab987	Merge branch 'dygraph' of https://github.com/PaddlePaddle/PaddleOCR into add_layout_hub	2022-08-24 09:04:59 +08:00
an1018	9c424ff164	update doc	2022-08-23 23:28:49 +08:00
an1018	d5d78b486b	update doc	2022-08-23 16:11:18 +08:00
WenmuZhou	f7b35bbbc4	do not print table result	2022-08-22 08:00:49 +00:00
littletomatodonkey	8c7c45420f	polish kie doc and code (#7255 ) * add fapiao kie * fix readme * fix fanli * add readme * add how to do kie en * add algo kie * add algo overview en * rename vqa to kie * fix read gif	2022-08-21 10:55:49 +08:00
zhoujun	aec5dbf8f7	update model size (#7269 ) * update model size * update layout dict in whl	2022-08-20 16:10:45 +08:00
WenmuZhou	604f531b9a	add layout model	2022-08-17 04:40:07 +00:00
WenmuZhou	b26ce23774	rm unused code	2022-08-16 10:55:24 +00:00
WenmuZhou	bb53c8d100	add table model link	2022-08-16 10:46:09 +00:00
WenmuZhou	731688c2dd	add PP-Structurev2 to whl	2022-08-10 14:51:57 +00:00
andyjpaddle	e2b2f33092	update some note	2022-05-30 09:25:21 +00:00
WenmuZhou	f31980266a	merge upstream	2022-05-10 14:28:02 +08:00
andyjpaddle	565abdb632	update version	2022-05-10 02:46:13 +00:00
andyjpaddle	b16c3d4234	update v3 rec name	2022-05-10 02:41:54 +00:00
andyjpaddle	4d406f8825	update whl for rec image shape	2022-05-07 09:06:13 +00:00
andyjpaddle	cedf8b5669	update whl for multi lang	2022-05-07 06:05:47 +00:00
andyjpaddle	bd147c61a9	update whl for multi lang rec	2022-05-07 03:52:06 +00:00
andyjpaddle	738ff190dd	update doc for whl and python cpp infer quick	2022-04-29 03:38:59 +00:00
WenmuZhou	bbff7c55c1	The whl package supports separate table recognition and layout analysis	2022-04-22 13:24:45 +08:00
tink2123	3a28266735	update paddleocr version no	2022-04-02 09:36:48 +00:00
WenmuZhou	b3fbfc3109	rename OCRSystem to StructureSystem	2022-03-30 09:24:24 +00:00
WenmuZhou	463622b1f9	rm unused code	2022-03-23 14:26:40 +00:00
WenmuZhou	e693619888	add relative import	2022-03-23 14:22:06 +00:00
WenmuZhou	f85b3ea8eb	fix bug in whl import fce	2022-03-18 10:08:58 +00:00
WenmuZhou	496887f83f	change print to logger.debug	2022-03-17 14:20:41 +00:00
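
Commit 42d280907a above replaces `if not dt_boxes:` with `if dt_boxes.size == 0:`. A minimal sketch of why the explicit size check is safer, assuming `dt_boxes` is a NumPy array of detected boxes; the helper name `has_no_boxes` and the array shapes are hypothetical and not taken from the PaddleOCR source:

```python
import numpy as np


def has_no_boxes(dt_boxes: np.ndarray) -> bool:
    """Return True when the detector produced no boxes (hypothetical helper)."""
    # .size counts elements, so this never asks NumPy for a truth value.
    return dt_boxes.size == 0


# Illustrative shapes only: N boxes, 4 corner points, (x, y) per point.
empty = np.zeros((0, 4, 2), dtype=np.float32)
two_boxes = np.zeros((2, 4, 2), dtype=np.float32)

# Taking the truth value of a multi-element array raises ValueError.
try:
    if not two_boxes:
        pass
    ambiguous = False
except ValueError:
    ambiguous = True

print(has_no_boxes(empty), has_no_boxes(two_boxes), ambiguous)
```

The size check behaves the same for empty and non-empty arrays, whereas the boolean test only fails once the detector actually returns boxes.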


95 Commits (a0901d2175d52b1afbd2dbe503a3f723c1db0b6b)
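
The num_classes note in commit e3fc6393e0 (#10073) gives the class count under BIO tagging as 2n-1 when the n dictionary fields include other, and 2n+1 when they do not. A small sketch of that arithmetic, assuming each non-other field contributes a B- and an I- tag and that a single O tag covers everything else; the function name and the field names are hypothetical:

```python
def bio_num_classes(fields: list[str], other_label: str = "other") -> int:
    """Count BIO classes for a field dictionary (hypothetical helper)."""
    # Every field except "other" gets a B- tag and an I- tag.
    entity_fields = [f for f in fields if f.lower() != other_label]
    # One shared O tag covers "other", whether or not it is listed.
    return 2 * len(entity_fields) + 1


# n = 4 fields including "other": 2n - 1 = 7 classes.
with_other = ["other", "question", "answer", "header"]
# n = 3 fields excluding "other": 2n + 1 = 7 classes.
without_other = ["question", "answer", "header"]

print(bio_num_classes(with_other), bio_num_classes(without_other))
```

Both dictionaries describe the same label set, so both formulas give the same class count.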