Commit Graph

6138 Commits (5e40f85ef380cad90b72a6c24024bdf4e8c7b011)
 

Author SHA1 Message Date
jzhang533 5e40f85ef3
setup a workflow for publishing package to pypi (#11804) 2024-03-27 10:41:55 +08:00
zxcd 8c9d3f91b1
adapter new type promotion rule for Paddle 2.6 (#11698) 2024-03-18 11:55:55 +08:00
xiaoting b583b4773f
cherry-pick for lazy import pymupdf and pre-commit (#11692)
Co-authored-by: jzhang533 <jzhang533@gmail.com>
2024-03-13 12:34:31 +08:00
Matej Kollár efc01375c9
Fix dead links (#11520) 2024-03-06 13:01:02 +08:00
xiaoting 3869582dec
rm QR code (#11532)
* rm QR code in the document

* rm QR code
2024-01-24 11:54:31 +08:00
xiaoting 5e3dfb49b7
rm QR code in the document (#11512) 2024-01-24 11:39:25 +08:00
Ran chongzhi 448ee6bec1
[Feature]Complete the ppocrv4_act (#11345)
* ppocrv4_act

* update

* fix bugs when run act on ppocrv4_dedt_server

* modify act config files

* modify test code and update results

* 新增数据处理的脚本

* fix

* Add batch testing script

* fix

* fix

* fix

* update det_server inference on tesla v100

* update model urls

---------

Co-authored-by: tangshiyu <tangshiyu@baidu.com>
2024-01-19 11:12:25 +08:00
co63oc 3b6f117c44
Fix (#11448) 2024-01-02 11:02:13 +08:00
sheiy 49ef54ee3c
chore: add notes for docker gpu deploy PP-OCRv4 (#11390)
* chore: add notes for docker gpu deploy PP-OCRv4

* chore: add notes for docker gpu deploy PP-OCRv4

* Update Dockerfile
2024-01-02 10:49:32 +08:00
zhangyubo0722 414d085166
update paddlex of readme (#11422) 2023-12-28 14:25:29 +08:00
firmament2008 b5e5dba3be
Fix QPointF IndexError: list index out of range (#11393)
* Fix QPointF IndexError: list index out of range

当QPointF 获取异常时,self.center  赋予默认值

* 增加QPointF异常时的提醒信息
2023-12-27 19:47:04 +08:00
Yesir 1f6712c370
Update zeros' comment in rec_abinet_head.py (#11374)
Bug fixes | One of code comments | maybe here it's B,N,C
2023-12-27 19:45:24 +08:00
Weihang Wang 25ffa816f7
doc: add doc for satrn (#11397) 2023-12-27 19:41:17 +08:00
marswen 0382bfb02d
Optimize prediction on long image and deduplicate similar boxes with multiple lables (#11366)
* Handle conflict where a box is simultaneously recognized as multiple labels

* Split large height image recursively and process each with overlap to enhance performance

* Fix error when dt_box result is empty

* Add split operation on horizon side

* Slide on horizon may suffer line completeness, so that add more strict condition.

* Optimize recognition of overlap boxes.
2023-12-21 10:32:42 +08:00
Xue Yadong c708180ce9
fix bug for rec_postprocess.py (#11389)
Co-authored-by: xueyadong <xueyadong@baidu.com>
2023-12-19 11:06:25 +08:00
Empathy d3e362a3a0
1、PPOCRLabel现在支持从中文路径导入图片,原本导入含中文路径的图片会导致崩溃。 (#11236)
2、PPOCRLabel现在支持移动被其他框覆盖的锚点,原本无法移动被覆盖的锚点。
3、修复utility.py中误输入字符导致的语法错误。
4、修复setValue()应输入int,实际输入float导致的类型错误。
5、修复paddleocr中未import predict_system的错误。
6、修复canvas.py中部分输入参数类型错误
7、修复了LabelList不兼容搜狗输入法或win11输入法的问题。原本使用搜狗输入法修改标注数据时,仅输入一个字母就会失去焦点并提交数据变更,导致无法输入完整的汉字。现在将处理逻辑改为失去焦点时仍不提交数据变更,直到切换item或按下enter键才提交。
8、新增扩大选框的功能

1、PPOCRLabel now supports importing images from Chinese paths, originally importing images containing Chinese paths would cause a crash.
2、PPOCRLabel now supports moving anchor points that are covered by other boxes, originally it could not move the covered anchor points.
3、Fix the syntax error caused by mistakenly inputting characters in utility.py.
4、Repair the type error caused by inputting int but float in setValue().
5、Repair the error of not import predict_system in paddleocr.
6、Fix some input parameter type errors in canvas.py.
7、LabelList can't use Sogou Input Method or Win11 Input Method to input text.
8、Add function of expand box.
2023-12-13 10:16:39 +08:00
TracebaK 317eac13b5
Add device check in check_gpu (#11293) 2023-12-06 11:12:58 +08:00
gmm 856ce76be3
Dygraph fix max_mem_reserved for benchmark (#11341)
* fix profile

* fix python3.10

* add max_mem_reserved for benchmark

* fix benchmark
2023-12-05 14:55:38 +08:00
xiaoting dc001ac44a
Update utility.py 2023-11-30 12:32:17 +08:00
RuohengMa 317f56a3f4
Refine according to comment (#11301) 2023-11-24 16:50:34 +08:00
RuohengMa c8544d04ec
Fix bug when running on XPU (#11299) 2023-11-23 16:58:48 +08:00
gmm a0901d2175
Dygraph【benchmark】add max_mem_reserved for benchmark (#11284)
* fix profile

* fix python3.10

* add max_mem_reserved for benchmark
2023-11-22 20:11:14 +08:00
ficodex 42d280907a
Fix ValueError in text detection due to ambiguous array evaluation (#11287)
This commit resolves the issue where `dt_boxes` was evaluated as a boolean
in Python 3.11, leading to a ValueError. The condition `if not dt_boxes:` is
replaced with `if dt_boxes.size == 0:` to explicitly check for an empty array,
ensuring compatibility with Python 3.11 and maintaining the intended functionality.
2023-11-22 20:10:32 +08:00
黃鼎勲 (Ding Hsun Huang) 80459f59fe
解決recognition的train test分割程式執行後的文檔每行間多出一行空格 (#11280)
使用gen_ocr_train_val_test.py分割recognition data後產生的train.txt、val.txt和test.txt每行label間多出一行空格,導致訓練時出現異常,移除換行\n後便可正常運行。

Co-authored-by: Wayne Huang <dinghsun@gmail.com>
2023-11-22 20:10:12 +08:00
sylarwcy 68b384292b
v4 det cml configs (#11258)
* fixed several bugs

1. 修复了找不到模型的问题,原因是PPLCNetNew不在可选模型名称的列表内,将PPLCNetNew改为PPLCNetV3;
2. 修复了db_fpn.py解析in_channels的报错问题,db_fpn.py中按序列解析in_channels,而lcnetv3.py中det为false时输出是数值,为true才返回序列。在两个student模型的backbone中添加"det: true";
3. 减少CPU占用,将cal_metric_during_train: true 改为 false;
4. 修复了训练过程中eval时的显存溢出问题。通过限制过大的测试数据可解决该问题,具体调整是,在eval→DetResizeForTest的配置中增加"limit_side_len: 960,limit_type: max"。

* 恢复cal_metric_during_train的设置
2023-11-17 20:38:25 +08:00
Alexnet fb38d0be18
fix SAR inference, when batch size>1, norm_img_batch and valid_ratios shapes do not match (#11238) 2023-11-15 10:02:03 +08:00
ToddBear 73c08b7e4c
Add tipc for "ParseQ" method (#10843)
* Create rec_vit_parseq.yml

* Create train_infer_python.txt
2023-11-01 16:11:46 +08:00
fzhi bf6b100f85
update uygur dict (#11125) 2023-10-26 15:32:55 +08:00
shiyutang 987b3112cd
valid_out (#11126) 2023-10-25 15:35:10 +08:00
xiaoting bf59c42a0f
fix satrn export for paddle2.5 (#11096) 2023-10-19 15:00:11 +08:00
Kai Song a0218a8b83
[TIPC]update tipc scripts (#11097) 2023-10-18 20:05:52 +08:00
shiyutang e3fc6393e0
[Cherry-pick] Cherry-pick from release/2.6 (#11092)
* Update recognition_en.md (#10059)

ic15_dict.txt only have 36 digits

* Update ocr_rec.h (#9469)

It is enough to include preprocess_op.h, we do not need to include ocr_cls.h.

* 补充num_classes注释说明 (#10073)

ser_vi_layoutxlm_xfund_zh.yml中的Architecture.Backbone.num_classes所赋值会设置给Loss.num_classes,
由于采用BIO标注,假设字典中包含n个字段(包含other)时,则类别数为2n-1;假设字典中包含n个字段(不含other)时,则类别数为2n+1。

* Update algorithm_overview_en.md (#9747)

Fix links to super-resolution algorithm docs

* 改进文档`deploy/hubserving/readme.md`和`doc/doc_ch/models_list.md` (#9110)

* Update readme.md

* Update readme.md

* Update readme.md

* Update models_list.md

* trim trailling spaces @ `deploy/hubserving/readme_en.md`

* `s/shell/bash/` @ `deploy/hubserving/readme_en.md`

* Update `deploy/hubserving/readme_en.md` to sync with `deploy/hubserving/readme.md`

* Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md`

* Update deploy/hubserving/readme_en.md to sync with `deploy/hubserving/readme.md`

* Update `doc/doc_en/models_list_en.md` to sync with `doc/doc_ch/models_list_en.md`

* using Grammarly to weak `deploy/hubserving/readme_en.md`

* using Grammarly to tweak `doc/doc_en/models_list_en.md`

* `ocr_system` module will return with values of field `confidence`

* Update README_CN.md

* 修复测试服务中图片转Base64的引用地址错误。 (#8334)

* Update application.md

* [Doc] Fix 404 link.  (#10318)

* Update PP-OCRv3_det_train.md

* Update knowledge_distillation.md

* Update config.md

* Fix fitz camelCase deprecation and .PDF not being recognized as pdf file (#10181)

* Fix fitz camelCase deprecation and .PDF not being recognized as pdf file

* refactor get_image_file_list function

* Update customize.md (#10325)

* Update FAQ.md (#10345)

* Update FAQ.md (#10349)

* Don't break overall processing on a bad image (#10216)

* Add preprocessing common to OCR tasks (#10217)

Add preprocessing to options

* [MLU] add mlu device for infer (#10249)

* Create newfeature.md

* Update newfeature.md

* remove unused imported module, so can avoid PyInstaller packaged binary's start-time not found module error. (#10502)

* CV套件建设专项活动 - 文字识别返回单字识别坐标 (#10515)

* modification of return word box

* update_implements

* Update rec_postprocess.py

* Update utility.py

* Update README_ch.md

* revert README_ch.md update

* Fixed Layout recovery README file (#10493)

Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one>

* update_doc

* bugfix

---------

Co-authored-by: ChuongLoc <89434232+ChuongLoc@users.noreply.github.com>
Co-authored-by: Wang Xin <xinwang614@gmail.com>
Co-authored-by: tanjh <dtdhinjapan@gmail.com>
Co-authored-by: Louis Maddox <lmmx@users.noreply.github.com>
Co-authored-by: n0099 <n@n0099.net>
Co-authored-by: zhenliang li <37922155+shouyong@users.noreply.github.com>
Co-authored-by: itasli <ilyas.tasli@outlook.fr>
Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com>
Co-authored-by: PeiyuLau <135964669+PeiyuLau@users.noreply.github.com>
Co-authored-by: kerneltravel <kjpioo2006@gmail.com>
Co-authored-by: ToddBear <43341135+ToddBear@users.noreply.github.com>
Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>
Co-authored-by: Shubham Chambhare <59397280+Shubham654@users.noreply.github.com>
Co-authored-by: Shubham Chambhare <shubhamchambhare@zoop.one>
Co-authored-by: andyj <87074272+andyjpaddle@users.noreply.github.com>
2023-10-18 17:37:23 +08:00
Sagar J ce728358d7
Update algorithm_overview_en.md (#10734)
fix: tutorial link
2023-10-18 14:45:27 +08:00
Sagar J 60acb26abf
Update README.md (#10733)
typo error
2023-10-13 10:30:02 +08:00
Sagar J c1134599e7
Update quickstart_en.md (#10732)
typo error
2023-10-13 10:28:34 +08:00
itasli 2213807323
refactored splitTrainVal and added multiOS path support (#11069) 2023-10-13 10:27:26 +08:00
Viktor Bordiuzha d0d77fe273
Fixed a bug where changing "max_text_length" for VisionLAN did not change the length of the output text (#11025) 2023-10-12 11:07:07 +08:00
topduke 3786b27307
add cppd u14m train model and doc (#11052)
* add cppd u14m train model

* add cppd u14m train model and doc
2023-10-11 17:15:01 +08:00
Sagar J f59d0929b0
Update how_to_do_kie_en.md (#10731)
fix: typo error
ID card is mentioned instead of ID No.
2023-10-11 10:08:42 +08:00
xiaoting 799c144ab3
update ppocrv4_framework (#11048) 2023-10-08 10:52:10 +08:00
xiaoting 6abf08a06e
fix import collection for py310 (#11012) 2023-09-27 15:40:16 +08:00
Sagar J 79dabcc9c7
Update kie_datasets_en.md (#10735)
fix: typo error correction
2023-09-27 11:14:33 +08:00
zhangyubo0722 4ba32bc91c
support eval pre epoch (#11003) 2023-09-26 18:50:42 +08:00
zhangyubo0722 e49e491417
add svtr large model (#10937)
* add svtr large model

* [WIP]add svtr large model
2023-09-26 14:38:29 +08:00
Nyakku Shigure 2751cb3a11
fix `cls_x` and `bbox_x` is possibly unbound (#10991) 2023-09-25 17:15:09 +08:00
topduke 8a52c99ad8
[New] add rec CPPD model (#10990)
* fix gris_sample data type bug when use fp16

* fix gris_sample data type bug when use fp16

* fix v4rec batchsize

* fix bug of hang when multi gpus training(sampler)

* add rec algorithm cppd

* delete cppd useless code

* update cppd bug

* add rec algorithm cppd

* update cppd trainedmodel url

* add cppd en doc
2023-09-25 15:43:45 +08:00
Gmgge 2fdac319b1
fix:修复通道数不匹配造成的PPOCRLabel启动失败问题#10748,根据更新日志发现#10655,由于paddleocr中增加了对透明色的需求,由于在check_img中以及存在对单通道图像转三通道图像的处理,因此将该四通道图像处理成三通道的图像也放入该check_img,并统一三通道图像输出逻辑。 (#10847)
add:由于函数复杂度增高,增加注释。
2023-09-21 14:51:32 +08:00
xlg-go ebc67db25b
rec_r45_abinet for export model (#10892)
* When exporting the inference model for ABINet, adapt to the 'image_shape' of ABINetRecResizeImg.

* restore h
2023-09-21 14:50:58 +08:00
xiaoting 5d68082518
rm fluid for paddle dev (#10931) 2023-09-19 10:16:35 +08:00
ToddBear 75d16610f4
Add new recognition method "ParseQ" (#10836)
* Update PP-OCRv4_introduction.md

* Update PP-OCRv4_introduction.md (#10616)

* Update PP-OCRv4_introduction.md

* Update PP-OCRv4_introduction.md

* Update PP-OCRv4_introduction.md

* Update README.md

* Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:Release/2.7 (#10655)

* Don't break overall processing on a bad image

* Add preprocessing common to OCR tasks
Add preprocessing to options

* Update requirements.txt (#10656)

added missing pyyaml library

* [TIPC]update xpu tipc script (#10658)

* fix-typo (#10642)

Co-authored-by: Dennis <dvorst@users.noreply.github.com>
Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com>

* 修改数据增强导致的DSR报错 (#10662) (#10681)

* 修改数据增强导致的DSR报错

* 错误修改回滚

* Update algorithm_overview_en.md (#10670)

Fixed simple spelling errors.

* Implement recoginition method ParseQ

* Document update for new recognition method ParseQ

* add prediction for parseq

* Update rec_vit_parseq.yml

* Update rec_r31_sar.yml

* Update rec_r31_sar.yml

* Update rec_r50_fpn_srn.yml

* Update rec_vit_parseq.py

* Update rec_vit_parseq.yml

* Update rec_parseq_head.py

* Update rec_img_aug.py

* Update rec_vit_parseq.yml

* Update __init__.py

* Update predict_rec.py

* Update paddleocr.py

* Update requirements.txt

* Update utility.py

* Update utility.py

---------

Co-authored-by: xiaoting <31891223+tink2123@users.noreply.github.com>
Co-authored-by: topduke <784990967@qq.com>
Co-authored-by: dyning <dyning.2003@163.com>
Co-authored-by: UserUnknownFactor <63057995+UserUnknownFactor@users.noreply.github.com>
Co-authored-by: itasli <ilyas.tasli@outlook.fr>
Co-authored-by: Kai Song <50285351+USTCKAY@users.noreply.github.com>
Co-authored-by: dvorst <87502756+dvorst@users.noreply.github.com>
Co-authored-by: Dennis <dvorst@users.noreply.github.com>
Co-authored-by: shiyutang <34859558+shiyutang@users.noreply.github.com>
Co-authored-by: Dec20B <1192152456@qq.com>
Co-authored-by: ncoffman <51147417+ncoffman@users.noreply.github.com>
2023-09-07 16:36:47 +08:00