Commit Graph

6291 Commits (397b5641f9326da4f4f08a1a4e8f29ccd81a078c)
 

Author SHA1 Message Date
dyning 397b5641f9
updata 2.9, adding new models and supporting all-in-one full developm… (#13933)
* updata 2.9, adding new models and supporting all-in-one full development tools

* updata 2.9, adding new models and supporting all-in-one full development tools
2024-10-01 06:44:47 +08:00
dyning 7a0a2d8ac0
updata 2.9, adding new models and supporting all-in-one full developm… (#13932)
* updata 2.9, adding new models and supporting all-in-one full development tools

* Update quick_start.md

---------

Co-authored-by: cuicheng01 <45199522+cuicheng01@users.noreply.github.com>
2024-10-01 05:37:42 +08:00
ztyf 269e5b8f37
1.在ppstructure管道中添加latex_ocr公式识别功能;2.添加pdf转markdown文件功能 (#13868)
* Add formula recognition in ppstructure,Convert PDF to markdown file

* Fix bug in converting to doc in formula recognition

* modify time

* Correct spelling errors in args_formula
2024-09-29 10:10:10 +08:00
zhangyubo0722 362103bd0b
fix lateocr bug (#13920) 2024-09-28 19:11:31 +08:00
Wang Xin cda3e1201e
fix hubserving run error (#13918)
* fix hubserving run error

* Update paddleocr.py
2024-09-27 15:58:49 +08:00
zhangyubo0722 2b51369324
support export after save model (#13844) 2024-09-25 01:11:01 +08:00
Wang Xin 3cc4ae9f37
set --image_dir to be required (#13896) 2024-09-23 14:17:44 +08:00
Tingquan Gao f214be5912
fix version (#13895) 2024-09-21 23:03:43 +08:00
Liu Jiaxuan ac5313d0b1
fix bugs for SLANet infer (#13861) 2024-09-13 12:53:09 +08:00
changdazhou 284a20bf10
update pretrain for benchmark (#13820) 2024-09-13 11:05:51 +08:00
Wang Xin 1c8233d5bc
fix SRN algorithm infer error (#13851) 2024-09-11 19:16:30 +08:00
WangZhen 4832bb62ad
Fix pir dy2st train (#13853) 2024-09-11 18:54:37 +08:00
Wang Xin 33d5570249
fix the CI running errors in tests. (#13846) 2024-09-09 17:38:07 +08:00
Wang Xin 5b54ac4606
update kie doc (#13799) 2024-09-02 19:28:02 +08:00
johnlockejrr ada310811a
Add Syriac script support (#13800)
* Add Syriac Language support dictionary

The Syriac Script is a Unicode block containing characters for all forms of the Syriac alphabet, including the Estrangela, Serto, Eastern Syriac, and the Christian Palestinian Aramaic variants. It is used in Literary Syriac, Neo-Aramaic, and Arabic among Syriac-speaking Christians. It was used historically to write Armenian, Persian, Ottoman Turkish, and Malayalam. The script, like Arabic and Hebrew is RTL.

https://en.wikipedia.org/wiki/Syriac_(Unicode_block)
https://en.wikipedia.org/wiki/Syriac_language

* Add Syriac script support for training

The Syriac Script is a Unicode block containing characters for all forms of the Syriac alphabet, including the Estrangela, Serto, Eastern Syriac, and the Christian Palestinian Aramaic variants. It is used in Literary Syriac, Neo-Aramaic, and Arabic among Syriac-speaking Christians. It was used historically to write Armenian, Persian, Ottoman Turkish, and Malayalam. The script, like Arabic and Hebrew is RTL.

https://en.wikipedia.org/wiki/Syriac_(Unicode_block)
https://en.wikipedia.org/wiki/Syriac_language
2024-09-01 20:10:42 +08:00
johnlockejrr 6225a90ef0
Add support for Hebrew Language and Alphabet (#13797)
* Add Hebrew language support for training

https://en.wikipedia.org/wiki/Unicode_and_HTML_for_the_Hebrew_alphabet

* Add Hebrew language dictionary

https://en.wikipedia.org/wiki/Unicode_and_HTML_for_the_Hebrew_alphabet

* Add Samaritan Script dictionary

Samaritan Script is RTL like Arabic and Hebrew, used for Samaritan Hebrew and Aramaic, sometimes has Arabic letters in some texts.

https://en.wikipedia.org/wiki/Samaritan_(Unicode_block)
https://en.wikipedia.org/wiki/Samaritan_Hebrew
https://en.wikipedia.org/wiki/Samaritan_Aramaic_language

* Add Samaritan Script training

Samaritan Script is RTL like Arabic and Hebrew, used for Samaritan Hebrew and Aramaic, sometimes has Arabic letters in some texts.

https://en.wikipedia.org/wiki/Samaritan_(Unicode_block)
https://en.wikipedia.org/wiki/Samaritan_Hebrew
https://en.wikipedia.org/wiki/Samaritan_Aramaic_language

* Update hebrew_dict.txt
2024-09-01 09:18:37 +08:00
topduke 77f4c01f69
Fix doc link in docs (#13792)
* fix configs link

* fix configs link

* fix configs link

* fix configs link
2024-08-30 18:35:25 +08:00
Sunflower7788 aabff3958c
fix setting of make border epoch (#13783) 2024-08-29 22:27:28 +08:00
Wang Xin 759317feb6
docs: Update applications/overview.md (#13763) 2024-08-28 16:38:25 +08:00
Kayzwer 78ddaa44a0
remove unused enumerate (#13760) 2024-08-28 09:10:33 +08:00
achieve_dream e9fca96d4f
rename MKLDNN to OneDNN (#13757)
[Windows Inference Library (version 3.0.0-beta1) - MKL](https://www.paddlepaddle.org.cn/inference/master/guides/install/download_lib.html#windows) renames the mkldnn folder to onednn.
2024-08-27 12:52:00 +08:00
liuhongen1234567 30811fba3c
Repair the bug in the inference script for LaTeX OCR (#13750)
* test

* dataprocess_abspath2relpath

* repair bug in infer/predict_rec for latexocr
2024-08-26 14:21:41 +08:00
Wang Xin ba1c81358f
use setuptools-scm extracts PaddleOCR versions (#13716)
* use setuptools-scm extracts PaddleOCR versions

* fix bug
2024-08-23 17:27:31 +08:00
liuhongen1234567 1752c56cb7
修改LaTeXOCR的数据处理部分,将生成的数据集中的绝对路径改为相对路径 (#13702)
* test

* dataprocess_abspath2relpath
2024-08-20 15:45:57 +08:00
Songling Huang 01e60ff9e1
add vietnamese char dict (#13698) 2024-08-19 22:35:40 +08:00
SWHL 0ca03dde4e
docs: Remove doc/datasets directory and fix docs/datasets documents (#13700) 2024-08-19 22:00:22 +08:00
zhangyubo0722 b12d4fff34
Code Style Unification (#13697) 2024-08-19 17:21:44 +08:00
zhangyubo0722 6986a2a2a2
Remove channel links from documentation (#13674) 2024-08-19 14:41:18 +08:00
jiqirenfeile 8812c07cd4
Update ch_PP-OCRv4_rec_distillation.yml (#13692)
Refactor YAML config to define max_text_length as an anchor for reuse
2024-08-19 08:50:59 +08:00
Songling Huang e22ce35c94
Add files via upload (#13685)
Burmese dictionary expansion
2024-08-18 21:54:43 +08:00
Wang Xin 0b0bfc1be1
cache Python dependencies and PaddleOCR files (#13682) 2024-08-16 16:08:32 +08:00
SWHL 8f7d73ee0d
chore: Update issue template (#13679) 2024-08-16 09:37:07 +08:00
Wang Xin fe564a6d55
docs: Update docs and fix markdown render error (#13678) 2024-08-16 08:21:40 +08:00
Wang Xin 08d35f6d88
docs: Update docs and remove old docs (#13662)
* update docs

* update docs
2024-08-15 18:45:05 +08:00
Tingquan Gao 543a4dd2a0
setuptools 72.2.0 result in that MANIFEST.in is invalid (#13670) 2024-08-15 18:18:16 +08:00
Wang Xin 73b1636650
docs: Update docs and remove out-of-date event (#13660)
* update doc remove out-of-date event

* Update README.md

* Update README_en.md
2024-08-15 08:23:10 +08:00
liuhongen1234567 5f0b90a110
Fix some issues with LaTeXOCR in paddleX (#13646)
* repair_some_Bug_for_paddlex

* style2

* style2

* add_epilson_for groupnorm
2024-08-14 11:30:25 +08:00
Wang Xin 6dc021115c
disable automatic checks for new version albumentations (#13583) 2024-08-07 07:00:05 +08:00
changdazhou 20de659502
fix download bug when use multi gpus (#13610) 2024-08-06 21:15:52 +08:00
SWHL d7ea48ea92
docs: Add android_demo docs (#13601)
* docs: Shorten the image path

* docs: Remove duplicate images

* docs: Fix docs errors

* docs: Add android_demo docs
2024-08-06 10:21:57 +08:00
Wang Xin 09b7f6cf8b
ci: Skip text files when running test ci on push (#13582) 2024-08-05 21:26:55 +08:00
SWHL d590a8d39a
docs: Fix docs errors (#13588)
* docs: Shorten the image path

* docs: Remove duplicate images

* docs: Fix docs errors
2024-08-04 18:02:01 +08:00
SWHL 74529b2a29
docs: Shorten the image path and remove dupliate images (#13585)
* docs: Shorten the image path

* docs: Remove duplicate images
2024-08-03 23:44:19 +08:00
Wang Xin 9a90c8ff33
update dive into OCR book link (#13581) 2024-08-02 19:42:56 +08:00
Wang Xin 28c7be63ec
improve the reading experience of some documents (#13562) 2024-08-02 19:41:36 +08:00
changdazhou b6211b936b
support benchmark for paddlepaddle3.0 (#13574) 2024-08-02 19:24:40 +08:00
Gmgge d69bf81907
docs: Update the pdf file path in the operation demonstration (#13575) 2024-08-02 17:09:02 +08:00
changdazhou 9c19e6dffe
fix bug for paddlepaddle3.0 (#13568) 2024-08-01 22:50:44 +08:00
Wang Xin 6c9bae667b
skip text files when running test ci (#13561)
* skip text files when running test ci

* fix bug
2024-08-01 11:45:32 +08:00
SWHL 5de9392333
chore(Issue_template): Add validation of Environment and MPE code (#13559) 2024-08-01 09:37:02 +08:00