47 Commits

Author SHA1 Message Date
ztyf
269e5b8f37
1. Add latex_ocr formula recognition to the ppstructure pipeline; 2. Add PDF-to-Markdown conversion (#13868)
* Add formula recognition in ppstructure,Convert PDF to markdown file

* Fix bug in converting to doc in formula recognition

* modify time

* Correct spelling errors in args_formula
2024-09-29 10:10:10 +08:00
Wang Xin
446f1cffbd
fix bug when layout_predictor is None (#13279) 2024-07-06 19:14:08 +08:00
myhloli
4f54aa61c6
add layout score return (#13068) 2024-06-14 13:06:34 +08:00
张春乔
b5eedf727e
[OCR Issue No.9] Remove dependencies that clearly do not belong in the ppocr dependencies (#11946)
* modify requestions

* Update requirements.txt

* Update requirements.txt

* try import pdfconvert

* try import lxml

* try import lxml

* try import premailer

* try import openpyxl

* Apply suggestions from code review
2024-04-26 16:54:49 +08:00
Wang Xin
045e5f6ac7
add pre-commit workflow (#11973)
* add pre-commit workflow

* run 'pre-commit run --all-files'

* setup python version
2024-04-21 21:46:20 +08:00
Luo Peng
667fda88ed
Enhance StructureSystem to achieve higher OCR recognition accuracy (#11916)
Closes #10270 and #11665.
2024-04-16 10:08:13 +08:00
ToddBear
b3f9f681d9
CV suite development campaign - text recognition returns per-character coordinates (#10515) (#10537)
* modification of return word box

* update_implements

* Update rec_postprocess.py

* Update utility.py
2023-08-10 15:12:01 +08:00
user1018
f68813eb2a
optimize recovery (#8346)
* optimize recovery

* update
2022-11-17 16:18:05 +08:00
an1018
99698aed54 add_pdf2docx_api 2022-10-12 21:28:48 +08:00
WenmuZhou
d4a4c07c56 add ser to ppstructure system 2022-10-10 06:31:44 +00:00
WenmuZhou
c647a6da28 add re to ppstructure system 2022-10-10 05:39:41 +00:00
WenmuZhou
344c6d2d14 Merge branch 'dygraph' of https://github.com/PaddlePaddle/PaddleOCR into re1 2022-09-22 10:58:18 +08:00
WenmuZhou
de36384a66 fix angle 0 image rotate error 2022-09-22 09:32:19 +08:00
user1018
03d881685a
update code_doc (#7667)
* update code_doc

* update code_doc
2022-09-21 19:53:00 +08:00
an1018
9c424ff164 update doc 2022-08-23 23:28:49 +08:00
an1018
d5d78b486b update doc 2022-08-23 16:11:18 +08:00
littletomatodonkey
8c7c45420f
polish kie doc and code (#7255)
* add fapiao kie

* fix readme

* fix fanli

* add readme

* add how to do kie en

* add algo kie

* add algo overview en

* rename vqa to kie

* fix read gif
2022-08-21 10:55:49 +08:00
user1018
b7d99acd2e
update recovery (#7259)
* update recovery

* update recovery

* update recovery

* update recovery

* update recovery
2022-08-19 20:15:37 +08:00
WenmuZhou
73c77ff79d add image_orientation and update quickstart 2022-08-11 10:56:19 +00:00
WenmuZhou
73ca6c2e7f add PP-Structurev2 to hubserving 2022-08-10 14:15:52 +00:00
WenmuZhou
ddaa2c2552 add SLANet 2022-08-08 06:50:27 +00:00
an1018
2a9f27887c update 2022-07-14 18:09:43 +08:00
an1018
7e5e95d624 add recovery 2022-05-07 16:55:20 +08:00
WenmuZhou
bbff7c55c1 The whl package supports separate table recognition and layout analysis 2022-04-22 13:24:45 +08:00
WenmuZhou
4c5d266d84 add PP-Structure to hub serving 2022-03-30 14:15:12 +00:00
WenmuZhou
b3fbfc3109 rename OCRSystem to StructureSystem 2022-03-30 09:24:24 +00:00
WenmuZhou
340a8c24d0 add help 2022-03-11 00:09:27 +00:00
WenmuZhou
087022a120 update whl to 2.4 2022-01-06 10:55:46 +00:00
WenmuZhou
e16ae81e15 add vqa_ser to ppstructure predict pipeline 2021-12-13 07:38:05 +00:00
WenmuZhou
f2afa214e9 opt_doc and make layout_path_model Configurable 2021-08-03 14:47:55 +08:00
WenmuZhou
6c19d15a57 save figure 2021-08-02 17:22:31 +08:00
WenmuZhou
4030fd1e82 merge paddlestructure whl to paddleocr whl 2021-08-02 15:28:07 +08:00
WenmuZhou
445312179f remove style char 2021-08-01 15:10:40 +08:00
WenmuZhou
aeefefde91 rename test1 2 ppstructure 2021-07-27 19:18:55 +08:00
WenmuZhou
b2260182e4 rename folder 2021-06-15 15:47:57 +08:00
WenmuZhou
5cce398ebb fix structure pipeline infer bug 2021-06-10 17:12:14 +08:00
WenmuZhou
037e17fc82 merge dygraph 2021-06-10 14:24:59 +08:00
zhoujun
3302a0b1ad
Revert "add table eval and predict script" (#3062) 2021-06-09 19:06:44 +08:00
WenmuZhou
0ee6137d75 del pad 2021-06-08 15:28:40 +08:00
WenmuZhou
d27360a9b8 move draw_result to utilitu.py 2021-06-08 14:16:32 +08:00
WenmuZhou
dec76eb75d add pad for small image in det 2021-06-08 10:52:47 +08:00
WenmuZhou
48eba02894 opt output 2021-06-07 14:45:25 +08:00
WenmuZhou
59671466e3 opt output 2021-06-07 13:52:52 +08:00
WenmuZhou
bc0d766425 init commit for paddlestructure 2021-06-05 22:00:17 +08:00
WenmuZhou
20466055b2 add save_dir to args 2021-06-03 20:09:05 +08:00
WenmuZhou
794362481e add table eval and predict script 2021-06-03 16:39:31 +08:00
root
d721519335 add ppstructure dir 2021-06-01 11:48:32 +00:00