From 22123a52d7d7e40bdf8a7e15e04860b6c9a02101 Mon Sep 17 00:00:00 2001
From: WenmuZhou <572459439@qq.com>
Date: Tue, 10 May 2022 10:36:57 +0800
Subject: [PATCH 1/2] update PP-Structure en doc, test=document_fix
---
ppstructure/README.md | 8 ++++----
1 file changed, 4 insertions(+), 4 deletions(-)
diff --git a/ppstructure/README.md b/ppstructure/README.md
index 0febf233d..72670e335 100644
--- a/ppstructure/README.md
+++ b/ppstructure/README.md
@@ -40,7 +40,7 @@ The main features of PP-Structure are as follows:
### 4.1 Layout analysis and table recognition
-
+
The figure shows the pipeline of layout analysis + table recognition. The image is first divided into four areas of image, text, title and table by layout analysis, and then OCR detection and recognition is performed on the three areas of image, text and title, and the table is performed table recognition, where the image will also be stored for use.
@@ -48,7 +48,7 @@ The figure shows the pipeline of layout analysis + table recognition. The image
* SER
*
- | 
+ | 
---|---
Different colored boxes in the figure represent different categories. For xfun dataset, there are three categories: query, answer and header:
@@ -62,7 +62,7 @@ The corresponding category and OCR recognition results are also marked at the to
* RE
- | 
+ | 
---|---
@@ -76,7 +76,7 @@ Start from [Quick Installation](./docs/quickstart.md)
### 6.1 Layout analysis and table recognition
-
+
In PP-Structure, the image will be divided into 5 types of areas **text, title, image list and table**. For the first 4 types of areas, directly use PP-OCR system to complete the text detection and recognition. For the table area, after the table structuring process, the table in image is converted into an Excel file with the same table style.
From f31980266a73f80b09c693f2d626c004252a0a7a Mon Sep 17 00:00:00 2001
From: WenmuZhou <572459439@qq.com>
Date: Tue, 10 May 2022 14:28:02 +0800
Subject: [PATCH 2/2] merge upstream
---
paddleocr.py | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/paddleocr.py b/paddleocr.py
index 9eff2d0f3..a1265f79d 100644
--- a/paddleocr.py
+++ b/paddleocr.py
@@ -47,7 +47,7 @@ __all__ = [
]
SUPPORT_DET_MODEL = ['DB']
-VERSION = '2.5.0.2'
+VERSION = '2.5.0.3'
SUPPORT_REC_MODEL = ['CRNN', 'SVTR_LCNet']
BASE_DIR = os.path.expanduser("~/.paddleocr/")