docs: Fix formatting (#14891)
* docs: Fix formatting * Fix typo * Fix translation * Fix formatting * Fix formattingpull/15006/head
parent
715b1d9aa4
commit
332d9d5112
|
@ -5,7 +5,7 @@ comments: true
|
|||
|
||||
# OCR datasets
|
||||
|
||||
Here is a list of public datasets commonly used in OCR, which are being continuously updated. Welcome to contribute datasets~
|
||||
Here is a list of public datasets commonly used in OCR, which are being continuously updated. Welcome to contribute datasets!
|
||||
|
||||
## 1. Text detection
|
||||
|
||||
|
|
|
@ -34,10 +34,10 @@ Take rec_chinese_lite_train_v2.0.yml as an example
|
|||
| checkpoints | set model parameter path | None | Used to load parameters after interruption to continue training|
|
||||
| use_visualdl | Set whether to enable visualdl for visual log display | False | [Tutorial](https://www.paddlepaddle.org.cn/paddle/visualdl) |
|
||||
| use_wandb | Set whether to enable W&B for visual log display | False | [Documentation](https://docs.wandb.ai/)
|
||||
| infer_img | Set inference image path or folder path | ./infer_img | \||
|
||||
| infer_img | Set inference image path or folder path | ./infer_img | \ |
|
||||
| character_dict_path | Set dictionary path | ./ppocr/utils/ppocr_keys_v1.txt | If the character_dict_path is None, model can only recognize number and lower letters |
|
||||
| max_text_length | Set the maximum length of text | 25 | \ |
|
||||
| use_space_char | Set whether to recognize spaces | True | \| |
|
||||
| use_space_char | Set whether to recognize spaces | True | \ |
|
||||
| label_list | Set the angle supported by the direction classifier | ['0','180'] | Only valid in angle classifier model |
|
||||
| save_res_path | Set the save address of the test model results | ./output/det_db/predicts_db.txt | Only valid in the text detection model |
|
||||
|
||||
|
|
|
@ -55,7 +55,7 @@ Optimizer:
|
|||
|
||||
### 2.3 Evaluation Indicators
|
||||
|
||||
(1) Detection stage: First, evaluate according to the IOU of the detection frame and the labeled frame. If the IOU is greater than a certain threshold, it is judged that the detection is accurate. Here, the detection frame and the label frame are different from the general general target detection frame, and they are represented by polygons. Detection accuracy: the percentage of the correct detection frame number in all detection frames is mainly used to judge the detection index. Detection recall rate: the percentage of correct detection frames in all marked frames, which is mainly an indicator of missed detection.
|
||||
(1) Detection stage: First, evaluate according to the IOU of the detection frame and the labeled frame. If the IOU is greater than a certain threshold, it is judged that the detection is accurate. Here, the detection frame and the label frame are different from the general target detection frame, and they are represented by polygons. Detection accuracy: the percentage of the correct detection frame number in all detection frames is mainly used to judge the detection index. Detection recall rate: the percentage of correct detection frames in all marked frames, which is mainly an indicator of missed detection.
|
||||
|
||||
(2) Recognition stage: Character recognition accuracy, that is, the ratio of correctly recognized text lines to the number of marked text lines. Only the entire line of text recognition pairs can be regarded as correct recognition.
|
||||
|
||||
|
|
|
@ -2,9 +2,9 @@
|
|||
comments: true
|
||||
---
|
||||
|
||||
Here we have sorted out some Chinese OCR training and prediction tricks, which are being updated continuously. You are welcome to contribute more OCR tricks ~
|
||||
Here we have sorted out some Chinese OCR training and prediction tricks, which are being updated continuously. You are welcome to contribute more OCR tricks!
|
||||
|
||||
#### 1、Replace Backbone Network
|
||||
#### 1. Replace Backbone Network
|
||||
|
||||
- **Problem Description**
|
||||
|
||||
|
@ -17,7 +17,7 @@ Here we have sorted out some Chinese OCR training and prediction tricks, which a
|
|||
|
||||
- In order to replace the backbone network of text recognition, we need to pay attention to the descending position of network width and height stride. Since the ratio between width and height is large in chinese text recognition, the frequency of height decrease is less and the frequency of width decrease is more. You can refer the [modifies of MobileNetV3](https://github.com/PaddlePaddle/PaddleOCR/blob/develop/ppocr/modeling/backbones/rec_mobilenet_v3.py) in PaddleOCR.
|
||||
|
||||
#### 2、Long Chinese Text Recognition
|
||||
#### 2. Long Chinese Text Recognition
|
||||
|
||||
- **Problem Description**
|
||||
The maximum resolution of Chinese recognition model during training is [3,32,320], if the text image to be recognized is too long, as shown in the figure below, how to adapt?
|
||||
|
@ -50,7 +50,7 @@ Here we have sorted out some Chinese OCR training and prediction tricks, which a
|
|||
return padding_im
|
||||
```
|
||||
|
||||
#### 3、Space Recognition
|
||||
#### 3. Space Recognition
|
||||
|
||||
- **Problem Description**
|
||||
|
||||
|
|
Loading…
Reference in New Issue