Merge pull request #6595 from nbswords/release/2.5
Fix OCR engine + SER command in vqa readme
pull/6616/head
commit 4729304147
@@ -192,7 +192,7 @@ Finally, `precision`, `recall`, `hmean` and other indicators will be printed
Use the following command to complete the series prediction of `OCR engine + SER`, taking the pretrained SER model as an example:
```shell
-CUDA_VISIBLE_DEVICES=0 python3 tools/infer_vqa_token_ser.py -c configs/vqa/ser/layoutxlm.yml -o Architecture.Backbone.checkpoints=pretrain/ser_LayoutXLM_xfun_zh/Global.infer_img=doc/vqa/input/zh_val_42.jpg
+CUDA_VISIBLE_DEVICES=0 python3 tools/infer_vqa_token_ser.py -c configs/vqa/ser/layoutxlm.yml -o Architecture.Backbone.checkpoints=pretrain/ser_LayoutXLM_xfun_zh/ Global.infer_img=doc/vqa/input/zh_val_42.jpg
````
Finally, the prediction result visualization image and the prediction result text file will be saved in the directory configured by the `config.Global.save_res_path` field. The prediction result text file is named `infer_results.txt`.
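
Note on the change: the only difference between the old and new command is the space between `pretrain/ser_LayoutXLM_xfun_zh/` and `Global.infer_img=...`. The sketch below illustrates why that space matters, under the assumption that each whitespace-separated token after `-o` is treated as one `key=value` override. The `parse_overrides` helper is hypothetical and written only for illustration; it is not the parser used by `tools/infer_vqa_token_ser.py`, whose actual behavior may differ.

```python
import argparse
import shlex


def parse_overrides(command_tail):
    """Parse a '-o key=value [key=value ...]' tail into a dict.

    Hypothetical helper: assumes one key=value pair per shell token.
    """
    parser = argparse.ArgumentParser()
    parser.add_argument("-o", "--opt", nargs="+", default=[])
    args = parser.parse_args(shlex.split(command_tail))

    overrides = {}
    for item in args.opt:
        key, sep, value = item.partition("=")
        # A token with no '=' or with a second '=' is not a single pair.
        if not sep or "=" in value:
            raise ValueError(f"malformed override: {item!r}")
        overrides[key] = value
    return overrides


# Command tail after the fix: two separate tokens, two overrides.
fixed = (
    "-o Architecture.Backbone.checkpoints=pretrain/ser_LayoutXLM_xfun_zh/ "
    "Global.infer_img=doc/vqa/input/zh_val_42.jpg"
)
# Command tail before the fix: the missing space fuses both pairs into one token.
broken = (
    "-o Architecture.Backbone.checkpoints=pretrain/ser_LayoutXLM_xfun_zh/"
    "Global.infer_img=doc/vqa/input/zh_val_42.jpg"
)

print(parse_overrides(fixed))
try:
    parse_overrides(broken)
except ValueError as err:
    print(err)
```

With the space in place, the checkpoint directory and the input image arrive as two independent overrides; without it, the shell passes a single token containing two `=` signs, which this sketch rejects.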