mirror of https://github.com/open-mmlab/mmocr.git
[Docs] TorchServe docs (#539)
* Add torchserve docs
* polish contents and fix the visualization bug in torchserve testing
* polish content
parent 5671939bbc
commit 027cb7aac5

@@ -13,6 +13,7 @@ You can switch between English and Chinese in the lower-left corner of the layout
   training.md
   testing.md
   deployment.md
   model_serving.md

.. toctree::
   :maxdepth: 2

@@ -0,0 +1,180 @@
# Model Serving

`MMOCR` provides some utilities that facilitate the model serving process.
Here is a quick walkthrough of the steps necessary to serve models through an API.

## Install TorchServe

You can follow the steps on the [official website](https://github.com/pytorch/serve#install-torchserve-and-torch-model-archiver) to install `TorchServe` and
`torch-model-archiver`.
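
For a typical pip-based environment, the installation usually boils down to the following (a minimal sketch; refer to the official instructions above for the authoritative steps):

```shell
pip install torchserve torch-model-archiver
```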

## Convert model from MMOCR to TorchServe

We provide a handy tool to convert any `.pth` model into a `.mar` model
for TorchServe.

```shell
python tools/deployment/mmocr2torchserve.py ${CONFIG_FILE} ${CHECKPOINT_FILE} \
    --output-folder ${MODEL_STORE} \
    --model-name ${MODEL_NAME}
```

:::{note}
`${MODEL_STORE}` needs to be an absolute path to a folder.
:::

For example:

```shell
python tools/deployment/mmocr2torchserve.py \
    configs/textdet/dbnet/dbnet_r18_fpnc_1200e_icdar2015.py \
    checkpoints/dbnet_r18_fpnc_1200e_icdar2015.pth \
    --output-folder ./checkpoints \
    --model-name dbnet
```
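
If the conversion succeeds, a `dbnet.mar` archive should appear in the output folder of the example above (an optional check, not a required step):

```shell
ls ./checkpoints
```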

## Start Serving

### From your Local Machine

With your models prepared, the next step is to start the service with a one-line command:

```bash
# To load all the models in ./checkpoints
torchserve --start --model-store ./checkpoints --models all
# Or, if you only want to serve one model, say dbnet
torchserve --start --model-store ./checkpoints --models dbnet=dbnet.mar
```
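
When you are done, the server can be shut down with:

```bash
torchserve --stop
```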

Then you can access inference, management and metrics services
through TorchServe's REST API.
You can find their usages in [TorchServe REST API](https://github.com/pytorch/serve/blob/master/docs/rest_api.md).

| Service    | Address                 |
| ---------- | ----------------------- |
| Inference  | `http://127.0.0.1:8080` |
| Management | `http://127.0.0.1:8081` |
| Metrics    | `http://127.0.0.1:8082` |
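
As a quick sanity check, you can ping the inference service and list the registered models through TorchServe's standard endpoints:

```bash
curl http://127.0.0.1:8080/ping
curl http://127.0.0.1:8081/models
```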

:::{note}
By default, TorchServe binds ports `8080`, `8081` and `8082` to its services.
You can change this behavior by saving the contents below to `config.properties` and running TorchServe with the option `--ts-config config.properties`.

```bash
inference_address=http://0.0.0.0:8080
management_address=http://0.0.0.0:8081
metrics_address=http://0.0.0.0:8082
number_of_netty_threads=32
job_queue_size=1000
model_store=/home/model-server/model-store
```

:::
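
For instance, assuming `model_store` in `config.properties` points at your local checkpoint folder, the server can then be started with:

```bash
torchserve --start --ts-config config.properties --models all
```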

### From Docker

A better alternative for serving your models is through Docker. We provide a Dockerfile
that frees you from the tedious and error-prone environment setup steps.

#### Build `mmocr-serve` Docker image

```shell
docker build -t mmocr-serve:latest docker/serve/
```

#### Run `mmocr-serve` with Docker

In order to run Docker with GPUs, you need to install [nvidia-docker](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html); alternatively, you can omit the `--gpus` argument for a CPU-only session.

The command below runs `mmocr-serve` with a GPU, binds ports `8080` (inference),
`8081` (management) and `8082` (metrics) from the container to `127.0.0.1` on the host, and mounts
the checkpoint folder `./checkpoints` from the host machine to `/home/model-server/model-store`
in the container. For more information, please check the official docs on [running TorchServe with docker](https://github.com/pytorch/serve/blob/master/docker/README.md#running-torchserve-in-a-production-docker-environment).

```shell
docker run --rm \
  --cpus 8 \
  --gpus device=0 \
  -p8080:8080 -p8081:8081 -p8082:8082 \
  --mount type=bind,source=`realpath ./checkpoints`,target=/home/model-server/model-store \
  mmocr-serve:latest
```

:::{note}
`realpath ./checkpoints` points to the absolute path of `./checkpoints`; you can replace it with the absolute path to wherever you store your TorchServe models.
:::
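
For a CPU-only session, as mentioned above, the same command works with the `--gpus` flag omitted:

```shell
docker run --rm \
  --cpus 8 \
  -p8080:8080 -p8081:8081 -p8082:8082 \
  --mount type=bind,source=`realpath ./checkpoints`,target=/home/model-server/model-store \
  mmocr-serve:latest
```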

Once the container is running, you can access inference, management and metrics services
through TorchServe's REST API.
You can find their usages in [TorchServe REST API](https://github.com/pytorch/serve/blob/master/docs/rest_api.md).

| Service    | Address                 |
| ---------- | ----------------------- |
| Inference  | `http://127.0.0.1:8080` |
| Management | `http://127.0.0.1:8081` |
| Metrics    | `http://127.0.0.1:8082` |

## Test deployment

The Inference API allows users to post an image to a model and returns the prediction result.

```shell
curl http://127.0.0.1:8080/predictions/${MODEL_NAME} -T demo/demo_text_det.jpg
```

For example,

```shell
curl http://127.0.0.1:8080/predictions/dbnet -T demo/demo_text_det.jpg
```

For detection models, you should obtain a JSON response with an object named `boundary_result`. Each array inside contains
float numbers representing the x, y coordinates of the boundary vertices in clockwise order, with the last float number being the
confidence score.

```json
{
  "boundary_result": [
    [
      221.18990004062653,
      226.875,
      221.18990004062653,
      212.625,
      244.05868631601334,
      212.625,
      244.05868631601334,
      226.875,
      0.80883354575186
    ]
  ]
}
```
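
If `jq` happens to be installed, you can pull the confidence score of the first detected boundary straight out of the response (purely a convenience, not required by the API):

```shell
curl -s http://127.0.0.1:8080/predictions/dbnet -T demo/demo_text_det.jpg | jq '.boundary_result[0][-1]'
```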

For recognition models, the response should look like:

```json
{
  "text": "sier",
  "score": 0.5247521847486496
}
```
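
For instance, assuming you have also archived a recognition model under the (hypothetical) name `crnn`, the request would look like:

```shell
curl http://127.0.0.1:8080/predictions/crnn -T demo/demo_text_recog.jpg
```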

You can also use `test_torchserve.py` to compare the results of TorchServe and PyTorch by visualizing them.

```shell
python tools/deployment/test_torchserve.py ${IMAGE_FILE} ${CONFIG_FILE} ${CHECKPOINT_FILE} ${MODEL_NAME}
[--inference-addr ${INFERENCE_ADDR}] [--device ${DEVICE}]
```

Example:

```shell
python tools/deployment/test_torchserve.py \
    demo/demo_text_det.jpg \
    configs/textdet/dbnet/dbnet_r18_fpnc_1200e_icdar2015.py \
    checkpoints/dbnet_r18_fpnc_1200e_icdar2015.pth \
    dbnet
```

@@ -34,6 +34,7 @@ def main(args):
        args.img,
        model_results,
        win_name='model_results',
        show=True,
        score_thr=args.score_thr)
    url = 'http://' + args.inference_addr + '/predictions/' + args.model_name
    with open(args.img, 'rb') as image:

@@ -42,6 +43,7 @@ def main(args):
    model.show_result(
        args.img,
        serve_results,
        show=True,
        win_name='serve_results',
        score_thr=args.score_thr)
    assert serve_results.keys() == model_results.keys()