# Model Serving
To serve an `MMClassification` model with [`TorchServe`](https://pytorch.org/serve/), follow the steps below:
## 1. Convert model from MMClassification to TorchServe
```shell
python tools/deployment/mmcls2torchserve.py ${CONFIG_FILE} ${CHECKPOINT_FILE} \
--output-folder ${MODEL_STORE} \
--model-name ${MODEL_NAME}
```
```{note}
${MODEL_STORE} needs to be an absolute path to a folder.
```
Example:
```shell
python tools/deployment/mmcls2torchserve.py \
configs/resnet/resnet18_b32x8_imagenet.py \
checkpoints/resnet18_8xb32_in1k_20210831-fbbb1da6.pth \
--output-folder ./checkpoints \
--model-name resnet18_in1k
```
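If the conversion succeeds, a TorchServe model archive should appear in the output folder. As a quick check (a minimal sketch, assuming the example above, where the archive is expected to be named after `--model-name`, i.e. `resnet18_in1k.mar`):

```shell
# List the output folder; a ${MODEL_NAME}.mar archive should be present.
ls ./checkpoints
```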
## 2. Build `mmcls-serve` docker image
```shell
docker build -t mmcls-serve:latest docker/serve/
```
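To confirm the image was built, you can list it by name (assuming the tag used above):

```shell
# Show the locally built mmcls-serve image
docker images mmcls-serve
```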
## 3. Run `mmcls-serve`
Check the official docs for [running TorchServe with docker ](https://github.com/pytorch/serve/blob/master/docker/README.md#running-torchserve-in-a-production-docker-environment ).
To run on GPU, you need to install [nvidia-docker](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html). You can omit the `--gpus` argument to run on CPU instead.
Example:
```shell
docker run --rm \
--cpus 8 \
--gpus device=0 \
-p8080:8080 -p8081:8081 -p8082:8082 \
--mount type=bind,source=`realpath ./checkpoints`,target=/home/model-server/model-store \
mmcls-serve:latest
```
```{note}
`realpath ./checkpoints` resolves to the absolute path of "./checkpoints"; you can replace it with the absolute path of the folder where you store your TorchServe models.
```
[Read the docs](https://github.com/pytorch/serve/blob/master/docs/rest_api.md) about the Inference (8080), Management (8081) and Metrics (8082) APIs.
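For example, you can do a quick health check and list the registered models against the default ports (a sketch assuming the container from the previous step is running on localhost):

```shell
# Health check via the Inference API
curl http://127.0.0.1:8080/ping
# List models registered with the Management API
curl http://127.0.0.1:8081/models
```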
## 4. Test deployment
```shell
curl http://127.0.0.1:8080/predictions/${MODEL_NAME} -T demo/demo.JPEG
```
You should obtain a response similar to:
```json
{
"pred_label": 58,
"pred_score": 0.38102269172668457,
"pred_class": "water snake"
}
```
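If the request fails or hangs, you can check whether the model is registered and its workers are up through the Management API (assuming the default port and the model name from the earlier example):

```shell
# Describe the registered model, including its worker status
curl http://127.0.0.1:8081/models/resnet18_in1k
```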
You can also use `test_torchserver.py` to compare the results from TorchServe and PyTorch, and visualize them.
```shell
python tools/deployment/test_torchserver.py ${IMAGE_FILE} ${CONFIG_FILE} ${CHECKPOINT_FILE} ${MODEL_NAME}
[--inference-addr ${INFERENCE_ADDR}] [--device ${DEVICE}]
```
Example:
```shell
python tools/deployment/test_torchserver.py \
demo/demo.JPEG \
configs/resnet/resnet18_b32x8_imagenet.py \
checkpoints/resnet18_8xb32_in1k_20210831-fbbb1da6.pth \
resnet18_in1k
```