fast-reid/tools/deploy
Caffe/
test_data/
README.md
caffe_export.py
caffe_inference.py
onnx_export.py
onnx_inference.py
pytorch_to_caffe.py
run_export.sh
trt_export.py
trt_inference.py

README.md

Model Deployment

This directory contains:

  1. The scripts that convert a fastreid model to Caffe/ONNX/TRT format.

  2. Examples that load an R50 baseline model in Caffe/ONNX/TRT and run inference.

Tutorial

Caffe Convert

A step-by-step pipeline for Caffe conversion

This is a tiny example of converting the fastreid baseline in meta_arch to a Caffe model. If you want to convert a more complex architecture, you will need to customize more things.

  1. Run caffe_export.py to get the converted Caffe model,

    python caffe_export.py --config-file root-path/market1501/bagtricks_R50/config.yml --name "baseline_R50" --output outputs/caffe_model --opts MODEL.WEIGHTS root-path/logs/market1501/bagtricks_R50/model_final.pth
    

    then you can check the Caffe model and prototxt in outputs/caffe_model.
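
    Under the hood, caffe_export.py traces the model with the pytorch_to_caffe tool bundled in the Caffe/ directory. A minimal sketch of that flow (the torchvision backbone here is a stand-in; the real script builds the fastreid model from --config-file and loads MODEL.WEIGHTS):

      import torch
      from torchvision.models import resnet50

      import pytorch_to_caffe  # the tracer bundled in tools/deploy/Caffe

      # Stand-in network; caffe_export.py builds the fastreid model instead.
      model = resnet50(pretrained=True).eval()

      # Trace with a dummy input matching the reid input size (H=256, W=128).
      inputs = torch.randn(1, 3, 256, 128)
      pytorch_to_caffe.trans_net(model, inputs, "baseline_R50")
      pytorch_to_caffe.save_prototxt("outputs/caffe_model/baseline_R50.prototxt")
      pytorch_to_caffe.save_caffemodel("outputs/caffe_model/baseline_R50.caffemodel")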

  2. Change the prototxt following the next three steps:

    1. Edit the max pooling layer in baseline_R50.prototxt like this:

      layer {
          name: "max_pool1"
          type: "Pooling"
          bottom: "relu_blob1"
          top: "max_pool_blob1"
          pooling_param {
              pool: MAX
              kernel_size: 3
              stride: 2
              pad: 0 # was: 1
              # ceil_mode: false  (delete this line if your Caffe does not support it)
          }
      }
      
    2. Add an average pooling layer in the right place in baseline_R50.prototxt:

      layer {
          name: "avgpool1"
          type: "Pooling"
          bottom: "relu_blob49"
          top: "avgpool_blob1"
          pooling_param {
              pool: AVE
              global_pooling: true
          }
      }
      
    3. Change the top name of the last layer to output:

      layer {
          name: "bn_scale54"
          type: "Scale"
          bottom: "batch_norm_blob54"
          top: "output" # bn_norm_blob54
          scale_param {
              bias_term: true
          }
      }
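
    If you prefer to script these edits, below is a rough text-level patcher for steps 2.1 and 2.3 (layer and blob names are taken from the example above and may differ in your prototxt; adding the average pooling layer from step 2.2 is easiest to do by hand). This is a convenience sketch, not part of the repo:

      from pathlib import Path

      path = Path("outputs/caffe_model/baseline_R50.prototxt")
      out, in_max_pool = [], False
      for line in path.read_text().splitlines():
          if 'name: "max_pool1"' in line:
              in_max_pool = True                       # entering the step 2.1 layer
          if in_max_pool and "pad: 1" in line:
              line = line.replace("pad: 1", "pad: 0")  # step 2.1: pad 1 -> 0
              in_max_pool = False
          if "ceil_mode" in line:
              continue                                 # step 2.1: drop ceil_mode
          line = line.replace('"bn_norm_blob54"', '"output"')  # step 2.3: rename top
          out.append(line)
      path.write_text("\n".join(out) + "\n")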
      
  3. (optional) You can open Netscope, then enter your network prototxt to visualize the network.

  4. Run caffe_inference.py to save the Caffe model features for the input images,

     python caffe_inference.py --model-def outputs/caffe_model/baseline_R50.prototxt \
     --model-weights outputs/caffe_model/baseline_R50.caffemodel \
     --input test_data/*.jpg --output caffe_output
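
    If you want to script the inference yourself, a minimal pycaffe sketch (the test image name is hypothetical, and the real preprocessing, e.g. channel order and normalization, lives in caffe_inference.py):

      import os

      import caffe
      import cv2
      import numpy as np

      net = caffe.Net("outputs/caffe_model/baseline_R50.prototxt",
                      "outputs/caffe_model/baseline_R50.caffemodel",
                      caffe.TEST)

      img = cv2.imread("test_data/query.jpg")  # hypothetical image name
      img = cv2.resize(img, (128, 256))        # cv2.resize takes (W, H)
      blob = img.transpose(2, 0, 1)[None].astype(np.float32)  # HWC -> NCHW

      net.blobs[net.inputs[0]].data[...] = blob
      feat = net.forward()["output"]           # "output" was set in step 2.3

      os.makedirs("caffe_output", exist_ok=True)
      np.save("caffe_output/feat.npy", feat)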
    
  5. Run demo/demo.py to get the fastreid model features for the same input images, then verify that Caffe and PyTorch compute the same values for the network.

    np.testing.assert_allclose(torch_out, caffe_out, rtol=1e-3, atol=1e-6)
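
    Concretely, if both runs dump their features as .npy files (the paths here are hypothetical), the check can be run as:

      import numpy as np

      torch_out = np.load("demo_output/feat.npy")   # dumped by demo/demo.py
      caffe_out = np.load("caffe_output/feat.npy")  # saved by caffe_inference.py
      np.testing.assert_allclose(torch_out, caffe_out, rtol=1e-3, atol=1e-6)
      print("Caffe and PyTorch features match within tolerance")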
    

ONNX Convert

A step-by-step pipeline for ONNX conversion

This is a tiny example of converting the fastreid baseline in meta_arch to an ONNX model. As far as I know, ONNX supports most PyTorch operators; if some operators are not supported, you will need to customize them.

  1. Run onnx_export.py to get the converted ONNX model,

    python onnx_export.py --config-file root-path/bagtricks_R50/config.yml --name "baseline_R50" --output outputs/onnx_model --opts MODEL.WEIGHTS root-path/logs/market1501/bagtricks_R50/model_final.pth
    

    then you can check the ONNX model in outputs/onnx_model.
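
    A standard ONNX export boils down to a torch.onnx.export call. A minimal sketch (the torchvision backbone is a stand-in for the fastreid model, and the opset version is an assumption):

      import os

      import torch
      from torchvision.models import resnet50

      model = resnet50(pretrained=True).eval()  # stand-in for the fastreid baseline
      dummy = torch.randn(1, 3, 256, 128)       # N, C, H, W

      os.makedirs("outputs/onnx_model", exist_ok=True)
      torch.onnx.export(
          model, dummy, "outputs/onnx_model/baseline_R50.onnx",
          input_names=["input"], output_names=["output"],
          opset_version=10,  # assumption: pick an opset your runtime supports
      )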

  2. (optional) You can use Netron to visualize the network.

  3. Run onnx_inference.py to save the ONNX model features for the input images,

     python onnx_inference.py --model-path outputs/onnx_model/baseline_R50.onnx \
     --input test_data/*.jpg --output onnx_output
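
    Equivalently, with onnxruntime directly (image name and preprocessing are assumptions; see onnx_inference.py for the exact pipeline):

      import os

      import cv2
      import numpy as np
      import onnxruntime as ort

      sess = ort.InferenceSession("outputs/onnx_model/baseline_R50.onnx")
      input_name = sess.get_inputs()[0].name

      img = cv2.imread("test_data/query.jpg")  # hypothetical image name
      blob = cv2.resize(img, (128, 256)).transpose(2, 0, 1)[None].astype(np.float32)

      ort_out = sess.run(None, {input_name: blob})[0]
      os.makedirs("onnx_output", exist_ok=True)
      np.save("onnx_output/feat.npy", ort_out)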
    
  4. Run demo/demo.py to get the fastreid model features for the same input images, then verify that ONNX Runtime and PyTorch compute the same values for the network.

    np.testing.assert_allclose(torch_out, ort_out, rtol=1e-3, atol=1e-6)
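
    For a self-contained round-trip check without saved files (this assumes the ONNX file was exported from the same stand-in model as in the step 1 sketch):

      import numpy as np
      import onnxruntime as ort
      import torch
      from torchvision.models import resnet50

      model = resnet50(pretrained=True).eval()
      x = torch.randn(1, 3, 256, 128)
      with torch.no_grad():
          torch_out = model(x).numpy()

      sess = ort.InferenceSession("outputs/onnx_model/baseline_R50.onnx")
      ort_out = sess.run(None, {sess.get_inputs()[0].name: x.numpy()})[0]
      np.testing.assert_allclose(torch_out, ort_out, rtol=1e-3, atol=1e-6)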
    

TensorRT Convert

A step-by-step pipeline for TensorRT conversion

This is a tiny example of converting the fastreid baseline in meta_arch to a TRT model. We use tiny-tensorrt, a simple and easy-to-use NVIDIA TensorRT wrapper, to convert the model to TensorRT.

First, convert the PyTorch model to ONNX format following ONNX Convert, and remember your output name. Then you can convert the ONNX model to TensorRT following the instructions below.

  1. Run the command below to get the converted TRT model from the ONNX model,

    
    python trt_export.py --name "baseline_R50" --output outputs/trt_model --onnx-model outputs/onnx_model/baseline.onnx --height 256 --width 128
    

    then you can check the TRT model in outputs/trt_model.
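
    trt_export.py drives the conversion through tiny-tensorrt; for reference, a rough equivalent using the TensorRT Python API directly (TensorRT 7-era calls, which is an assumption about your TRT version, not what trt_export.py actually does):

      import tensorrt as trt

      logger = trt.Logger(trt.Logger.WARNING)
      builder = trt.Builder(logger)
      flags = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
      network = builder.create_network(flags)
      parser = trt.OnnxParser(network, logger)

      with open("outputs/onnx_model/baseline.onnx", "rb") as f:
          if not parser.parse(f.read()):
              for i in range(parser.num_errors):
                  print(parser.get_error(i))
              raise RuntimeError("failed to parse the ONNX model")

      config = builder.create_builder_config()
      config.max_workspace_size = 1 << 30  # 1 GiB build workspace
      engine = builder.build_engine(network, config)

      with open("outputs/trt_model/baseline.engine", "wb") as f:
          f.write(engine.serialize())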

  2. Run trt_inference.py to save the TRT model features for the input images,

     python trt_inference.py --model-path outputs/trt_model/baseline.engine \
     --input test_data/*.jpg --output trt_output --output-name trt_model_outputname
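
    trt_inference.py goes through tiny-tensorrt as well; for reference, a raw TensorRT + pycuda sketch (binding order, shapes, and the random stand-in input are assumptions):

      import os

      import numpy as np
      import pycuda.autoinit  # noqa: F401  initializes a CUDA context
      import pycuda.driver as cuda
      import tensorrt as trt

      logger = trt.Logger(trt.Logger.WARNING)
      with open("outputs/trt_model/baseline.engine", "rb") as f:
          engine = trt.Runtime(logger).deserialize_cuda_engine(f.read())
      context = engine.create_execution_context()

      inp = np.random.rand(1, 3, 256, 128).astype(np.float32)  # stand-in batch
      out = np.empty(tuple(engine.get_binding_shape(1)), dtype=np.float32)
      d_inp = cuda.mem_alloc(inp.nbytes)
      d_out = cuda.mem_alloc(out.nbytes)

      cuda.memcpy_htod(d_inp, inp)
      context.execute_v2([int(d_inp), int(d_out)])
      cuda.memcpy_dtoh(out, d_out)

      os.makedirs("trt_output", exist_ok=True)
      np.save("trt_output/feat.npy", out)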
    
  3. Run demo/demo.py to get the fastreid model features for the same input images, then verify that TensorRT and PyTorch compute the same values for the network.

    np.testing.assert_allclose(torch_out, trt_out, rtol=1e-3, atol=1e-6)
    

Acknowledgements

Thanks to CPFLAME, gcong18, YuxiangJohn and wiggin66 of the JDAI Model Acceleration Group for their help with the PyTorch model conversion.