ONNX Runtime Deployment

DeprecationWarning

ONNX support will be deprecated in a future release. Please use MMDeploy, the unified model deployment toolbox, instead: https://github.com/open-mmlab/mmdeploy

Introduction of ONNX Runtime

ONNX Runtime is a cross-platform inference and training accelerator compatible with many popular ML/DNN frameworks. See its GitHub repository for more information.

Introduction of ONNX

ONNX stands for Open Neural Network Exchange. It serves as an intermediate representation (IR) for ML/DNN models from many frameworks. See its GitHub repository for more information.

Why include custom operators for ONNX Runtime in MMCV

To verify the correctness of exported ONNX models in ONNX Runtime.
To ease the deployment of ONNX models with custom operators from mmcv.ops in ONNX Runtime.

List of operators for ONNX Runtime supported in MMCV

Operator	CPU	GPU	MMCV Releases
SoftNMS	Y	N	1.2.3
RoIAlign	Y	N	1.2.5
NMS	Y	N	1.2.7
grid_sampler	Y	N	1.3.1
CornerPool	Y	N	1.3.4
cummax	Y	N	1.3.4
cummin	Y	N	1.3.4

How to build custom operators for ONNX Runtime

Note that only the CPU version of onnxruntime>=1.8.1 on Linux has been tested so far.

Prerequisite

Clone repository

git clone https://github.com/open-mmlab/mmcv.git

Download onnxruntime-linux from ONNX Runtime releases, extract it, expose ONNXRUNTIME_DIR and finally add the lib path to LD_LIBRARY_PATH as below:

wget https://github.com/microsoft/onnxruntime/releases/download/v1.8.1/onnxruntime-linux-x64-1.8.1.tgz

tar -zxvf onnxruntime-linux-x64-1.8.1.tgz
cd onnxruntime-linux-x64-1.8.1
export ONNXRUNTIME_DIR=$(pwd)
export LD_LIBRARY_PATH=$ONNXRUNTIME_DIR/lib:$LD_LIBRARY_PATH
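
Before building, it can help to confirm that the environment variables set above point where the build expects. The following is a minimal sketch, not part of MMCV: the helper name check_onnxruntime_dir is hypothetical, and it assumes the extracted archive contains include/ and lib/ directories at its top level.

```python
import os


def check_onnxruntime_dir(env=None):
    """Return a list of problems with the ONNX Runtime build environment."""
    env = os.environ if env is None else env
    root = env.get('ONNXRUNTIME_DIR')
    if not root:
        return ['ONNXRUNTIME_DIR is not set']

    problems = []
    # Assumption: the extracted archive has include/ and lib/ at its top level.
    for sub in ('include', 'lib'):
        path = os.path.join(root, sub)
        if not os.path.isdir(path):
            problems.append(f'missing directory: {path}')

    # The lib directory should also be on the dynamic loader search path.
    lib_dir = os.path.normpath(os.path.join(root, 'lib'))
    entries = env.get('LD_LIBRARY_PATH', '').split(os.pathsep)
    if lib_dir not in [os.path.normpath(e) for e in entries if e]:
        problems.append(f'{lib_dir} is not on LD_LIBRARY_PATH')
    return problems
```

Calling check_onnxruntime_dir() with no arguments inspects the current process environment; an empty list means no problem was detected by these checks.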

Build on Linux

cd mmcv  # go to the MMCV root directory
MMCV_WITH_OPS=1 MMCV_WITH_ORT=1 python setup.py develop
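
One way to confirm that the build produced the ONNX Runtime custom-operator library is to ask MMCV for its path. The sketch below reuses get_onnxruntime_op_path, the same function the inference demo in this document imports. The helper name find_custom_op_library is hypothetical, and the sketch assumes that an empty or non-existent path means the library was not built.

```python
import os


def find_custom_op_library():
    """Return the path of the compiled custom-op library, or None."""
    try:
        from mmcv.ops import get_onnxruntime_op_path
    except ImportError:
        # MMCV is not installed, or was built without the ops.
        return None
    path = get_onnxruntime_op_path()
    # Assumption: an empty or missing path means the ORT ops were not built.
    return path if path and os.path.exists(path) else None
```

If this returns None right after the build, the most likely causes are that MMCV_WITH_ORT=1 was not set or that ONNXRUNTIME_DIR was not exported in the same shell.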

How to do inference using exported ONNX models with custom operators in ONNX Runtime in Python

Install ONNX Runtime with pip

pip install onnxruntime==1.8.1

Inference Demo

import os

import numpy as np
import onnxruntime as ort

from mmcv.ops import get_onnxruntime_op_path

# Locate the compiled custom-operator library shipped with MMCV.
ort_custom_op_path = get_onnxruntime_op_path()
assert os.path.exists(ort_custom_op_path)
# Register the custom operators before creating the session.
session_options = ort.SessionOptions()
session_options.register_custom_ops_library(ort_custom_op_path)
# Exported ONNX model with custom operators.
onnx_file = 'sample.onnx'
input_data = np.random.randn(1, 3, 224, 224).astype(np.float32)
sess = ort.InferenceSession(onnx_file, session_options)
# 'input' must match the input name used when the model was exported.
onnx_results = sess.run(None, {'input': input_data})
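
The demo above hard-codes the input name 'input', which only works if the model was exported with that name. As a hedged alternative, the input names can be read from the session itself through get_inputs(). The helper build_feed below is a hypothetical name used for illustration; it only assumes that each entry returned by get_inputs() has a name attribute, as ONNX Runtime's input metadata does.

```python
def build_feed(session, arrays):
    """Pair positional arrays with the input names declared by the model."""
    inputs = session.get_inputs()
    if len(inputs) != len(arrays):
        raise ValueError(
            f'model expects {len(inputs)} input(s), got {len(arrays)}')
    # Inputs are matched in the order the model declares them.
    return {meta.name: array for meta, array in zip(inputs, arrays)}
```

With the session from the demo, this could be used as sess.run(None, build_feed(sess, [input_data])).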

How to add a new custom operator for ONNX Runtime in MMCV

Reminder

Please note that this feature is experimental and may change in the future. We strongly suggest always trying the latest master branch.
The custom operator should not already be included in ONNX Runtime's list of supported operators.
The custom operator must be exportable to ONNX.

Main procedures

Take the custom operator soft_nms as an example.

Add header soft_nms.h to ONNX Runtime include directory mmcv/ops/csrc/onnxruntime/
Add source soft_nms.cpp to ONNX Runtime source directory mmcv/ops/csrc/onnxruntime/cpu/

Register soft_nms operator in onnxruntime_register.cpp

#include "soft_nms.h"

SoftNmsOp c_SoftNmsOp;

if (auto status = ortApi->CustomOpDomain_Add(domain, &c_SoftNmsOp)) {
  return status;
}

Add a unit test to tests/test_ops/test_onnx.py. See the existing tests there for examples.

Finally, you are welcome to send us a PR adding custom operators for ONNX Runtime to MMCV. 🤓

Known Issues

"RuntimeError: tuple appears in op that does not forward tuples, unsupported kind: prim::PythonOp."
1. In general, cummax and cummin can be exported to ONNX as long as the torch version is >= 1.5.0, since torch.cummax is only supported with torch >= 1.5.0. However, when cummax or cummin is an intermediate component whose outputs are used as inputs to other modules, torch must be >= 1.7.0. Otherwise, the above error might arise when running the exported ONNX model with onnxruntime.
2. Solution: update torch to version 1.7.0 or higher.
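
The version requirement above can be checked before exporting, so the problem surfaces early instead of at inference time. This is a minimal sketch: the helper names version_tuple and supports_intermediate_cummax are hypothetical, and the parser only reads the leading numeric part of the version string, ignoring suffixes such as +cu101.

```python
import re


def version_tuple(version):
    """Parse the leading 'X.Y' or 'X.Y.Z' of a version string."""
    match = re.match(r'(\d+)\.(\d+)(?:\.(\d+))?', version)
    if match is None:
        raise ValueError(f'unrecognized version string: {version!r}')
    # A missing patch component is treated as 0.
    return tuple(int(part or 0) for part in match.groups())


def supports_intermediate_cummax(torch_version):
    """True if cummax/cummin outputs may feed other modules (torch >= 1.7.0)."""
    return version_tuple(torch_version) >= (1, 7, 0)
```

A typical call would be supports_intermediate_cummax(torch.__version__), run before the model is exported.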

References

How to export Pytorch model with custom op to ONNX and run it in ONNX Runtime
How to add a custom operator/kernel in ONNX Runtime
