Mmdeploy

Latest version: v1.3.1

Safety actively analyzes 714860 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

Page 3 of 4

0.11.0

Features

- Support MMaction2 `TSN` and `SlowFast` deployment with ONNXRuntime and TensorRT (1183,1410,1455)
- Support Rockchip device `RV1126`
- Rewrite `BaseDenseHead.get_bboxes` to support SSD, FSAF and RetinaNet (1203)
- Add BaseDenseHead postprocessing in SDK (1238)
- Support YOLOv3 and YOLOv5 postprocessing in SDK (1280,1424)
- Add SDK profiler (1274)
- Support end2end deployment for pointpillars & centerpoint(pillar)from MMDet3d (1178)

Improvements

- Support loading TensorRT libnvinfer plugins (1275)
- Avoid copying dense arrays in SDK C API and Python API (1261, 1349)
- Add Core ML common configuration (1308)
- Refactor SDK registry (1368)
- Update regresssion test to serialize eval result into json (1310)
- Support onnxruntime-1.13 API(1407)
- Decouple preprocess operation and transformation (1353)

Bug fixes

- Set stream argument when using async memcpy (1314)
- Use OpenCV with `videoio` enabled for aarch64 platform (1343)
- Fix(tools/scripts): find env file failed (1385)
- Fix ncnn-int8 config path (1380)
- Fix out-of-boundary issue in SDK when `topk` is larger than `class_num` (1420)
- Fix yolohead trt8.2 (1433)
- Fix `pad_to_square` (1436)
- Fix `det_pose` demo (1419)
- Relax module adapter template constraints (1366)
- Fix ncnn torch 1.12 master (1430)
- Avoid gpu topk const-fold (1439)
- Support .NET Framwork 4.8 and fix batch inference error(1370)
- Upgrade ncnn to `20221128` to resolve build error (1459)

Document

- Add more images for demos and user guides (1339)
- Improve mmdet3d doc (1394)
- Display CI results in README (1452)
- Fix dead links in `write_config.md` (1396)

Contributors

xin-li-67 sunjiahao1999 francis0407 Typiqally triple-Mu lvhan028 grimoire AllentDan RunningLeon lzhangzz tpoisonooo hanrui1sensetime

0.10.0

C
typedef struct mmdeploy_mat_t {
uint8_t* data;
int height;
int width;
int channel;
mmdeploy_pixel_format_t format;
mmdeploy_data_type_t type;
mmdeploy_device_t device;
} mmdeploy_mat_t;


Bug fixes
- Fix `test_windows_onnxruntime` workflow error in circleci (1254)
- Fix build error when the target device is 'cuda' and the inference backend is 'onnxruntime-gpu' (1253)
- Fix `layer_norm` symbol error when exporting it with torch>=1.12 (1168)
- Fix regression test script errors (1217, 1146)

Document
- Update supported backend logos in the cover of README (1252)
- Add a link to MMYOLO in README (1235)

Contributors
doufengqi Qingrenn liu-mengyang SsTtOoNnEe OldDreamInWind sunjiahao1999 LiuYi-Up isLinXu lansfair lvhan028 grimoire AllentDan RunningLeon lzhangzz tpoisonooo hanrui1sensetime

0.9.0

Features

- Add Rust API for mmdeploy SDK. Project: https://github.com/liu-mengyang/rust-mmdeploy
- Support MMOCR TextSnake and MMPose Hourglass model deployment with ncnn-int8 (1074, 1064, 1066)
- Rewrite `torch.Tensor.__mod__` to support TensorRT (1024)

Improvements
- Separate C++ API demos from C API demos (1099)
- Refactor SDK pipeline (938)
- Check upstream libopencv-dev version before adding apt repository (1068)
- Make inference still available on headless device (1041)
- Validate installation in building scripts (1036)

Bug fixes

- Set `size_divisor` of `Pad` transform to `1` for static shape model. (1049)
- Fix `LayerNorm` shape issue when exporting to onnx with `torch <= 1.12` (1015)
- Fix calibration error when converting model to TensorRT-int8 (1050)
- Synchronize cuda stream after inference with onnxruntime-gpu (1053)
- Add `GatherTopk` TensorRT plugin as a workaround to fix dynamic shape issue (1033)
- Fix `RoiAlignFunction` error for CoreML (1029)
- Resolve two-stage detector deployment error with CoreML (1044)
- Fix two-stage detector TensorRT deployment error with dynamic shape (1046)

Document

- Update supported backends table in README (1109)
- Correct examples in tutorial - `how to develop TensorRT plugin` (1021)
- Fix broken links and typos (1078, 1025, 1061)

Contributors
liu-mengyang BrokenArrow1404 jinwonkim93 Qingrenn JingweiZhang12 ichitaka Typiqally lvhan028 irexyc tpoisonooo lzhangzz grimoire AllentDan hanrui1sensetime

0.8.0

Highlight
- Support more platforms and devices: `RISC-V`, `Apple M1`, `Huawei Ascend310` and `Rockchip RK3588`

Features
- Support more models on ONNX Runtime and TensorRT
- mmdetection `DETR` (924)
- mmclassification `Swin Transformer` (911)
- mmdetection3d `pointpillars (nus version)` (319)
- Support more platforms and devices:
- `RISC-V` via `ncnn` (910)
- `Apple M1` (760)
- `Huawei Ascend310` (747)
- `Rockchip RK3588` (865)
- Add `TorchScript` SDK inference backend (890)
- Experimental support for fusing transformations in preprocess pipeline by [CVFusion](https://github.com/OpenComputeLab/CVFusion) (#741)

Improvements

- Support multi-label classification in SDK (950)
- Add the following scripts to simplify mmdeploy installation for some scenarios: (919)

<div class="okr-block-clipboard" data-okr="%7B%22okrDelta%22%3A%5B%7B%22lineType%22%3A%22unsupport%22%2C%22lineOptions%22%3A%7B%7D%2C%22lineContent%22%3A%5B%5D%7D%5D%2C%22businessKey%22%3A%22lark-doc%22%7D"></div><div data-zone-id="0" data-line-index="0" style="white-space: pre;">

script | OS version
-- | --
build_ubuntu_x64_ncnn.py | 18.04/20.04
build_ubuntu_x64_ort.py | 18.04/20.04
build_ubuntu_x64_pplnn.py | 18.04/20.04
build_ubuntu_x64_torchscript.py | 18.04/20.04

</div>

- Add scaled dot-product attention operator for TensorRT (949)
- Support model batch inference profiling (868)
shell
profile the latency of resnet18-tensorrt model with batch size 4
python tools/profiler.py \
configs/mmcls/classification_tensorrt_dynamic-224x224-224x224.py \
../mmclassification/configs/resnet/resnet18_8xb32_in1k.py \
{/the/path/of/an/image/directory} \
--model {work-dirs}/mmcls/resnet/trt/end2end.engine \
--device cuda \
--shape 224x224 \
--num-iter 100 \
--warmup 10 \
--batch-size 4


Bug fixes

- Fix CI errors (985, 983, 977, 987, 966, 945)
- Fix missing `sqrt` in `PAAHead` (984)
- Fix `nms_rotated` logic when no bbox is detected (976)
- Fix rewrite for `torch.Tensor.__setitem__` in some corner cases (964, 941)
- Disable ONNX optimizer when converting model to ncnn (961)
- Fix regression test (958)
- Disable cublaslt for CUDA 10.2 (947)
- Stop sorting dataset by default & set `test_mode` for mmdet pipelines (920)
- Resolve the issue (909) - ` ValueError: cpu is invalid for the backend tensorrt.` when exporting SDK meta info (912)
- Validate the device id when the inference backend is TensorRT or OpenVINO (886)
- Fix mmdeploy_pplnn_net build error when target device is CPU (896)
- Replace `adaptive_avg_pool2d` with `avg_pool2d` to support exporting ONNX with dynamic shape (857)


Document

- Clarify arguments in model conversion (956, 940)
- Add tutorial in Chinese about "[How to write a customized TensorRT plugin" ](https://github.com/open-mmlab/mmdeploy/blob/master/docs/zh_cn/tutorial/07_write_a_plugin.md)(#290)
- Keep cmake build option in a separate document [cmake_option](https://github.com/open-mmlab/mmdeploy/blob/master/docs/en/01-how-to-build/cmake_option.md). (#832)
- Add project architecture (882)
- Sync Enginsh and Chinese documents (842)
- Correct build-demo commands in [prebuilt_package_windows.md](https://github.com/open-mmlab/mmdeploy/blob/master/docs/en/02-how-to-run/prebuilt_package_windows.md) (#879)
- Fix the wrong argument in model quantization document (866)

Known issues
- `DETR` deployment failed both via ONNX Runtime and TensorRT (1011, [pytorch 84563](https://github.com/pytorch/pytorch/issues/84563))
Contributors
OldDreamInWind liu-mengyang gy-7 Groexhy munhou miraclezqc VVsssssk hanrui1sensetime tpoisonooo grimoire irexyc RunningLeon AllentDan lzhangzz lvhan028

0.7.0

Highlight
- Support SNPE (789)
- Please refer to [Build for SNPE](https://github.com/open-mmlab/mmdeploy/blob/master/docs/en/01-how-to-build/snpe.md) to get start SNPE deployment journey
- Add C++ API for SDK (831)

Features
- Support SNPE (789)
- Add C++ API for SDK (831)
- Support MMRotate model with le135 angle format (788)
- Support RoI Transformer and `Gliding Vertex` model deployment from `MMRotate` (713, 650)
- Add inference latency test script `tools/profile.py` (655)
Here is an example to profile `TensorRT_fp32-resnet18` inference latency:
shell
python tools/profile.py \
configs/mmcls/classification_tensorrt_dynamic-224x224-224x224.py \
../mmclassification/configs/resnet/resnet18_8xb32_in1k.py \
../mmdetection/demo
--model work-dirs/mmcls/resnet/trt/end2end.engine \
--device cuda \
--shape 224x224 \
--num-iter 100 \
--warmup 10


Improvements
- Optimize prebuilt process for Python SDK (810)
- Upgrade `ppl.nn` and `ppl.cv` to `v0.8.1` and `v0.7.0` respectively (793, 564)
- Support batch image test in test script `test.py` (829)
- Install onnx optimizer by setuptools instead of cmake build (690, 811, 843)
- Add SDK code coverage (808)
- Support kwargs in SDK Python bindings (794, 844, 852)
- Support building SDK into a single library by enabling `MMDEPLOY_BUILD_SDK_MONOLITHIC` (806)
- Add a new option `MMDEPLOY_BUILD_EXAMPLES` to build and install SDK examples (822)
- Reduce log verbosity and improve error reporting (755)
- Upgrade GPU Dockerfile to use TensorRT 8.2.4.2 (706)
- Optimize ONNX graph
- Add a function rewriter to `torch.Tensor.__setitem__`, eliminating almost 80% nodes for `x[:,:,:H,:W] = y` onnx export (704)
- Add `CommonSubgraphElimination` onnx pass (647)
- [BC Breaking] Standardize C API(634)
- Prefix all struct with `mmdeploy_` and move all header files into `mmdeploy` folder.
![image](https://user-images.githubusercontent.com/4560679/182781226-7444d5cc-fdd8-434d-b041-97d4ccd53a4f.png)
- Rename `onnx2ncnn` to `mmdeploy_onnx2ncnn` (694)

Bug fixes
- Fix build error on macOS platform (762)
- Fix `troch.triu` function rewriter error when exporting to onnx (792)
- Resolve Cascade R-CNN, `YOLOX` and `SATRN` deployment failure (787, 758, 753)
- Fix `check_env.py` about checking whether custom ops are available (785)
- Fix export for TopK operator in PyTorch 1.12 (715)
Fix export for padding operators in PyTorch<1.10 (754)
- Add default `topk` in SDK model meta info when it is not explicitly specified in `mmclassifcation` model configs (702)
- Fix SingleRoIExtractor for TorchScript backend (724)
- Fix export for DistancePointBBoxCoder.decode (687)
- Fix wrong backend type when doing calibration (719)
- Set exit code to 1 when error happens (715)
- Fix build error on android platform (698)
- Pass `img_metas` while exporting to onnx (681, 700, 707)

Document

- Update build document for android platform (817)
- Fix rendering issues of `get_started` documents in readthedocs (740)
- Add prebuilt package usage on Windows platform (816)
- Simplify `get_started` guide (813)

Contributors

nijkah dwSun lvhan028 lzhangzz irexyc RunningLeon grimoire tpoisonooo AllentDan hanrui1sensetime

0.6.0

Highlight

- Support Swin Transformer deployment with TensorRT and ONNX Runtime (652)
- Support Segmenter deployment with all backends (587)
- Add Java API for SDK (563)


Features

- Support Swin Transformer deployment with TensorRT and ONNX Runtime (652)
- Add Java API for SDK (563)
- Support `Segmenter` deployment with all backends (587)
- Support two-stage rotated detector deployment with TensorRT (530)


Improvements

- Add onnx pass to fuse `select-assign` graph pattern (589)
- Add more CircleCI workflows on Linux, Windows and Linux-GPU platforms (368)
- Add documentation and sample code for model partitioning (599)
- Add `GridPriorsTRT` plugin to speed up TensorRT anchor generation from `155us` t0 `13us` (646)
- Add `MMDEPLOY_TASKS` variable in cmake scripts to remove duplication code (606)
- Improve ncnn patch embed (592)
- Support compute capability 87 for Jetson Orin (601)
- Adjust `csrc` structure (594)


Bug fixes

- Add `build` to TensorRT plugin candidate path list (672)
- Fix missing "image shape" when exporting mmpose models (667)
- Fix ncnn unittest error (626)
- Fix bugs when deploying ShuffleNetV2 with TensorRT (645)
- Relax `mmcls` version constraint (653)
- Eliminate illegal memory access for object detector C API (613)
- Add dim param for `Tensor::Squeeze` (603)
- Fix link missed issue in `index.rst` (607)
- Add support for MMOCR 0.5+ (604)
- Fix output tensor shape of ncnn backend (605)


Documentation

- Fix errors and typos in user documents (676, 675, 655, 654, 621, 588, 586)
- Update deployment benchmark for ViT (624)
- Replace markdown lint with `mdformat` and configure `myst-parser` (610)

Contributors
zambranohally bgsuello triple-Mu DrRyanHuang liuqc11 Yosshi999 zytx121 RunningLeon AllentDan lzhangzz irexyc grimoire lvhan028 hanrui1sensetime tpoisonooo

Page 3 of 4

© 2025 Safety CLI Cybersecurity Inc. All Rights Reserved.