<!-- Release notes generated using configuration in .github/release.yml at main -->
## What's Changed
### 🚀 Features
* support release pipeline by irexyc in https://github.com/InternLM/lmdeploy/pull/3069
* [feature] add dlinfer w8a8 support. by Reinerzhou in https://github.com/InternLM/lmdeploy/pull/2988
* [maca] support deepseekv2 for maca backend. by Reinerzhou in https://github.com/InternLM/lmdeploy/pull/2918
* [Feature] support deepseek-vl2 for pytorch engine by CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/3149
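To try the newly supported deepseek-vl2 on the PyTorch engine, a minimal sketch along the lines of the standard lmdeploy pipeline API (the model path, image URL, and prompt below are illustrative placeholders, not taken from these notes):

```python
# Minimal sketch: deepseek-vl2 on the PyTorch engine via the lmdeploy pipeline API.
# Model path, session length, prompt and image URL are illustrative assumptions.
from lmdeploy import pipeline, PytorchEngineConfig
from lmdeploy.vl import load_image

# Build a pipeline backed by the PyTorch engine
pipe = pipeline('deepseek-ai/deepseek-vl2',
                backend_config=PytorchEngineConfig(session_len=8192))

# Load an example image and run a single vision-language query
image = load_image('https://raw.githubusercontent.com/open-mmlab/mmdeploy/main/tests/data/tiger.jpeg')
response = pipe(('describe this image', image))
print(response.text)
```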
### 💥 Improvements
* use weights iterator while loading by RunningLeon in https://github.com/InternLM/lmdeploy/pull/2886
* Add deepseek-r1 chat template by AllentDan in https://github.com/InternLM/lmdeploy/pull/3072
* Update tokenizer by lvhan028 in https://github.com/InternLM/lmdeploy/pull/3061
* Set max concurrent requests by AllentDan in https://github.com/InternLM/lmdeploy/pull/2961
* remove logitswarper by grimoire in https://github.com/InternLM/lmdeploy/pull/3109
* Update benchmark script and user guide by lvhan028 in https://github.com/InternLM/lmdeploy/pull/3110
* support eos_token list in turbomind by irexyc in https://github.com/InternLM/lmdeploy/pull/3044
* Use aiohttp inside proxy server && add --disable-cache-status argument by AllentDan in https://github.com/InternLM/lmdeploy/pull/3020
* Update runtime package dependencies by zgjja in https://github.com/InternLM/lmdeploy/pull/3142
* Make turbomind support embedding inputs on GPU by chengyuma in https://github.com/InternLM/lmdeploy/pull/3177
### 🐞 Bug fixes
* [dlinfer] fix ascend qwen2_vl graph_mode by yao-fengchen in https://github.com/InternLM/lmdeploy/pull/3045
* fix error in interactive api by lvhan028 in https://github.com/InternLM/lmdeploy/pull/3074
* fix sliding window mgr by grimoire in https://github.com/InternLM/lmdeploy/pull/3068
* More arguments in api_client, update docstrings by AllentDan in https://github.com/InternLM/lmdeploy/pull/3077
* Add system role to deepseek chat template by AllentDan in https://github.com/InternLM/lmdeploy/pull/3031
* Fix xcomposer2d5 by irexyc in https://github.com/InternLM/lmdeploy/pull/3087
* fix user guide about cogvlm deployment by lvhan028 in https://github.com/InternLM/lmdeploy/pull/3088
* fix positional argument by lvhan028 in https://github.com/InternLM/lmdeploy/pull/3086
* Fix UT of deepseek chat template by lvhan028 in https://github.com/InternLM/lmdeploy/pull/3125
* Fix internvl2.5 error after eviction by grimoire in https://github.com/InternLM/lmdeploy/pull/3122
* Fix cogvlm and phi3vision by RunningLeon in https://github.com/InternLM/lmdeploy/pull/3137
* [fix] fix vl gradio, use pipeline api and remove interactive chat by irexyc in https://github.com/InternLM/lmdeploy/pull/3136
* fix the issue that stop_token may contain fewer tokens than defined in model.py by irexyc in https://github.com/InternLM/lmdeploy/pull/3148
* fix typing by lz1998 in https://github.com/InternLM/lmdeploy/pull/3153
* fix min length penalty by irexyc in https://github.com/InternLM/lmdeploy/pull/3150
* fix default temperature value by irexyc in https://github.com/InternLM/lmdeploy/pull/3166
* Use pad_token_id as image_token_id for vl models by RunningLeon in https://github.com/InternLM/lmdeploy/pull/3158
* Fix tool call prompt for InternLM and Qwen by AllentDan in https://github.com/InternLM/lmdeploy/pull/3156
* Update qwen2.py by GxjGit in https://github.com/InternLM/lmdeploy/pull/3174
* fix temperature=0 by grimoire in https://github.com/InternLM/lmdeploy/pull/3176
* fix blocked fp8 moe by grimoire in https://github.com/InternLM/lmdeploy/pull/3181
* fix the deepseekv2 'has no attribute use_mla' error by CUHKSZzxy in https://github.com/InternLM/lmdeploy/pull/3188
* fix unstoppable chat by lvhan028 in https://github.com/InternLM/lmdeploy/pull/3189
### 🌐 Other
* [ci] add internlm3 to testcases by zhulinJulia24 in https://github.com/InternLM/lmdeploy/pull/3038
* add internlm3 to supported models by lvhan028 in https://github.com/InternLM/lmdeploy/pull/3041
* update pre-commit config by lvhan028 in https://github.com/InternLM/lmdeploy/pull/2683
* [maca] add cudagraph support on maca backend. by Reinerzhou in https://github.com/InternLM/lmdeploy/pull/2834
* bump version to v0.7.0.post1 by lvhan028 in https://github.com/InternLM/lmdeploy/pull/3076
* bump version to v0.7.0.post2 by lvhan028 in https://github.com/InternLM/lmdeploy/pull/3094
* [Fix] fix the URL detection issue on Windows by Lychee-acaca in https://github.com/InternLM/lmdeploy/pull/3103
* bump version to v0.7.0.post3 by lvhan028 in https://github.com/InternLM/lmdeploy/pull/3115
* [ci] fix some failures in daily testcases by zhulinJulia24 in https://github.com/InternLM/lmdeploy/pull/3134
* Bump version to v0.7.1 by lvhan028 in https://github.com/InternLM/lmdeploy/pull/3178
## New Contributors
* Lychee-acaca made their first contribution in https://github.com/InternLM/lmdeploy/pull/3103
* lz1998 made their first contribution in https://github.com/InternLM/lmdeploy/pull/3153
* GxjGit made their first contribution in https://github.com/InternLM/lmdeploy/pull/3174
* chengyuma made their first contribution in https://github.com/InternLM/lmdeploy/pull/3177
* CUHKSZzxy made their first contribution in https://github.com/InternLM/lmdeploy/pull/3149
**Full Changelog**: https://github.com/InternLM/lmdeploy/compare/v0.7.0...v0.7.1