中文版
新特性
1. 支持使用`swift app`开启可视化推理创空间,参考[这里](https://github.com/modelscope/ms-swift/tree/main/examples/app)
2. 支持大模型的RM和PPO训练,参考[这里](https://github.com/modelscope/ms-swift/tree/main/examples/train/rlhf)
3. 支持`SequenceClassification`模型(含BERT)的BNB/GPTQ量化,参考[这里](https://github.com/modelscope/ms-swift/tree/main/examples/export/quantize/bert)
4. 支持reward model的推理、部署和BNB/GPTQ量化
新模型
1. ZhipuAI/cogagent-9b-20241220
2. Reward Models: Shanghai_AI_Laboratory/internlm2-1_8b-reward系列, Qwen/Qwen2-Math-RM-72B系列, AI-ModelScope/Skywork-Reward-Llama-3.1-8B系列, AI-ModelScope/GRM_Llama3.1_8B_rewardmodel-ft系列
3. AIDC-AI/Ovis1.6-Gemma2-27B, AIDC-AI/Ovis1.6-Llama3.2-3B
4. PowerInfer/SmallThinker-3B-Preview
新数据集
1. PowerInfer/LONGCOT-Refine-500K, PowerInfer/QWQ-LONGCOT-500K
English Version
New Features
1. Support for using `swift app` to launch a visual inference creative space, see [here](https://github.com/modelscope/ms-swift/tree/main/examples/app)
2. Support for RM and PPO training of large models, see [here](https://github.com/modelscope/ms-swift/tree/main/examples/train/rlhf)
3. Support for BNB/GPTQ quantization of `SequenceClassification` models (including BERT), see [here](https://github.com/modelscope/ms-swift/tree/main/examples/export/quantize/bert)
4. Support for inference, deployment, and BNB/GPTQ quantization of reward models
New Models
1. ZhipuAI/cogagent-9b-20241220
2. Reward Models: Shanghai_AI_Laboratory/internlm2-1_8b-reward series, Qwen/Qwen2-Math-RM-72B series, AI-ModelScope/Skywork-Reward-Llama-3.1-8B series, AI-ModelScope/GRM_Llama3.1_8B_rewardmodel-ft series
3. AIDC-AI/Ovis1.6-Gemma2-27B, AIDC-AI/Ovis1.6-Llama3.2-3B
4. PowerInfer/SmallThinker-3B-Preview
New Datasets
1. PowerInfer/LONGCOT-Refine-500K, PowerInfer/QWQ-LONGCOT-500K
What's Changed
* Fix app-ui dropdown by tastelikefeet in https://github.com/modelscope/ms-swift/pull/2787
* fix multi-lora by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2790
* fix stream infer by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2793
* fix some web-ui bugs by tastelikefeet in https://github.com/modelscope/ms-swift/pull/2794
* support swift app by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2792
* fix pt batch infer by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2800
* fix world_size by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2801
* update base_model deploy example by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2803
* fix glm4v by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2806
* fix swift deploy log error (repeat log) by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2808
* support ZhipuAI/cogagent-9b-20241220 by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2810
* fix citest by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2812
* fix enable_cache by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2813
* update docs (specific model arguments) by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2822
* add 'right' option for 'truncation_strategy' by zsxm1998 in https://github.com/modelscope/ms-swift/pull/2754
* Fix glm4v suffix by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2829
* Update padding side by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2832
* Update base_to_chat shell by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2833
* Fix bugs by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2838
* Fix some bugs by tastelikefeet in https://github.com/modelscope/ms-swift/pull/2848
* support reward_model by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2849
* Move optimizer to create_optimizer by tastelikefeet in https://github.com/modelscope/ms-swift/pull/2851
* fix post_init by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2855
* fix cache_name_file by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2856
* fix telechat template by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2857
* Update more models by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2852
* Support quant bert reward by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2859
* fix jsonl writer by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2860
* support reward model train by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2862
* fix vllm video by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2864
* support mps by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2866
* Update agent demo by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2867
* fix bugs by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2869
* Support ppo by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2783
* update citest by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2873
* fix dataset cache bugs by Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2876
New Contributors
* zsxm1998 made their first contribution in https://github.com/modelscope/ms-swift/pull/2754
**Full Changelog**: https://github.com/modelscope/ms-swift/compare/v3.0.1...v3.0.2