Releases: modelscope/ms-swift

v3.2.0

04 Mar 15:48

New Features

  1. GRPO supports data-parallel sampling with multiple vLLM/lmdeploy instances, as well as asynchronous sampling; see here. Records of the multimodal GRPO experiments can be found here.
  2. swift deploy supports dynamic batching when infer_backend is pt; the streaming inference interface has changed (breaking change).
  3. swift infer supports data parallelism when infer_backend is vllm or lmdeploy; see here.
  4. Supports the Muon optimizer; see here.
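
For readers new to GRPO: the sampling step produces several completions per prompt, and each completion's reward is then normalized against the other completions in the same group. The sketch below illustrates only that normalization. The function name `group_relative_advantages`, the `eps` value, and the use of the population standard deviation are assumptions made for illustration, not ms-swift's API or its exact formula.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-4):
    """Normalize the rewards of one prompt's sampled completions.

    Hypothetical helper for illustration; not part of ms-swift.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population standard deviation of the group
    # eps keeps the division finite when every completion gets the same reward
    return [(r - mu) / (sigma + eps) for r in rewards]


# Four completions sampled for a single prompt, scored by some reward function
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
```

Completions that score above the group mean get a positive advantage and the rest a negative one, so sampling more completions per prompt (for example, in parallel across several inference engines) gives a better estimate of that mean.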

New Models

  1. moonshotai/Moonlight-16B-A3B-Instruct
  2. LLM-Research/Phi-4-mini-instruct, LLM-Research/Phi-4-multimodal-instruct
  3. DeepSeek-V3-awq, deepseek-r1-awq
  4. Baichuan-M1-14B-Instruct

New Datasets

  1. Multi-modal GRPO:
    • lmms-lab/multimodal-open-r1-8k-verified
    • okwinds/clevr_cogen_a_train

Full Changelog: v3.1.1...v3.2.0

v3.1.1

20 Feb 06:31

New Features

  1. Support for large models, multimodal models, Agents, and multi-node GRPO training. Refer to this documentation.
  2. Support for Embedding model training. Refer to this script.
  3. swift sample supports MCTS and distillation data sampling, as well as multimodal model sampling.
  4. Support for custom dataset evaluation. Refer to this documentation.

New Models

  1. AIDC-AI/Ovis2-2B series
  2. Qwen/Qwen2.5-VL-72B-Instruct-AWQ series
  3. stepfun-ai/GOT-OCR-2.0-hf
  4. stepfun-ai/Step-Audio-Chat
  5. mistralai/Mistral-Small-24B-Instruct-2501

New Datasets

  1. Related to GRPO
    • AI-ModelScope/MATH-lighteval
    • LLM-Research/xlam-function-calling-60k
    • AI-MO/NuminaMath-TIR
  2. Related to R1
    • liucong/Chinese-DeepSeek-R1-Distill-data-110k-SFT
    • modelscope/MathR, modelscope/MathR-32B-Distill

Full Changelog: v3.1.0...v3.1.1

v3.1.0

07 Feb 12:38

New Features

  1. Supports the swift sample command for data sampling; refer to here.
  2. Supports reinforcement fine-tuning, currently via rejection sampling fine-tuning; refer to here.
  3. Restructured the custom data format for grounding tasks; refer to here.
  4. swift infer can report inference speed and ACC/ROUGE/BLEU metrics.
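
As background for the rejection sampling item above: the general idea is to sample several completions per prompt, keep only those that pass a check, and fine-tune on the survivors. The sketch below shows that filtering step with toy stand-ins. `rejection_sample`, `toy_generate`, and `toy_accept` are hypothetical names, and nothing here reflects how swift sample is actually implemented.

```python
import random


def rejection_sample(prompt, generate, accept, num_samples=8):
    """Keep only the sampled completions that pass the acceptance check."""
    candidates = [generate(prompt) for _ in range(num_samples)]
    return [
        {"prompt": prompt, "response": c}
        for c in candidates
        if accept(prompt, c)
    ]


rng = random.Random(0)


def toy_generate(prompt):
    # Stand-in for a model: guesses a single digit
    return str(rng.randint(0, 9))


def toy_accept(prompt, response):
    # Stand-in for a verifier or reward threshold
    return response == "4"


sft_rows = rejection_sample("What is 2 + 2?", toy_generate, toy_accept, num_samples=50)
```

In practice the acceptance check would be a reward model or a rule-based verifier, and the accepted rows would be written out as a fine-tuning dataset.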

New Models

  1. Qwen/Qwen2.5-VL-3B-Instruct Series
  2. Qwen/Qwen2.5-7B-Instruct-1M Series
  3. deepseek-ai/Janus-Pro-1B Series
  4. bytedance-research/UI-TARS-2B-SFT Series

New Datasets

  1. ServiceNow-AI/R1-Distill-SFT
  2. bespokelabs/Bespoke-Stratos-17k
  3. open-thoughts/OpenThoughts-114k

Full Changelog: v3.0.3...v3.1.0

v3.0.3

22 Jan 15:27

New Features

  1. Supports the SequenceClassification architecture for multimodal large models, for use in multimodal classification tasks; see here.
  2. Supports reward model training for multimodal large models.

New Models

  1. Shanghai_AI_Laboratory/internlm3-8b-instruct
  2. OpenBMB/MiniCPM-o-2_6
  3. deepseek-ai/DeepSeek-R1, deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B series
  4. bytedance-research/Valley-Eagle-7B
  5. LLM-Research/phi-4
  6. Qwen/Qwen2.5-Math-PRM-7B, Qwen/Qwen2.5-Math-PRM-72B
  7. MiniMaxAI/MiniMax-Text-01, MiniMaxAI/MiniMax-VL-01

Full Changelog: v3.0.2...v3.0.3

v3.0.2

07 Jan 15:28

New Features

  1. Support for using swift app to launch a visual inference app for ModelScope Studio (创空间), see here
  2. Support for RM and PPO training of large models, see here
  3. Support for BNB/GPTQ quantization of SequenceClassification models (including BERT), see here
  4. Support for inference, deployment, and BNB/GPTQ quantization of reward models

New Models

  1. ZhipuAI/cogagent-9b-20241220
  2. Reward Models: Shanghai_AI_Laboratory/internlm2-1_8b-reward series, Qwen/Qwen2-Math-RM-72B series, AI-ModelScope/Skywork-Reward-Llama-3.1-8B series, AI-ModelScope/GRM_Llama3.1_8B_rewardmodel-ft series
  3. AIDC-AI/Ovis1.6-Gemma2-27B, AIDC-AI/Ovis1.6-Llama3.2-3B
  4. PowerInfer/SmallThinker-3B-Preview

New Datasets

  1. PowerInfer/LONGCOT-Refine-500K, PowerInfer/QWQ-LONGCOT-500K

Full Changelog: v3.0.1...v3.0.2

v3.0.1

27 Dec 03:45

New Features:

  1. Support for training, inference, and deployment of SequenceClassification models. You can check the following examples: qwen2.5, bert.
  2. LlamaPro supports multimodal models, such as qwen2vl, internvl2.5, and llama3-vision.

New Models:

  1. Qwen/QVQ-72B-Preview
  2. iic/DocOwl2
  3. OpenGVLab/InternVL2-Pretrain-Models, OpenGVLab/InternVL2_5-4B-AWQ series, OpenGVLab/InternVL2_5-1B-MPO series
  4. deepseek-ai/DeepSeek-V3 series
  5. answerdotai/ModernBERT-base series
  6. AI-ModelScope/paligemma2-3b-pt-224 series, AI-ModelScope/paligemma2-3b-ft-docci-448 series
  7. AI-ModelScope/Skywork-o1-Open-Llama-3.1-8B

Full Changelog: v3.0.0...v3.0.1

v3.0.0

23 Dec 03:17

Architecture Modifications and New Features:

For more details, please visit: https://swift.readthedocs.io/en/latest/Instruction/ReleaseNote3.0.html

New Models:

  1. OpenGVLab/InternVL2_5-1B series models
  2. LLM-Research/Llama-3.3-70B-Instruct
  3. BAAI/Emu3-Gen
  4. deepseek-ai/DeepSeek-V2.5-1210, deepseek-ai/deepseek-vl2 series models
  5. Shanghai_AI_Laboratory/internlm-xcomposer2d5-ol-7b
  6. InfiniAI/Megrez-3b-Instruct, InfiniAI/Megrez-3B-Omni
  7. TeleAI/TeleChat2-3B series models

Full Changelog: v2.6.1...v3.0.0

v2.6.1

29 Nov 09:29

New Models:

  1. Marco-o1
  2. mPLUG-Owl3-7B-241101
  3. QwQ-32B-Preview
  4. glm-edge, glm-edge-v

New Datasets:

  1. OpenO1-SFT

Full Changelog: v2.6.0...v2.6.1

v2.6.0

13 Nov 08:06

Models

  1. Supports the Qwen2.5-Coder models

Feature

  1. Supports the new loss and gradient accumulation algorithm from transformers.trainer, and fixes a bug in it
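
Assuming the item above refers to the token-count normalization change in how transformers computes loss under gradient accumulation, the arithmetic below shows why the two ways of averaging disagree when micro-batches contain different numbers of tokens. Both function names are hypothetical and the per-token losses are made-up values; this is a sketch of the idea, not the code in transformers or ms-swift.

```python
def mean_of_means(micro_batches):
    """Average each micro-batch's per-token losses, then average the averages."""
    per_batch = [sum(losses) / len(losses) for losses in micro_batches]
    return sum(per_batch) / len(per_batch)


def token_weighted_mean(micro_batches):
    """Sum every token's loss and divide by the total token count."""
    total_loss = sum(sum(losses) for losses in micro_batches)
    total_tokens = sum(len(losses) for losses in micro_batches)
    return total_loss / total_tokens


# Two accumulation steps: one token with loss 4.0, three tokens with loss 0.0
micro_batches = [[4.0], [0.0, 0.0, 0.0]]
naive = mean_of_means(micro_batches)        # (4.0 + 0.0) / 2 = 2.0
exact = token_weighted_mean(micro_batches)  # 4.0 / 4 = 1.0
```

Only the token-weighted version matches the loss that a single large batch holding all four tokens would produce, which is the behavior gradient accumulation is meant to reproduce.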

Full Changelog: v2.5.2...v2.6.0

v2.5.2

02 Nov 07:50

New Models:

  1. emu3-chat
  2. aya-expanse
  3. ministral-8b-inst-2410

New Datasets:

  1. llava-video-178k
  2. moviechat-1k-test

Full Changelog: v2.5.1...v2.5.2