- Optimize CPU RAM usage during quantization with offloading - Support FP8 conversion for DBRX, Mixtral, and Command R+
1.3.4
- Hotfix for LoRA checkpoint saving error.
1.3.3
New Features
- **FP8 Checkpoint Conversion**: We've introduced a new feature for FP8 checkpoint conversion. - **Sharded Safetensors Checkpoint Saving**: Added the ability to save sharded safetensors checkpoints. - **LoRA Support on Mistral Model**: We have added support for LoRA (Low-Rank Adaptation) on the Mistral model.
Bug Fixes
- **BF16 Hotfix**: Addressed an urgent issue with bf16 processing. - **BFloat Safetensors Conversion**: Fixed an issue related to bfloat conversion for safetensors. - **Automatic Token Refresh**: Resolved a bug affecting automatic token refresh.
1.3.2
- Add `base_model_name_or_path` option to `friendli model convert-adapter`. - Remove stale dependencies.
1.3.1
- Update protobuf schema. - Patch sending API requests with content type `application/protobuf`.
1.3.0
- Now resources of Friendli Dedicated Endpoints can be managed with CLI and SDK. The available resources are `endpoint`, `model`, `team`, and `project`. - Login with CLI is now available. SSO login is also available. - Update on Multi-LoRA checkpoint conversion.