Features
- Added support for multimodal models (748)
- Made the API consistent by relying on `LiteLLM` messages (657, 633)
- Made it possible to customize system prompts (745)
- Implemented automatic retries in case of refusals (747)
- Implemented batch inference for local models (658)
Docs
- Reworked docs to include how-to guides