New features
- New ‘embed’ endpoint (`POST /indexes/{index_name}/embed`) (https://github.com/marqo-ai/marqo/pull/803). Marqo can now perform inference and return the embeddings for a single piece or list of content, where content can be either a string or weighted dictionary of strings. See usage [here](https://docs.marqo.ai/2.5/API-Reference/Embed/embed/).
- New ‘recommend’ endpoint (`POST /indexes/{index_name}/recommend`) (https://github.com/marqo-ai/marqo/pull/816). Given a list of existing document IDs, Marqo can now recommend similar documents by performing a search on interpolated vectors from the documents. See usage [here](https://docs.marqo.ai/2.5/API-Reference/Search/recommend/).
- Add Inference Cache to speed up frequent search and embed requests (https://github.com/marqo-ai/marqo/pull/802). Marqo now caches embeddings generated during inference. The cache size and type can be configured with `MARQO_INFERENCE_CACHE_SIZE` and `MARQO_INFERENCE_CACHE_TYPE`. See configuration instructions [here](https://docs.marqo.ai/2.5/Guides/Advanced-Usage/configuration/#configuring-cache).
- Add configurable search timeout (https://github.com/marqo-ai/marqo/pull/813). Backend timeout now defaults to 1s, but can be configured with the environment variable `VESPA_SEARCH_TIMEOUT_MS`. See configuration instructions [here](https://docs.marqo.ai/2.5/Guides/Advanced-Usage/configuration/#configuring-usage-limits).
- More informative `get_cuda_info` response (https://github.com/marqo-ai/marqo/pull/811). New keys: `utilization` `memory_used_percent` have been added for easier tracking of cuda device status. See [here](https://docs.marqo.ai/2.5/API-Reference/Device/get_cuda_information/) for more information.
Bug fixes and minor changes
- Upgraded `open_clip_torch`, `timm`, and `safetensors` for access to new models (https://github.com/marqo-ai/marqo/pull/810)
Contributor shout-outs
- Shoutout to all our 4.1k stargazers! Thanks for continuing to use our product and helping Marqo grow.
- Keep on sharing your questions and feedback on our [forum](https://community.marqo.ai/) and [Slack channel](https://marqo-community.slack.com/join/shared_invite/zt-2b4nsvbd2-TDf8agPszzWH5hYKBMIgDA#/shared-invite/email)! If you have any more inquiries or thoughts, please don’t hesitate to reach out.