New Models
* [Hermes 3](https://ollama.com/library/hermes3): Hermes 3 is the latest version of the flagship Hermes series of LLMs by Nous Research, which includes support for tool calling.
* [Phi 3.5](https://ollama.com/library/phi3.5): A lightweight AI model with 3.8 billion parameters with performance overtaking similarly and larger sized models.
* [SmolLM](https://ollama.com/library/smollm): A family of small models with 135M, 360M, and 1.7B parameters, trained on a new high-quality dataset.
What's Changed
* CUDA 12 support: improving performance by up to 10% on newer NVIDIA GPUs
* Improved performance of `ollama pull` and `ollama push` on slower connections
* Fixed issue where setting `OLLAMA_NUM_PARALLEL` would cause models to be reloaded on lower VRAM systems
* Ollama on Linux is now distributed as a `tar.gz` file, which contains the `ollama` binary along with required libraries.
New Contributors
* pamelafox made their first contribution in https://github.com/ollama/ollama/pull/6345
* eust-w made their first contribution in https://github.com/ollama/ollama/pull/5964
**Full Changelog**: https://github.com/ollama/ollama/compare/v0.3.6...v0.3.7