ollama Changelog

0.1.47

![Ollama Gemma 2 illustration](https://github.com/ollama/ollama/assets/3325447/3dc55e12-1ab9-4d28-baa4-4608def3cb3d)

What's Changed
* Added support for Google Gemma 2 models (9B and 27B)
* Fixed issues with `ollama create` when importing from Safetensors

A special thank you to the Google Cloud and DeepMind team members for Gemma 2 support.

**Full Changelog**: https://github.com/ollama/ollama/compare/v0.1.46...v0.1.47

0.1.46

![ollama run](https://github.com/ollama/ollama/assets/3325447/29eb08bb-1494-46c0-baa9-19f118c54cf4)

What's Changed
* Increased model loading speed with `ollama run`, especially if running an already-loaded model
* Improved performance of `/api/show` including for large models
* Fixed an issue where the `--quantize` flag in `ollama create` would lead to an error
* Improved model loading times when models would not completely fit in system memory on Linux
* Fixed issue where certain `Modelfile` parameters would not be parsed correctly

**Full Changelog**: https://github.com/ollama/ollama/compare/v0.1.45...v0.1.46

0.1.45

New models
* [DeepSeek-Coder-V2](https://ollama.com/library/deepseek-coder-v2): A 16B & 236B open-source Mixture-of-Experts code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.

`ollama show`

`ollama show` will now show model details such as context length, parameters, embedding size, license and more:

    % ollama show llama3
      Model
            arch                    llama
            parameters              8.0B
            quantization            Q4_0
            context length          8192
            embedding length        4096

      Parameters
            num_keep        24
            stop            "<|start_header_id|>"
            stop            "<|end_header_id|>"
            stop            "<|eot_id|>"

      License
            META LLAMA 3 COMMUNITY LICENSE AGREEMENT
            Meta Llama 3 Version Release Date: April 18, 2024
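
The same details can also be requested over HTTP from the `/api/show` endpoint. The following is a minimal sketch, not an official client: it assumes Ollama is listening on its default local address (`http://localhost:11434`) and that the request body names the model in a `name` field. The helper names `build_show_request` and `show_model` are made up for illustration.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434"


def build_show_request(model: str, base_url: str = OLLAMA_URL) -> urllib.request.Request:
    """Build (but do not send) a POST request for /api/show."""
    # Assumption: the model is identified by a "name" field in the JSON body.
    body = json.dumps({"name": model}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/api/show",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def show_model(model: str, base_url: str = OLLAMA_URL) -> dict:
    """Send the request and decode the JSON reply (requires a running server)."""
    with urllib.request.urlopen(build_show_request(model, base_url)) as resp:
        return json.load(resp)


request = build_show_request("llama3")
```

With a server running and `llama3` pulled, `show_model("llama3")` would return the decoded JSON response as a dictionary.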

What's Changed
* `ollama show <model>` will now show model information such as context window size
* Model loading on Windows with CUDA GPUs is now faster
* Setting `seed` in the `/v1/chat/completions` OpenAI compatibility endpoint no longer changes `temperature`
* Enhanced GPU discovery and multi-GPU support with concurrency
* The Linux install script will now skip searching for network devices
* Introduced a workaround for AMD Vega RX 56 SDMA support on Linux
* Fixed memory prediction for `deepseek-v2` and `deepseek-coder-v2` models
* The `/api/show` endpoint now returns extensive model metadata
* GPU configuration variables are now reported in `ollama serve`
* Updated Linux ROCm to v6.1.1
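
To illustrate the `seed` fix above, here is a hedged sketch of a request to the OpenAI compatibility endpoint that sets `seed` and `temperature` explicitly. The default local address, the prompt, the chosen values, and the helper names `build_chat_payload` and `chat` are assumptions for illustration only.

```python
import json
import urllib.request

# Assumption: Ollama is listening on its default local address.
CHAT_URL = "http://localhost:11434/v1/chat/completions"


def build_chat_payload(model: str, prompt: str, seed: int, temperature: float) -> dict:
    """Return an OpenAI-style chat completion body with both settings explicit."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "seed": seed,                # fixed seed for repeatable sampling
        "temperature": temperature,  # sent unchanged alongside the seed
    }


def chat(payload: dict, url: str = CHAT_URL) -> dict:
    """POST the payload and decode the JSON reply (requires a running server)."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


payload = build_chat_payload("llama3", "Why is the sky blue?", seed=42, temperature=0.7)
```

Sending the same payload twice with `chat(payload)` and comparing the replies is one way to check how the two settings interact on a given installation.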

New Contributors
* jayson-cloude made their first contribution in https://github.com/ollama/ollama/pull/4972

**Full Changelog**: https://github.com/ollama/ollama/compare/v0.1.44...v0.1.45

0.1.44

What's Changed
* Fixed issue where Unicode characters such as emojis would not be loaded correctly when running `ollama create`
* Fixed certain cases where Nvidia GPUs would not be detected correctly and would be reported as compute capability 1.0 devices

**Full Changelog**: https://github.com/ollama/ollama/compare/v0.1.43...v0.1.44

0.1.43

![Ollama honest work](https://github.com/ollama/ollama/assets/3325447/06b05d79-1872-45d9-bed3-76f72afa2baf)

What's Changed
* New [import.md](https://github.com/ollama/ollama/blob/main/docs/import.md) guide for converting and importing models to Ollama
* Fixed issue where embedding vectors resulting from `/api/embeddings` would not be accurate
* JSON mode responses will no longer include invalid escape characters
* Removing a model will no longer show incorrect `File not found` errors
* Fixed issue where running `ollama create` would result in an error on Windows with certain file formatting
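
One way to sanity-check vectors returned by `/api/embeddings` is to compare them with cosine similarity. The sketch below is illustrative rather than an official client: it assumes the default local address, a request body with `model` and `prompt` fields, and a response carrying the vector under `embedding`. The helper names `embed` and `cosine_similarity` are hypothetical.

```python
import json
import math
import urllib.request

# Assumption: Ollama is listening on its default local address.
EMBEDDINGS_URL = "http://localhost:11434/api/embeddings"


def embed(model: str, prompt: str, url: str = EMBEDDINGS_URL) -> list:
    """Request one embedding vector (requires a running server and a pulled model)."""
    body = json.dumps({"model": model, "prompt": prompt}).encode("utf-8")
    req = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["embedding"]


def cosine_similarity(a, b) -> float:
    """Cosine of the angle between two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)
```

Embedding two closely related sentences and two unrelated ones, then comparing the scores, gives a rough signal of whether the vectors behave sensibly.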

New Contributors
* erhant made their first contribution in https://github.com/ollama/ollama/pull/4854
* nischalj10 made their first contribution in https://github.com/ollama/ollama/pull/4612
* dcasota made their first contribution in https://github.com/ollama/ollama/pull/4852
* Napuh made their first contribution in https://github.com/ollama/ollama/pull/4084
* hughescr made their first contribution in https://github.com/ollama/ollama/pull/3782
* jimscard made their first contribution in https://github.com/ollama/ollama/pull/3382

**Full Changelog**: https://github.com/ollama/ollama/compare/v0.1.42...v0.1.43

0.1.42

New models
* [Qwen 2](https://ollama.com/library/qwen2): a new series of large language models from Alibaba Group

What's Changed
* Fixed issue where `qwen2` would output erroneous text such as `GGG` on Nvidia and AMD GPUs
* `ollama pull` is now faster if it detects a model is already downloaded
* `ollama create` will now automatically detect prompt templates for popular model architectures such as Llama, Gemma, Phi and more.
* Ollama can now be accessed from local apps built with Electron and Tauri, as well as when developing apps in local HTML files
* Updated the welcome prompt on Windows to `llama3`
* Fixed issues where `/api/ps` and `/api/tags` would show invalid timestamps in responses
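
A client that reads timestamps from `/api/ps` or `/api/tags` may want to validate them before use. The sketch below assumes the timestamps are RFC 3339 style strings, possibly with more fractional digits than Python's `datetime` can store. The helper name `parse_timestamp` and the sample values are illustrative assumptions, not output captured from a real server.

```python
import re
from datetime import datetime


def parse_timestamp(value: str) -> datetime:
    """Parse an RFC 3339 style timestamp into a timezone-aware datetime.

    Fractional seconds are normalised to exactly six digits because
    datetime stores microseconds only; extra digits are truncated.
    """
    value = re.sub(
        r"\.(\d+)",
        lambda m: "." + m.group(1)[:6].ljust(6, "0"),
        value,
        count=1,
    )
    # Older Python versions do not accept a trailing "Z" for UTC.
    if value.endswith("Z"):
        value = value[:-1] + "+00:00"
    return datetime.fromisoformat(value)


# Illustrative sample value with nanosecond precision and a UTC offset.
modified_at = parse_timestamp("2023-11-04T14:56:49.277302595-07:00")
```

A `ValueError` from `parse_timestamp` would indicate a string that does not match the assumed format.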

New Contributors
* shoebham made their first contribution in https://github.com/ollama/ollama/pull/4766
* kartikm7 made their first contribution in https://github.com/ollama/ollama/pull/4719
* royjhan made their first contribution in https://github.com/ollama/ollama/pull/4822

**Full Changelog**: https://github.com/ollama/ollama/compare/v0.1.41...v0.1.42
