![Llama 3](https://github.com/ollama/ollama/assets/3325447/8dc9c472-9d72-4b39-95ae-2c85ada375b9)
New models:
* [Llama 3](https://ollama.com/library/llama3): a new model by Meta, and the most capable openly available LLM to date
* [Phi 3 Mini](https://ollama.com/library/phi3): a new 3.8B parameters, lightweight, state-of-the-art open model by Microsoft.
* [Moondream](https://ollama.com/library/moondream) moondream is a small vision language model designed to run efficiently on edge devices.
* [Llama 3 Gradient 1048K](https://ollama.com/library/llama3-gradient): A Llama 3 fine-tune by Gradient to support up to a 1M token context window.
* [Dolphin Llama 3](https://ollama.com/library/dolphin-llama3): The uncensored Dolphin model, trained by Eric Hartford and based on Llama 3 with a variety of instruction, conversational, and coding skills.
* [Qwen 110B](https://ollama.com/library/qwen:110b): The first Qwen model over 100B parameters in size with outstanding performance in evaluations
What's Changed
* Fixed issues where the model would not terminate, causing the API to hang.
* Fixed a series of out of memory errors on Apple Silicon Macs
* Fixed out of memory errors when running Mixtral architecture models
Experimental concurrency features
New concurrency features are coming soon to Ollama. They are available
* `OLLAMA_NUM_PARALLEL`: Handle multiple requests simultaneously for a single model
* `OLLAMA_MAX_LOADED_MODELS`: Load multiple models simultaneously
To enable these features, set the environment variables for `ollama serve`. For more info see [this guide](https://github.com/ollama/ollama/blob/main/docs/faq.md#how-do-i-configure-ollama-server):
OLLAMA_NUM_PARALLEL=4 OLLAMA_MAX_LOADED_MODELS=4 ollama serve
New Contributors
* hmartinez82 made their first contribution in https://github.com/ollama/ollama/pull/3972
* Cephra made their first contribution in https://github.com/ollama/ollama/pull/4037
* arpitjain099 made their first contribution in https://github.com/ollama/ollama/pull/4007
* MarkWard0110 made their first contribution in https://github.com/ollama/ollama/pull/4031
* alwqx made their first contribution in https://github.com/ollama/ollama/pull/4073
* sidxt made their first contribution in https://github.com/ollama/ollama/pull/3705
* ChengenH made their first contribution in https://github.com/ollama/ollama/pull/3789
* secondtruth made their first contribution in https://github.com/ollama/ollama/pull/3503
* reid41 made their first contribution in https://github.com/ollama/ollama/pull/3612
* ericcurtin made their first contribution in https://github.com/ollama/ollama/pull/3626
* JT2M0L3Y made their first contribution in https://github.com/ollama/ollama/pull/3633
* datvodinh made their first contribution in https://github.com/ollama/ollama/pull/3655
* MapleEve made their first contribution in https://github.com/ollama/ollama/pull/3817
* swuecho made their first contribution in https://github.com/ollama/ollama/pull/3810
* brycereitano made their first contribution in https://github.com/ollama/ollama/pull/3895
* bsdnet made their first contribution in https://github.com/ollama/ollama/pull/3889
* fyxtro made their first contribution in https://github.com/ollama/ollama/pull/3855
* natalyjazzviolin made their first contribution in https://github.com/ollama/ollama/pull/3962
**Full Changelog**: https://github.com/ollama/ollama/compare/v0.1.32...v0.1.33