Releases: ollama/ollama
v0.10.0
6dcc5df
Ollama's new app
Ollama's new app is available for macOS and Windows: Download Ollama

What's Changed
- `ollama ps` will now show the context length of loaded models
- Improved performance in `gemma3n` models by 2-3x
- Parallel request processing now defaults to 1. For more details, see the FAQ
- Fixed issue where tool calling would not work correctly with `granite3.3` and `mistral-nemo` models
- Fixed issue where Ollama's tool calling would not work correctly if a tool's name was part of another one, such as `add` and `get_address`
- Improved performance when using multiple GPUs by 10-30%
- Ollama's OpenAI-compatible API now supports WebP images
- Fixed issue where `ollama show` would report an error
- `ollama run` will now display errors more gracefully
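As a hedged sketch of how a WebP image might be sent through the OpenAI-compatible endpoint: the payload embeds the image as a base64 data URL. The model name, file bytes, and helper steps below are illustrative, not from the release notes.

```shell
# Hypothetical sketch: build a /v1/chat/completions request that embeds
# a WebP image as a base64 data URL. The image bytes and model name
# below are placeholders, not real values from the release.
printf 'RIFF0000WEBPVP8 ' > photo.webp   # stand-in bytes, not a real image
IMG=$(base64 < photo.webp | tr -d '\n')
BODY=$(printf '{
  "model": "llava",
  "messages": [{
    "role": "user",
    "content": [
      {"type": "text", "text": "Describe this image"},
      {"type": "image_url", "image_url": {"url": "data:image/webp;base64,%s"}}
    ]
  }]
}' "$IMG")
# Sanity-check the payload parses as JSON before sending:
echo "$BODY" | python3 -c 'import json, sys; json.load(sys.stdin); print("payload ok")'
# Send it to a running Ollama server:
# curl http://localhost:11434/v1/chat/completions -d "$BODY"
```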
New Contributors
- @sncix made their first contribution in #11189
- @mfornet made their first contribution in #11425
- @haiyuewa made their first contribution in #11427
- @warting made their first contribution in #11461
- @ycomiti made their first contribution in #11462
- @minxinyi made their first contribution in #11502
- @ruyut made their first contribution in #11528
Full Changelog: v0.9.6...v0.10.0
v0.9.6
43107b1
What's Changed
- Fixed styling issue in launch screen
- `tool_name` can now be provided in messages with `"role": "tool"` using the `/api/chat` endpoint
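A minimal sketch of the new field: a tool result sent back to `/api/chat`, naming the tool it came from. The model name, tool name, and content below are illustrative, not from the release notes.

```shell
# Hypothetical sketch: a tool result returned to /api/chat, identified
# via the new "tool_name" field. Model and tool names are illustrative.
BODY='{
  "model": "llama3.1",
  "messages": [
    {"role": "user", "content": "What is the weather in Toronto?"},
    {"role": "tool", "tool_name": "get_weather", "content": "11 degrees and sunny"}
  ]
}'
# Sanity-check the payload parses as JSON before sending:
echo "$BODY" | python3 -c 'import json, sys; json.load(sys.stdin); print("payload ok")'
# Send it to a running Ollama server:
# curl http://localhost:11434/api/chat -d "$BODY"
```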
Full Changelog: v0.9.5...v0.9.6-rc0
v0.9.5
5d8c173
Updates to Ollama for macOS and Windows
A new version of Ollama's macOS and Windows applications is now available. New improvements to the apps will be introduced over the coming releases:

New features
Expose Ollama on the network
Ollama can now be exposed on the network, allowing others to access Ollama on other devices or even over the internet. This is useful for having Ollama running on a powerful Mac, PC or Linux computer while making it accessible to less powerful devices.
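The desktop apps expose this as a settings toggle; on the server side the equivalent is the `OLLAMA_HOST` environment variable. A minimal sketch (the port shown is Ollama's default):

```shell
# Bind the Ollama server to all network interfaces instead of loopback.
export OLLAMA_HOST=0.0.0.0:11434
echo "Ollama will listen on $OLLAMA_HOST"
# ollama serve   # start the server with this binding
```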
Model directory
The directory in which models are stored can now be modified! This allows models to be stored on external hard disks or alternative directories than the default.
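The apps expose this in settings; the server-side equivalent is the `OLLAMA_MODELS` environment variable. A sketch with an illustrative external-disk path:

```shell
# Store models on an external disk rather than the default location.
export OLLAMA_MODELS=/Volumes/External/ollama-models
echo "Models will be stored in $OLLAMA_MODELS"
# ollama serve   # start the server using this model directory
```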
Smaller footprint and faster starting on macOS
The macOS app is now a native application and starts much faster while requiring a much smaller installation.
Additional changes in 0.9.5
- Fixed issue where the `ollama` CLI would not be installed by Ollama on macOS on startup
- Fixed issue where files in `ollama-darwin.tgz` were not notarized
- Add NativeMind to Community Integrations by @xukecheng in #11242
- Ollama for macOS now requires version 12 (Monterey) or newer
New Contributors
- @xukecheng made their first contribution in #11242
v0.9.4
44b17d2
Updates to Ollama for macOS and Windows
A new version of Ollama's macOS and Windows applications is now available. New improvements to the apps will be introduced over the coming releases:

New features
Expose Ollama on the network
Ollama can now be exposed on the network, allowing others to access Ollama on other devices or even over the internet. This is useful for having Ollama running on a powerful Mac, PC or Linux computer while making it accessible to less powerful devices.
Model directory
The directory in which models are stored can now be modified! This allows models to be stored on external hard disks or alternative directories than the default.
Smaller footprint and faster starting on macOS
The macOS app is now a native application and starts much faster while requiring a much smaller installation.
What's Changed
- Reduced download size and startup time for Ollama on macOS
- Tool calling with empty parameters will now work correctly
- Fixed issue when quantizing models with the Gemma 3n architecture
- Ollama for macOS should no longer ask for root privileges when updating unless required
- Ollama for macOS now requires version 12 (Monterey) or newer
Full Changelog: v0.9.3...v0.9.4
v0.9.3
ba04902
Gemma 3n
Ollama now supports Gemma 3n.
Gemma 3n models are designed for efficient execution on everyday devices such as laptops, tablets or phones. These models were trained with data in over 140 spoken languages.
Effective 2B
ollama run gemma3n:e2b
Effective 4B
ollama run gemma3n:e4b
What's Changed
- Fixed issue where errors would not be properly reported on Apple Silicon Macs
- Ollama will now limit context length to what the model was trained against to avoid strange overflow behavior
Full Changelog: v0.9.2...v0.9.3
v0.9.2
ed567ef
What's Changed
- Fixed issue where tool calls without parameters would not be returned correctly
- Fixed `does not support generate` errors
- Fixed issue where some special tokens would not be tokenized properly for some model architectures
Full Changelog: v0.9.1...v0.9.2
v0.9.1
5a8eb0e
Tool calling improvements
New tool calling support
The following models now support tool calling:
- DeepSeek-R1-0528 (671B model)
- Magistral
Tool calling reliability has also been improved for several other models.
To re-download the models, use `ollama pull`.
New Ollama for macOS and Windows preview
A new version of Ollama's macOS and Windows applications is available to test for early feedback. New improvements to the apps will be introduced over the coming releases:
If you have feedback, please create an issue on GitHub with the `app` label. These apps will automatically update themselves to future versions of Ollama, so you may have to redownload new preview versions in the future.

New features
Expose Ollama on the network
Ollama can now be exposed on the network, allowing others to access Ollama on other devices or even over the internet. This is useful for having Ollama running on a powerful Mac, PC or Linux computer while making it accessible to less powerful devices.
Allow local browser access
Enabling this allows websites to access your local installation of Ollama. This is handy for developing browser-based applications using Ollama's JavaScript library.
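On the server side, the corresponding control is the `OLLAMA_ORIGINS` environment variable, which lists the browser origins allowed to call the API. A sketch with an illustrative origin:

```shell
# Allow a specific website origin to call the local Ollama server.
export OLLAMA_ORIGINS="https://example.com"
echo "Allowed origins: $OLLAMA_ORIGINS"
# ollama serve   # start the server with this CORS allowlist
```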
Model directory
The directory in which models are stored can now be modified! This allows models to be stored on external hard disks or alternative directories than the default.
Smaller footprint and faster starting on macOS
The macOS app is now a native application and starts much faster while requiring a much smaller installation.
What's Changed
- Magistral now supports disabling thinking mode. Note: it is also recommended to change the system prompt when doing so.
- Error messages that previously showed `POST predict` will now be more informative
- Improved tool calling reliability for some models
- Fixed issue on Windows where `ollama run` would not start Ollama automatically
New Contributors
- @JasonHonKL made their first contribution in #10174
- @hwittenborn made their first contribution in #10998
- @krzysztofjeziorny made their first contribution in #10973
Full Changelog: v0.9.0...v0.9.1
v0.9.0
5f57b0e
New models
- DeepSeek-R1-0528: DeepSeek-R1 has received a minor version upgrade to DeepSeek-R1-0528 for the 8 billion parameter distilled model and the full 671 billion parameter model. In this update, DeepSeek-R1 has significantly improved its reasoning and inference capabilities.
Thinking
Ollama now has the ability to enable or disable thinking. This gives users the flexibility to choose the model's thinking behavior for different applications and use cases.
When thinking is enabled, the output will separate the model's thinking from the model's output. When thinking is disabled, the model will not think and will output the content directly.
Models that support thinking:
- DeepSeek R1
- Qwen 3
- More will be added to the list of thinking models.
When running a model that supports thinking, Ollama will now display the model's thoughts:
% ollama run deepseek-r1
>>> How many Rs are in strawberry
Thinking...
First, I need to understand what the question is asking. It's asking how many letters 'R' are present in the word "strawberry."
Next, I'll examine each letter in the word individually.
I'll start from the beginning and count every occurrence of the letter 'R.'
After reviewing all the letters, I determine that there are three instances where the letter 'R' appears in the word "strawberry."
...done thinking.
There are three **Rs** in the word **"strawberry"**.
In Ollama's API, a model's thinking is now returned as a separate `thinking` field for easy parsing:
{
  "message": {
    "role": "assistant",
    "thinking": "First, I need to understand what the question is asking. It's asking how many letters 'R' are present in the word \"strawberry...",
    "content": "There are **3** instances of the letter **R** in the word **\"strawberry.\"**"
  }
}
Turning thinking on and off
In the API, thinking can be enabled by passing `"think": true` and disabled by passing `"think": false`:
curl http://localhost:11434/api/chat -d '{
  "model": "deepseek-r1",
  "messages": [
    {
      "role": "user",
      "content": "Why is the sky blue?"
    }
  ],
  "think": true
}'
In Ollama's CLI, use `/set think` and `/set nothink` to enable and disable thinking.
What's Changed
- Add thinking support to Ollama
Full Changelog: v0.8.0...v0.9.0
v0.8.0
aa25aff
What's Changed
- Ollama will now stream responses that include tool calls (see the blog post)
- Logs will now include better memory estimate debug information when running models in Ollama's engine.
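A hedged sketch of a streaming request with a tool attached; the model name and tool schema below are illustrative. With `"stream": true` (the `/api/chat` default), tool calls now arrive as part of the streamed response rather than only at the end:

```shell
# Hypothetical sketch: a streaming /api/chat request with one tool
# defined. Model name and tool schema are illustrative.
BODY='{
  "model": "qwen3",
  "messages": [{"role": "user", "content": "What is 3 + 4?"}],
  "stream": true,
  "tools": [{
    "type": "function",
    "function": {
      "name": "add",
      "description": "Add two numbers",
      "parameters": {
        "type": "object",
        "properties": {
          "a": {"type": "number"},
          "b": {"type": "number"}
        },
        "required": ["a", "b"]
      }
    }
  }]
}'
# Sanity-check the payload parses as JSON before sending:
echo "$BODY" | python3 -c 'import json, sys; json.load(sys.stdin); print("payload ok")'
# Send it to a running Ollama server:
# curl http://localhost:11434/api/chat -d "$BODY"
```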
New Contributors
- @hellotunamayo made their first contribution in #10790
Full Changelog: v0.7.1...v0.8.0
v0.7.1
884d260
What's Changed
- Improved model memory management to allocate sufficient memory to prevent crashes when running multimodal models in certain situations
- Enhanced memory estimation for models to prevent unintended memory offloading
- `ollama show` will now show `...` when data is truncated
- Fixed crash that would occur with `qwen2.5vl`
- Fixed crash on Nvidia's CUDA for `llama3.2-vision`
- Support for Alibaba's Qwen 3 and Qwen 2 architectures in Ollama's new multimodal engine
New Contributors
- @ronxldwilson made their first contribution in #10763
- @DarkCaster made their first contribution in #10779
Full Changelog: v0.7.0...v0.7.1