Release v0.2.0 · ollama/ollama · GitHub
v0.2.0
53da2c6
Concurrency
Ollama 0.2.0 is now available with concurrency support. This unlocks two features:
Parallel requests
Ollama can now serve multiple requests at the same time, using only a small amount of additional memory per request. This enables use cases such as:
- Handling multiple chat sessions at the same time
- Hosting a code completion LLM for your internal team
- Processing different parts of a document simultaneously
- Running several agents at the same time
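As an illustration of parallel requests, the following is a minimal sketch of a client that sends several prompts at once. It assumes a local Ollama server on the default port (11434), the non-streaming form of the /api/generate endpoint, and that a model named llama3 has been pulled. The helper names generate and generate_many are hypothetical and not part of Ollama.

```python
import json
import urllib.request
from concurrent.futures import ThreadPoolExecutor

# Assumption: a local Ollama server listening on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def generate(prompt, model="llama3", url=OLLAMA_URL):
    """Send one non-streaming generate request and return the response text."""
    payload = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode("utf-8")
    request = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as reply:
        return json.loads(reply.read())["response"]


def generate_many(prompts, send=generate, max_workers=4):
    """Issue all prompts concurrently; results come back in prompt order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(send, prompts))


# Example usage (requires a running server):
# answers = generate_many(["Summarize part 1.", "Summarize part 2."])
```

The send parameter exists only so the fan-out logic can be exercised without a server. How many requests the server actually runs in parallel depends on its configuration and available memory.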
Multiple models
Ollama now supports loading different models at the same time, dramatically improving:
- Retrieval Augmented Generation (RAG): both the embedding and text completion models can be loaded into memory simultaneously.
- Agents: multiple different agents can now run simultaneously
- Running large and small models side-by-side
Models are automatically loaded and unloaded based on requests and how much GPU memory is available.
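The RAG case can be sketched as follows: one model produces embeddings, another writes the answer, and both stay loaded between calls. This is a simplified illustration, not a full retrieval pipeline. It assumes a local server on the default port, the /api/embeddings and /api/generate endpoints, and that all-minilm and llama3 are pulled. The functions post_json, embed, complete, cosine, and answer are hypothetical helpers.

```python
import json
import math
import urllib.request

# Assumption: a local Ollama server on its default port.
BASE_URL = "http://localhost:11434"


def post_json(path, body, base_url=BASE_URL):
    """POST a JSON body to the server and decode the JSON reply."""
    data = json.dumps(body).encode("utf-8")
    request = urllib.request.Request(
        base_url + path, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as reply:
        return json.loads(reply.read())


def embed(text, model="all-minilm"):
    """Return the embedding vector for text from the embedding model."""
    return post_json("/api/embeddings", {"model": model, "prompt": text})["embedding"]


def complete(prompt, model="llama3"):
    """Return the completion text from the text generation model."""
    body = {"model": model, "prompt": prompt, "stream": False}
    return post_json("/api/generate", body)["response"]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def answer(question, documents, embed_fn=embed, complete_fn=complete):
    """Pick the document closest to the question and ask the model about it."""
    q_vec = embed_fn(question)
    best = max(documents, key=lambda doc: cosine(q_vec, embed_fn(doc)))
    return complete_fn(f"Context: {best}\n\nQuestion: {question}")
```

Because both models can remain in memory, alternating between embed and complete should not force a model reload on each call, provided enough GPU memory is available.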
To see which models are loaded, run ollama ps:
% ollama ps
NAME                 ID              SIZE      PROCESSOR    UNTIL
gemma:2b             030ee63283b5    2.8 GB    100% GPU     4 minutes from now
all-minilm:latest    1b226e2802db    530 MB    100% GPU     4 minutes from now
llama3:latest        365c0bd3c000    6.7 GB    100% GPU     4 minutes from now
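For scripting, the table printed by ollama ps can be turned into records. The sketch below assumes the columns are separated by at least two spaces, as in typical terminal output. That separator rule is an assumption about the layout, not a documented format, and parse_ps is a hypothetical helper.

```python
import re

# Sample text shaped like the listing above; column spacing is an assumption.
SAMPLE = """\
NAME                 ID              SIZE      PROCESSOR    UNTIL
gemma:2b             030ee63283b5    2.8 GB    100% GPU     4 minutes from now
all-minilm:latest    1b226e2802db    530 MB    100% GPU     4 minutes from now
"""


def parse_ps(output):
    """Split each non-empty line on runs of 2+ spaces and key cells by header."""
    lines = [line.strip() for line in output.splitlines() if line.strip()]
    if not lines:
        return []
    header = re.split(r"\s{2,}", lines[0])
    return [dict(zip(header, re.split(r"\s{2,}", line))) for line in lines[1:]]


rows = parse_ps(SAMPLE)
print([row["NAME"] for row in rows])
```

Splitting on two or more spaces keeps multi-word cells such as "2.8 GB" and "4 minutes from now" intact. The captured output of the real command could be passed to parse_ps in place of SAMPLE.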
For more information on concurrency, see the FAQ.
New models
- GLM-4: A strong multilingual general language model with performance competitive with Llama 3.
- CodeGeeX4: A versatile model for AI software development scenarios, including code completion.
- Gemma 2: Improved output quality, with base text generation models now available.
What's Changed
- Improved Gemma 2
  - Fixed issue where model would generate invalid tokens after hitting context window
  - Fixed inference output issues with gemma2:27b
  - Re-downloading the model may be required: ollama pull gemma2 or ollama pull gemma2:27b
- Ollama will now show a better error if a model architecture isn't supported
- Improved handling of quotes and spaces in Modelfile FROM lines
- Ollama will now return an error if the system does not have enough memory to run a model on Linux
New Contributors
- @Muku784 made their first contribution in #5382
- @abitrolly made their first contribution in #4821
Full Changelog: v0.1.48...v0.2.0