CARVIEW |
Select Language
HTTP/2 200
date: Sat, 19 Jul 2025 18:40:03 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
etag: W/"f3298ca4270ca7031800247830cab9cd"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: no-referrer-when-downgrade
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=XLbXgM%2FOSWAt8YM%2BekGU8qil15%2FhNPrq1O8TXSjaQVHTs4rGL8dg939JMfNwsjkZqj5Vq7Il3dreOgW3kzVdnZcqqDh0KrzlmFZCTkU7DuVInfx9%2FcJjVHCpBy8BldRj7Ok7cjUVcV1hung4FWiCsP6hE4IpzbWCnHW7fFTjLy%2BG7G6WI2hjIPk99On0OZY7eu%2FQhILzy%2BoE%2BDi7KmwiGhvefBRWdGB0uztlSZ4NUy2J5BthL85Tv4PP9vOLSfEPNTrX03sJ8yCsnOVKQ4STVg%3D%3D--ey71hR01z%2B4dOqx8--BdLUD6HofCa%2BqtQngJMlCQ%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.1828959390.1752950402; Path=/; Domain=github.com; Expires=Sun, 19 Jul 2026 18:40:02 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Sun, 19 Jul 2026 18:40:02 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: BD78:E3E22:A3EC9A:D3D543:687BE682
2.2.4 Backend: TabbyAPI · av/harbor Wiki · GitHub
Skip to content
Navigation Menu
{{ message }}
-
-
Notifications
You must be signed in to change notification settings - Fork 128
2.2.4 Backend: TabbyAPI
av edited this page Apr 26, 2025
·
5 revisions
Handle:
tabbyapi
URL: https://localhost:33931
An OAI compatible exllamav2 API that's both lightweight and fast
- Supports same set of models as exllamav2
Harbor integrates with the HuggingFaceDownloader CLI which can be used to download models for the TabbyAPI service.
# [Optional] lookup models on the HF Hub
harbor hf find exl2
# [Optional] If pulling from the closed or gated repo
# Pre-configure the HF access token
harbor hf token <your-token>
# 1. Download the desired model, use "user/repo" specifier
# Note the "./hf" directory set as the download location - this is
# where the HuggingFace cache is mounted for downloader CLI
harbor hf dl -m Annuvin/gemma-2-2b-it-abliterated-4.0bpw-exl2 -s ./hf
harbor hf dl -m bartowski/Phi-3.1-mini-4k-instruct-exl2 -s ./hf -b 8_0
# 2. Set the model to run
# Use the same specifier as for the downloader
harbor tabbyapi model Annuvin/gemma-2-2b-it-abliterated-4.0bpw-exl2
harbor tabbyapi model bartowski/Phi-3.1-mini-4k-instruct-exl2
# 3. Start the service
harbor up tabbyapi
# Download with a model specifier
harbor hf download ChenMnZ/Mistral-Large-Instruct-2407-EfficientQAT-w2g64-GPTQ
# With a specific revision
harbor hf download turboderp/Llama-3.1-8B-Instruct-exl2 --revision 6.0bpw
# Grab actual name for the folder
harbor find ChenMnZ
# Set the model to run
harbor config set tabbyapi.model.specifier /hub/models--ChenMnZ--Mistral-Large-Instruct-2407-EfficientQAT-w2g64-GPTQ/snapshots/f46105941fa36d2663f77f11840c2f49a69d6681/
TabbyAPI exposes an OpenAI-compatible API and can be used with related services directly.
# [Optional] Pull the tabbyapi images
harbor pull tabbyapi
# Start the service
harbor up tabbyapi
# [Optional] Set additional arguments
harbor tabbyapi args --log-prompt true
# See TabbyAPI docs
harbor tabbyapi docs
Harbor will mount a few volumes for the TabbyAPI container:
- Host HuggingFace cache -
/models/hf
-
llama.cpp cache -
/models/llama.cpp
Clone this wiki locally
You can’t perform that action at this time.