CARVIEW |
Select Language
HTTP/2 200
date: Fri, 25 Jul 2025 02:49:10 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
etag: W/"33b2d4681da94dccc162355eb53218fb"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: no-referrer-when-downgrade
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=mYqg2AymT5eHGyqmnWy5RTnl1kApT%2Btv2%2Bdo99evYv5H7A%2BJ5Fpfjr1PIEP9vyvtMIvLyzb364MxmFtakKty%2BkgBdk3gkj5IslHwmbrQH%2BOhWY3EFQ3n5kEIL2dYYTTTX9j4Px%2FQAlA%2FvcC4xjffe1LZRZgJAYZOrxLsUIyhoGItSSiuD6AGqcIYjHAyX36XwJj9riWZaOgS7QlbTg85JFGass7YXnQJ4qjw4BCavyCCCvdEObMszwP1E70Kjw%2FK5%2BA9RIqOAKsTcEKef6wxhA%3D%3D--TTl5MMW3Jv1Rc7fs--UjaLOjti0sFd9y1guopqrw%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.2066905473.1753411749; Path=/; Domain=github.com; Expires=Sat, 25 Jul 2026 02:49:09 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Sat, 25 Jul 2026 02:49:09 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: E16E:69B36:DEFD5:14D7E8:6882F0A5
Release first official release · vectorch-ai/ScaleLLM · GitHub
Loading
Skip to content
Navigation Menu
{{ message }}
-
Notifications
You must be signed in to change notification settings - Fork 37
first official release
Compare
ScaleLLM is a high-performance inference system for large language models, designed for production environments. It supports most popular open-source models, including Llama2, Bloom, GPT-NeoX, and more.
- High Performance: ScaleLLM is optimized for high-performance LLM inference.
- Tensor Parallelism: Utilizes tensor parallelism for efficient model execution.
- OpenAI-compatible API Efficient golang rest api server that compatible with OpenAI.
- Huggingface models Integration Seamless integration with most popular HF models.
- Customizable: Offers flexibility for customization to meet your specific needs.
- Production Ready: Designed to be deployed in production environments.
Supported Models
Models | Tensor Parallel | Quantization | HF models examples |
---|---|---|---|
Llama2 | Yes | Yes | meta-llama/Llama-2-7b, TheBloke/Llama-2-13B-chat-GPTQ, TheBloke/Llama-2-70B-AWQ |
Aquila | Yes | Yes | BAAI/Aquila-7B, BAAI/AquilaChat-7B |
Bloom | Yes | Yes | bigscience/bloom |
GPT_j | Yes | Yes | EleutherAI/gpt-j-6b |
GPT_NeoX | Yes | -- | EleutherAI/gpt-neox-20b |
GPT2 | Yes | -- | gpt2 |
InternLM | Yes | Yes | internlm/internlm-7b |
Mistral | Yes | Yes | mistralai/Mistral-7B-v0.1 |
MPT | Yes | Yes | mosaicml/mpt-30b |
Assets 2
5 people reacted
You can’t perform that action at this time.