HTTP/2 200
date: Tue, 30 Dec 2025 06:38:46 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
etag: W/"0661c75b0dd8fa6d95c9eb16299e3217"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: origin-when-cross-origin, strict-origin-when-cross-origin
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com github.githubassets.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com wss://alive-staging.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com marketplace-screenshots.githubusercontent.com/ copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com github.githubassets.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=%2B9mQo%2Bh%2Bu6rwLGWdIX4O%2FrTs2QtQRLL6nAomnNRFNJYxEBu3%2Ft7CA4saGGOjpQZDmmdUIjjVXOIb4bf8I2TJg6vjh%2BmnD%2F4%2Fj09wm2LEdjyn2j2HxCfOZLUYhQPNswdmSnbwPgBoEQoErOQk95pkr0Yy%2BqHAfdhdrW6pKk0rsSzCMo4Os1uFLhRh6mx0n49%2ByfQor0dmUP5zQST5m395j%2BKZmLPKqcxlKsBE8prjiHomBeKEaPVvk9JiCY9LesMiFVCAHA%2FzbmqdA%2FQmr0xxyw%3D%3D--LCRvWDeZgb0QSaJ0--ptU4ElWzpoH98iJ1tT63ow%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.146182111.1767076725; Path=/; Domain=github.com; Expires=Wed, 30 Dec 2026 06:38:45 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Wed, 30 Dec 2026 06:38:45 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: DA5A:1D06DD:296C82:2DAB01:69537375
xlite-dev · GitHub
xlite-dev
Develop ML/AI toolkits and ML/AI/CUDA Learning resources.
Pinned
Loading
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Cuda
9.1k
898
🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉
C++
4.3k
769
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
Python
4.9k
330
📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉
Python
480
24
💎An easy-to-use PyTorch library for face landmarks detection: training, evaluation, inference, and 100+ data augmentations.🎉
Python
268
27
🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.
Cuda
242
12
Repositories
Showing 10 of 52 repositories
Qwen-Image
Public
Forked from
QwenLM/Qwen-Image
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
xlite-dev/Qwen-Image’s past year of commit activity
Python
1
Apache-2.0
374
0
0
Updated Dec 25, 2025
xlite-dev/Z-Image’s past year of commit activity
Python
1
Apache-2.0
479
0
0
Updated Dec 25, 2025
diffusers
Public
Forked from
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
xlite-dev/diffusers’s past year of commit activity
Python
0
Apache-2.0
6,713
0
0
Updated Dec 24, 2025
LeetCUDA
Public
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
xlite-dev/LeetCUDA’s past year of commit activity
lite.ai.toolkit
Public
🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉
xlite-dev/lite.ai.toolkit’s past year of commit activity
sglang
Public
Forked from
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
xlite-dev/sglang’s past year of commit activity
Python
0
Apache-2.0
3,917
0
0
Updated Dec 12, 2025
xlite-dev/vllm-omni’s past year of commit activity
Python
0
Apache-2.0
234
0
0
Updated Dec 11, 2025
SageAttention
Public
Forked from
thu-ml/SageAttention
Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.
xlite-dev/SageAttention’s past year of commit activity
Cuda
0
Apache-2.0
301
0
0
Updated Dec 3, 2025
Awesome-LLM-Inference
Public
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
xlite-dev/Awesome-LLM-Inference’s past year of commit activity
Awesome-DiT-Inference
Public
📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉
xlite-dev/Awesome-DiT-Inference’s past year of commit activity
Python
480
GPL-3.0
24
0
0
Updated Nov 28, 2025
Most used topics
Loading…
You can’t perform that action at this time.