CARVIEW |
Select Language
HTTP/2 200
date: Sat, 26 Jul 2025 19:24:34 GMT
content-type: text/html; charset=utf-8
vary: X-PJAX, X-PJAX-Container, Turbo-Visit, Turbo-Frame, X-Requested-With,Accept-Encoding, Accept, X-Requested-With
x-robots-tag: none
etag: W/"439d35f822e6e5f0bf4a175211cbad19"
cache-control: max-age=0, private, must-revalidate
strict-transport-security: max-age=31536000; includeSubdomains; preload
x-frame-options: deny
x-content-type-options: nosniff
x-xss-protection: 0
referrer-policy: no-referrer-when-downgrade
content-security-policy: default-src 'none'; base-uri 'self'; child-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/; connect-src 'self' uploads.github.com www.githubstatus.com collector.github.com raw.githubusercontent.com api.github.com github-cloud.s3.amazonaws.com github-production-repository-file-5c1aeb.s3.amazonaws.com github-production-upload-manifest-file-7fdce7.s3.amazonaws.com github-production-user-asset-6210df.s3.amazonaws.com *.rel.tunnels.api.visualstudio.com wss://*.rel.tunnels.api.visualstudio.com objects-origin.githubusercontent.com copilot-proxy.githubusercontent.com proxy.individual.githubcopilot.com proxy.business.githubcopilot.com proxy.enterprise.githubcopilot.com *.actions.githubusercontent.com wss://*.actions.githubusercontent.com productionresultssa0.blob.core.windows.net/ productionresultssa1.blob.core.windows.net/ productionresultssa2.blob.core.windows.net/ productionresultssa3.blob.core.windows.net/ productionresultssa4.blob.core.windows.net/ productionresultssa5.blob.core.windows.net/ productionresultssa6.blob.core.windows.net/ productionresultssa7.blob.core.windows.net/ productionresultssa8.blob.core.windows.net/ productionresultssa9.blob.core.windows.net/ productionresultssa10.blob.core.windows.net/ productionresultssa11.blob.core.windows.net/ productionresultssa12.blob.core.windows.net/ productionresultssa13.blob.core.windows.net/ productionresultssa14.blob.core.windows.net/ productionresultssa15.blob.core.windows.net/ productionresultssa16.blob.core.windows.net/ productionresultssa17.blob.core.windows.net/ productionresultssa18.blob.core.windows.net/ productionresultssa19.blob.core.windows.net/ github-production-repository-image-32fea6.s3.amazonaws.com github-production-release-asset-2e65be.s3.amazonaws.com insights.github.com wss://alive.github.com api.githubcopilot.com api.individual.githubcopilot.com api.business.githubcopilot.com api.enterprise.githubcopilot.com; font-src github.githubassets.com; form-action 'self' github.com gist.github.com copilot-workspace.githubnext.com objects-origin.githubusercontent.com; frame-ancestors 'none'; frame-src viewscreen.githubusercontent.com notebooks.githubusercontent.com; img-src 'self' data: blob: github.githubassets.com media.githubusercontent.com camo.githubusercontent.com identicons.github.com avatars.githubusercontent.com private-avatars.githubusercontent.com github-cloud.s3.amazonaws.com objects.githubusercontent.com release-assets.githubusercontent.com secured-user-images.githubusercontent.com/ user-images.githubusercontent.com/ private-user-images.githubusercontent.com opengraph.githubassets.com copilotprodattachments.blob.core.windows.net/github-production-copilot-attachments/ github-production-user-asset-6210df.s3.amazonaws.com customer-stories-feed.github.com spotlights-feed.github.com objects-origin.githubusercontent.com *.githubusercontent.com; manifest-src 'self'; media-src github.com user-images.githubusercontent.com/ secured-user-images.githubusercontent.com/ private-user-images.githubusercontent.com github-production-user-asset-6210df.s3.amazonaws.com gist.github.com; script-src github.githubassets.com; style-src 'unsafe-inline' github.githubassets.com; upgrade-insecure-requests; worker-src github.githubassets.com github.com/assets-cdn/worker/ github.com/assets/ gist.github.com/assets-cdn/worker/
server: github.com
content-encoding: gzip
accept-ranges: bytes
set-cookie: _gh_sess=MOZJKX%2BGXITmFOOerDj8B4Ycwp1r6FLf%2F1NC2ALmc86f2zlzuwUPTU7zoKRJYLukeqgRU6j9qwMy%2B2Fy6naedMDpb88BkuglvBngB2kFxY71Dzs2INcNlkPLHxD%2FzOTNuc5l9zgvihb8xQiL%2B762Ug%2Fm%2Bc2oj%2BLiipecwXmJe6q3i8eS412eFi%2FPg%2BjNUYIA9OGxtiEQ3vjjcwg3F%2BqeTaryvD9fUKR9xA9%2F30nos7EfiMeaaCjMs0k%2F8rctI1d12ZZdyqhih9Ps%2F%2BlopGsQmg%3D%3D--36z%2FzYnV%2F%2FMuo1sa--8NBS3Me1MqAO0rZwhHeHaA%3D%3D; Path=/; HttpOnly; Secure; SameSite=Lax
set-cookie: _octo=GH1.1.1068912207.1753557874; Path=/; Domain=github.com; Expires=Sun, 26 Jul 2026 19:24:34 GMT; Secure; SameSite=Lax
set-cookie: logged_in=no; Path=/; Domain=github.com; Expires=Sun, 26 Jul 2026 19:24:34 GMT; HttpOnly; Secure; SameSite=Lax
x-github-request-id: 8872:1AFF47:866C2C:ADB6E8:68852B72
How to build · intel/xFasterTransformer Wiki · GitHub
Skip to content
Navigation Menu
{{ message }}
-
Notifications
You must be signed in to change notification settings - Fork 73
How to build
Changqing Li edited this page Nov 24, 2023
·
3 revisions
Need to modify CMakeLists.txt:
# Enable AVX512_FP16 optimization
add_definitions(-DAVX512_FP32_WEIGHT_ONLY_FP16=true)
# add_definitions(-DAVX512_FP16_WEIGHT_ONLY_FP16=true)
# add_definitions(-DAVX512_BF16_WEIGHT_ONLY_BF16=true)
add_definitions(-DAVX512_FP32_WEIGHT_ONLY_INT8=true)
# add_definitions(-DAVX512_FP16_WEIGHT_ONLY_INT8=true)
# add_definitions(-DDEBUG=true)
# add_definitions(-DSTEP_BY_STEP_ATTN=true)
# add_definitions(-DUSE_MKLML=true)
# add_definitions(-DTIMELINE=true)
# add_definitions(-DUSE_SHM=true)
$ mkdir build && cd build
$ cmake -DBUILD_WITH_SHARED_LIBS=ON ..
$ make -j
※ 每次编译项目,需要采用如下,确保编译正确:make clean && cmake .. && make -j
(1) Build UT with option -DXFT_BUILD_TESTS=ON, like cmake .. -DXFT_BUILD_TESTS=ON.
(2) Build with Debug option -DCMAKE_BUILD_TYPE=Debug, like cmake .. -DCMAKE_BUILD_TYPE=Debug. This will use -O0 instead of -O2 and open -g.
- Prepare env
- torch
- python
- Build xfastertransformer.so
# cd <root_directory>
mkdir build && cd build
cmake ..
- Create whl package
# cd <root_directory>
python3 setup.py bdist_wheel --verbose
# add tag
python setup.py egg_info --tag-build="avx512+fp32" bdist_wheel --verbose
python -c "import xfastertransformer as xft; xft.LlamaConvert().converter("/data/llama-2-7b-chat","/data/llama-2-7b-chat-xft", "fp16")"
Clone this wiki locally
You can’t perform that action at this time.