Releases · AI-Hypercomputer/xpk · GitHub
18 Jul 13:58
13 Jun 09:08
14 Apr 17:33
gcie
Grzegorz Ciesielski
27 Mar 20:05
25 Mar 14:08
18 Mar 21:16
28 Jan 10:31
12 Apr 22:08
v0.10.0
Highlights
- DWS Flex support for GPUs and TPUs
- Managed Lustre storage attach support
What's Changed
New Features
- Update PathwaysJob version to v0.1.2 by @RoshaniN in #507
- Update Cluster Toolkit Version by @pawloch00 in #503
- Managed Lustre storage attach support implemented by @sharabiani in #534
- Implement DWS for GPUs and TPUs by @pawloch00 in #467
Bug fixes
- Fix issue in control_plane_endpoints_config.dns_endpoint_config.allow… by @SujeethJinesh in #499
- Fix broken A3 High workloads by @gcie in #494
- Bring back shared_memory volume for A3 Mega and A3 High by @gcie in #512
- Provided the required permissions for JAX to list the pods by @sharabiani in #509
- fix the incorrect number of chips per VM for v5litepod-8 by @gcie in #513
- Update Kueue and Jobset controller default limit value by @ycchenzheng in #502
- Fix cluster creation from reservation by @pawloch00 in #522
New Contributors
- @ycchenzheng made their first contribution in #502
Full Changelog: v0.9.0...v0.10.0
v0.9.0
ad39147
Highlights
- GPUDirect-TCPX support for H100 accelerator (A3-High VMs)
- A command to adapt a cluster to XPK expected config (`xpk cluster adapt`)
- DWS Calendar Mode Reservations
What's Changed
New Features
- GPUDirect-TCPX support for H100 by @gcie in #459
- Add Multi-tier checkpointing support in XPK Cluster Creation by @abhinavclemson in #465
- Add Jobset controller patching for MTC cluster by @abhinavclemson in #475
- `xpk cluster adapt` by @gcie in #466
Bug fixes
- Merge main to develop by @gcie in #458
- Update pathways.py with worker component type. by @RoshaniN in #456
- Fix error when `xpk storage attach --type=gcpfilestore` without `--mount-options` by @gcie in #463
- Update README.md - text edit in Advanced usage section by @kzmyslona in #473
- Update PathwaysJob Version to v0.1.1 To Fix RM OOM by @SujeethJinesh in #477
- Placement Policy removed from A3-Mega blueprints with --spot by @sharabiani in #478
- Enable DNS Access to Prevent Connection Timeout Errors by @SujeethJinesh in #483
- Fix DWS Calendar Mode Reservations for A3 Mega by @gcie in #484
New Contributors
- @kzmyslona made their first contribution in #468
Full Changelog: v0.8.0...v0.9.0
v0.8.0
7e24869
Highlights
- Support for provisioning A4 GKE clusters
- PathwaysJob integration
- Storage support (Parallelstore and Hyperdisk)
What's Changed
New Features
- Add the option to use Multi-tier checkpointing in workloads by @abhinavclemson in #447
- Integrate PathwaysJob into XPK. by @RoshaniN in #448
- Implement Parallelstore and Hyperdisk storages attach by @sharabiani in #436
- A4 support for prod by @gcie in #412
- Add `--mount-options` parameter to `xpk storage attach/create` by @gcie in #450
Bug fixes
- Update JOBSET_VERSION from 0.7.2 to 0.8.0 by @SujeethJinesh in #425
- fix yaml alignment when `remote-python-sidecar-image` is passed by @sadikneipp in #426
- Bring back manual manifest specification for attaching storage by @gcie in #427
- Fix XPK version in Pypi release by @sharabiani in #428
- Remove sudo requirement from make by @sharabiani in #435
- fix: workloads not scheduling on A3 Ultra clusters by @gcie in #441
- Disable creating additional networks for L4 and A2 clusters by @gcie in #444
- Fix `xpk workload create` for L4 and A100 by @gcie in #452
Full Changelog: v0.7.2...v0.8.0
v0.7.2
6ba0019
What's Changed
Bug fixes
- Remove sudo requirement from make by @sharabiani in #435
Full Changelog: v0.7.1...v0.7.2
v0.7.1
745df78
What's Changed
Bug fixes
- fix yaml alignment when remote-python-sidecar-image is passed by @sadikneipp in #426
- Bring back manual manifest specification for attaching storage by @gcie in #427
- Fix XPK version in Pypi release by @sharabiani in #428
v0.7.0
3336958
Highlights
- SLURM-like experience
- Storage support (GCSFuse, GCP Filestore)
- Remote state management for Cluster Toolkit backend
What's Changed
New Features
- Support for gcs_bucket added to blueprint_generator by @sharabiani in #351
- Add `xpk run` command by @IrvingMg in #343
- Add installation of gke-gcloud-auth-plugin by @IrvingMg in #357
- Implement config commands by @pawloch00 in #354
- Merge gcsfuse into main by @pawloch00 in #317
- Merge gcsfuse into develop by @pawloch00 in #370
- Implement gcs state bucket parameter passing by @pawloch00 in #377
- Implement remote state management changes by @pawloch00 in #380
- Add environment variables for Pathways metric collection by @guptaaka in #404
- Kjob storage configuration by @mbobrovskyi in #372
- Kjob storage configuration for run command by @mbobrovskyi in #408
- Support for multihost in slurm mode by @pawloch00 in #409
- `xpk storage detach` by @gcie in #420
Bug fixes
- Fix flaky `Check created job` test by @IrvingMg in #361
- Fix `xpk version` showing invalid git commit hash by @gcie in #358
- Sync main commits with develop branch by @SujeethJinesh in #379
- Add Workaround for Megascale Num Slices Issue on PW by @SujeethJinesh in #375
- Fix invalid nvidia libraries mount path for A3 Mega by @gcie in #382
- Skip Storages if Not Found (instead of failing) by @SujeethJinesh in #381
- De-dupe Superfluous Exit Codes Provided by the User by @SujeethJinesh in #387
- Update Resource Manager Kueue CPU Requirements by @SujeethJinesh in #386
- Add Host Networking with Cluster First DNS to Main Job by @SujeethJinesh in #391
- Fixes for slurm-like commands by @sharabiani in #373
- Fixes for slurm like commands by @pawloch00 in #397
- Fix nightly tests by @mbobrovskyi in #415
- Sync develop with main by @sharabiani in #421
- Fix xpk version error when cloned from git by @sharabiani in #423
New Contributors
Full Changelog: v0.6.0...v0.7.0
v0.6.0
e52a5f4
Highlights
- Support for provisioning A3-Ultra and A3-Mega GKE clusters
- Cluster Toolkit integrated into XPK
What's Changed
- Workloads implemented for A3-Mega and A3-Ultra machines by @sharabiani in #306
- Build kjob in docker to remove golang dependency. by @pawloch00 in #305
- Support building docker images from non arm architecture. by @mbobrovskyi in #304
- Add url to the workload logs link directly by @Obliviour in #309
- Add support for automatic cleanup of jobsets by @Obliviour in #310
- Update Readme and makefile to smooth installation flow by @pawloch00 in #312
- Change way of kubectl install by @pawloch00 in #316
- Set min-master-version for A3-Ultra clusters by @sharabiani in #322
- Support v5litepod-8. by @mbobrovskyi in #292
- Print nodepools info when describing cluster by @IrvingMg in #294
- Fix CPU resource limits. by @RoshaniN in #323
- Move from custom-tpu-nodepool-arguments (deprecated) to custom-nodepool-arguments by @Obliviour in #328
- Remove kubectl installation from Makefile by @pawloch00 in #324
- Add crds deletion as in kueue update steps by @pawloch00 in #321
- A3Ultra blueprint updated to fix gpu driver and Jobset issues by @sharabiani in #330
- Add ttlSecondsAfterFinished to Pathways workloads by @Obliviour in #320
- Remove manual private network configuration from large scale instructions by @Obliviour in #327
- Updating the CPU config for better CPU usage. by @RoshaniN in #331
- Use cluster toolkit dockerfile from tag by @pawloch00 in #332
- Cluster Toolkit updated to v1.45.0 by @sharabiani in #333
- Fix integration tests for new ctk version by @pawloch00 in #334
- Use release kjob v0.1.0 version. by @mbobrovskyi in #315
- Bump kueue to v0.10.0. by @mbobrovskyi in #318
- The generated dependencies dir path fixed for A3U by @sharabiani in #337
- Implement version command by @pawloch00 in #338
- Cluster Toolkit upgraded to v1.45.1 by @sharabiani in #340
- Add v to version displayed by @pawloch00 in #339
Full Changelog: v0.4.1...v0.6.0
v0.4.0
eff0dd0
Add Pathways end-to-end tests to XPK build tests and nightly tests. (…