High-performance, extensible, chainable optimizers for PyTorch.
Major Release: HeavyBall 2.0.0 introduces a comprehensive documentation overhaul, enhanced interactive visualizations, and new additions including the chainable architecture and schedule-free optimizers. This release also brings expanded testing, stability improvements, and detailed theory notes with practical examples.
Read the full release notes | Quick Start Guide
- Lightning-Fast Training: Batched `foreach` operations deliver significant speedups on large models.
- Adaptive & Extensible: Built-in AdamW, RMSprop, Schedule-Free algorithms, and PaLM-inspired schedules.
- Plug-and-Play: Drop-in replacements for `torch.optim` with seamless integration.
- Customizable: Chainable API lets you compose optimizers and transforms (MARS correction, cautious updates, orthogonal updates).
- Battle-Tested: Extensive benchmarks and real-world examples included.
- New in v2.0.0: Foreach-optimized PSGD variants (`ForeachPSGDKron`, `ForeachCachedPSGDKron`) with substantial speedups
- New in v2.0.0: Schedule-Free optimizers (`ForeachSFAdamW`, `SFAdaGrad`) that eliminate learning rate scheduling (see the example after the basic usage snippet below)
- New in v2.0.0: Chainable architecture for composing complex optimization pipelines
- Foreach-based optimizers: `ForeachAdamW`, `ForeachRMSprop`, `ForeachSFAdamW`, `Muon`, `ADOPT`, `MSAM`, …
- Advanced update rules: MARS correction, cautious updates, PaLM beta2 scheduling
- Enhanced BF16/FP16 support for mixed-precision training
- Comprehensive benchmark suite and interactive visualizations
- Detailed documentation with theoretical foundations and practical examples
Install:

```bash
pip install heavyball
```
Basic usage:

```python
import torch
from torch import nn
from heavyball import ForeachAdamW

model = nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 10))
optimizer = ForeachAdamW(model.parameters(), lr=1e-3)

# dataloader is assumed to yield (data, target) batches.
for data, target in dataloader:
    optimizer.zero_grad()
    output = model(data)
    loss = torch.nn.functional.cross_entropy(output, target)
    loss.backward()
    optimizer.step()
```
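Because HeavyBall optimizers follow the standard `torch.optim` interface, switching to a schedule-free variant is a one-line change. The sketch below assumes `ForeachSFAdamW` accepts the same `(params, lr=...)` constructor arguments as `ForeachAdamW`; consult the API docs for optimizer-specific options.

```python
from heavyball import ForeachSFAdamW

# Schedule-free variant: no learning-rate scheduler is created or stepped.
optimizer = ForeachSFAdamW(model.parameters(), lr=1e-3)

for data, target in dataloader:
    optimizer.zero_grad()
    loss = torch.nn.functional.cross_entropy(model(data), target)
    loss.backward()
    optimizer.step()
```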
Experience HeavyBall optimizers in action with our interactive visualization:
- Open the demo: Simply open `index.html` in your web browser
- View performance: Watch real-time optimizer trajectory visualization showing loss convergence
- Export results: Save visualizations as PNG images with custom filenames
- Run tests: Use `npm test` to run the Playwright test suite for the demo
The interactive demo provides an intuitive way to understand optimizer behavior and performance characteristics without writing code.
Reproduce benchmarks with:
```bash
python3 -m benchmark.run_all_benchmarks --opt ForeachSOAP --opt LaProp --opt AdamW --opt Muon --opt ForeachCachedNewtonPSGD --opt RMSprop --opt OrthoLaProp --opt ForeachSFAdamW --opt ForeachADOPT --opt LaPropOrtho --opt CachedPSGDKron --opt SignLaProp --opt ForeachSOLP --opt PSGDLRA --opt NewtonPSGDLRA --opt NewtonHybrid2PSGDKron --opt NewtonHybrid2PSGDLRA --opt mars-NewtonHybrid2PSGDLRA --opt MSAMLaProp --opt mars-adaptive-NewtonHybrid2PSGDKron --opt mars-ortho-NewtonHybrid2PSGDKron --opt MuonLaProp --opt mars-unscaled-NewtonHybrid2PSGDKron --opt mars-NewtonHybrid2PSGDKron --opt cautious-AdamW --opt unscaled_cautious-AdamW --opt mars-AdamW --dtype float32 --steps 1000000 --trials 1000 --parallelism 256 --seeds 1 --difficulties trivial --difficulties easy --difficulties medium --difficulties hard --difficulties extreme --difficulties nightmare --timeout 2880
```
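The full sweep above is expensive. For a quick smoke test, the same flags can be reused with a much smaller search space; the reduced values below are illustrative, not tuned:

```bash
python3 -m benchmark.run_all_benchmarks --opt AdamW --opt ForeachSFAdamW \
    --dtype float32 --steps 10000 --trials 10 --parallelism 16 --seeds 1 \
    --difficulties trivial --difficulties easy --timeout 60
```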
We welcome contributions! Please check the issue tracker and follow these steps:
- Fork the repo and create a feature branch.
- Install dev dependencies: `pip install -e .[dev]`.
- Run tests: `pytest`.
- Submit a pull request.
BSD 3-Clause license; see the LICENSE file.
Made by the HeavyBall team.