
✨ New: AlgoTune can now be easily run on AWS with just an OpenRouter API key and AWS credentials. Try it out!

AlgoTune

Can Language Models Speed Up General-Purpose Numerical Programs?

NeurIPS 2025

Ori Press, Brandon Amos, Haoyu Zhao, Yikai Wu, Samuel K. Ainsworth, Dominik Krupke, Patrick Kidger, Touqir Sajed, Bartolomeo Stellato, Jisun Park, Nathanael Bosch, Eli Meril, Albert Steppi, Arman Zharmagambetov, Fangzhao Zhang, David Pérez-Piñeiro, Alberto Mercurio, Ni Zhan, Talor Abramovich, Kilian Lieret, Hanlin Zhang, Shirley Huang, Matthias Bethge, Ofir Press


Can language models optimize the runtime of popular algorithms like gzip compression, AES encryption or SVD? To answer this, we built AlgoTune, a benchmark consisting of more than one hundred widely used math, physics, and computer science functions. For each function, the goal is to write code that is faster than the reference implementation while producing the same outputs as the reference, on a held-out test set of inputs. In addition to the benchmark, we also developed AlgoTuner, an agent which enables language models to iteratively optimize code.

This site contains AlgoTuner trajectories for all AlgoTune tasks. Each entry shows the complete conversation between the model and the AlgoTune environment, including code edits, timing evaluations, and the iterative optimization process.
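The goal stated above (run faster than the reference while producing the same outputs on held-out inputs) can be illustrated with a minimal sketch. It is not the AlgoTune harness: the function names `best_time` and `measure_speedup`, the toy task, and the use of exact equality as the correctness check are all assumptions made for illustration.

```python
import time


def best_time(fn, arg, repeats=5):
    """Smallest wall-clock time over several runs of fn(arg)."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn(arg)
        best = min(best, time.perf_counter() - start)
    return best


def measure_speedup(reference, candidate, inputs, repeats=5):
    """Total reference time divided by total candidate time.

    Returns None if the candidate's output differs from the reference
    on any input (assumption: exact equality is the validity check).
    """
    ref_total = 0.0
    cand_total = 0.0
    for x in inputs:
        if candidate(x) != reference(x):
            return None
        ref_total += best_time(reference, x, repeats)
        cand_total += best_time(candidate, x, repeats)
    if cand_total == 0.0:  # timer resolution too coarse to measure
        return float("inf")
    return ref_total / cand_total


# Toy task (hypothetical): sum of squares of 0..n-1.
def reference_sum_squares(n):
    total = 0
    for i in range(n):
        total += i * i
    return total


def candidate_sum_squares(n):
    # Closed form for the same sum.
    return (n - 1) * n * (2 * n - 1) // 6


def wrong_candidate(n):
    # Fast but incorrect, so it must be rejected.
    return n


held_out = [1_000, 5_000, 20_000]
speedup = measure_speedup(reference_sum_squares, candidate_sum_squares, held_out)
print(f"speedup: {speedup:.1f}x")
print(measure_speedup(reference_sum_squares, wrong_candidate, held_out))
```

A fast but incorrect candidate gets no speedup at all, which is why the correctness check runs before any timing is recorded.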

Leaderboard

We use our agent, AlgoTuner, to optimize the functions in AlgoTune with each of the state-of-the-art models listed below. With these models, AlgoTuner achieves impressive surface-level speedups on many tasks, but is unable to come up with novel algorithms.

Model Name	AlgoTune Score
o4-mini	1.72x
DeepSeek R1	1.70x
GPT-5	1.67x
Claude Sonnet 4.5	1.52x
GLM-4.5	1.52x
Gemini 2.5 Pro	1.51x
Qwen3 Coder	1.44x
gpt-oss-120b	1.41x
GPT-5 Mini	1.38x
Claude Opus 4.1	1.34x
Claude Opus 4	1.33x
GPT-5 Pro (medium)	1.31x

The AlgoTune score for each model is the harmonic mean of its speedups across all AlgoTune tasks. In the table at the bottom of this page, you can find the speedups achieved by each model on each AlgoTune task.
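As a minimal sketch of the aggregation described above, the snippet below computes a harmonic mean over a list of per-task speedups. The function name `algotune_style_score` and the sample values are illustrative assumptions; how failed or slower-than-reference tasks enter the score is not stated here, so the sketch leaves that choice to the caller.

```python
from statistics import harmonic_mean


def algotune_style_score(speedups):
    """Harmonic mean of per-task speedups (each must be positive)."""
    if not speedups:
        raise ValueError("need at least one speedup")
    if any(s <= 0 for s in speedups):
        raise ValueError("speedups must be positive")
    return len(speedups) / sum(1.0 / s for s in speedups)


# Illustrative values only, not taken from the results table.
illustrative = [1.0, 2.0, 4.0, 1000.0]
score = algotune_style_score(illustrative)
arithmetic = sum(illustrative) / len(illustrative)

print(f"harmonic mean:   {score:.2f}x")
print(f"arithmetic mean: {arithmetic:.2f}x")
```

With these sample values the harmonic mean is roughly 2.3x while the arithmetic mean is about 252x, which shows why a harmonic mean keeps a single very large speedup from dominating the overall score.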

AlgoTune Task Implementation

To measure speedups for the algorithms in AlgoTune, we implement a class with three methods for each algorithm. One generates problem instances (e.g., for PCA, a matrix and a number of components), one checks that the problem has been solved (e.g., for PCA, we check that the matrix is orthonormal), and the last is a reference solver (for the PCA task, we just use a PCA solver from scikit-learn).
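The three-method structure described above might look roughly like the following sketch for the PCA example. The class name `PCATaskSketch`, the method names, and the problem dictionary layout are hypothetical and are not AlgoTune's actual API. To keep the example dependent on NumPy only, the reference solver here is a plain SVD stand-in rather than the scikit-learn solver mentioned above.

```python
import numpy as np


class PCATaskSketch:
    """Hypothetical task class; names are illustrative, not AlgoTune's API."""

    def generate_problem(self, n, random_seed=0):
        # Problem instance: a data matrix and a number of components.
        rng = np.random.default_rng(random_seed)
        X = rng.standard_normal((4 * n, n))
        return {"X": X, "n_components": max(1, n // 2)}

    def solve(self, problem):
        # Stand-in reference solver: PCA via SVD of the centered data.
        X = problem["X"]
        Xc = X - X.mean(axis=0)
        _, _, vt = np.linalg.svd(Xc, full_matrices=False)
        return vt[: problem["n_components"]]

    def is_solution(self, problem, components, tol=1e-8):
        # Mirrors the check described in the text: the returned
        # component matrix must have the right shape and orthonormal rows.
        V = np.asarray(components, dtype=float)
        k = problem["n_components"]
        if V.shape != (k, problem["X"].shape[1]):
            return False
        return bool(np.allclose(V @ V.T, np.eye(k), atol=tol))


task = PCATaskSketch()
problem = task.generate_problem(6)
solution = task.solve(problem)
print(solution.shape, task.is_solution(problem, solution))
```

Keeping the checker separate from the reference solver means a candidate can be validated without requiring it to match the reference output exactly.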

Task Name	Best Speedup	2nd Best Speedup	3rd Best Speedup	4th Best Speedup
aes_gcm_encryption	Claude Opus 4.1 (1.54x)	Qwen3 Coder (1.14x)	o4-mini (1.05x)	GPT-5 (1.05x)
affine_transform_2d	GPT-5 (1.00x)	GLM-4.5 (0.22x)	Qwen3 Coder (0.22x)	o4-mini (0.22x)
aircraft_wing_design	Gemini 2.5 Pro (1.70x)	GPT-5 (1.53x)	GLM-4.5 (1.36x)	Claude Opus 4 (1.03x)
articulation_points	GLM-4.5 (10.74x)	Qwen3 Coder (7.32x)	GPT-5 (5.93x)	DeepSeek R1 (5.93x)
base64_encoding	Gemini 2.5 Pro (1.75x)	GPT-5 Mini (1.34x)	Claude Opus 4.1 (1.30x)	Qwen3 Coder (1.14x)
battery_scheduling	Claude Sonnet 4.5 (48.39x)	o4-mini (27.48x)	GLM-4.5 (26.78x)	Gemini 2.5 Pro (26.28x)
btsp	GLM-4.5 (3.46x)	Claude Sonnet 4.5 (2.82x)	DeepSeek R1 (2.76x)	gpt-oss-120b (2.64x)
capacitated_facility_location	DeepSeek R1 (16.99x)	GPT-5 (10.30x)	Qwen3 Coder (8.72x)	Gemini 2.5 Pro (8.53x)
chacha_encryption	o4-mini (1.54x)	Claude Opus 4.1 (1.29x)	Qwen3 Coder (1.15x)	DeepSeek R1 (1.04x)
channel_capacity	Gemini 2.5 Pro (1.19x)	Claude Opus 4.1 (1.15x)	Qwen3 Coder (1.13x)	GLM-4.5 (1.13x)
chebyshev_center	Qwen3 Coder (6.16x)	o4-mini (5.65x)	Gemini 2.5 Pro (4.91x)	Claude Opus 4.1 (4.87x)
cholesky_factorization	Qwen3 Coder (1.15x)	o4-mini (1.12x)	Claude Sonnet 4.5 (1.12x)	Claude Opus 4.1 (1.10x)
clustering_outliers	Qwen3 Coder (2.53x)	gpt-oss-120b (1.93x)	o4-mini (1.32x)	DeepSeek R1 (1.16x)
communicability	Gemini 2.5 Pro (197.67x)	Claude Opus 4 (106.19x)	Claude Sonnet 4.5 (67.68x)	DeepSeek R1 (66.39x)
convex_hull	DeepSeek R1 (5.09x)	Gemini 2.5 Pro (4.95x)	GPT-5 (1.00x)	o4-mini (1.00x)
convolve2d_full_fill	Claude Sonnet 4.5 (205.51x)	Gemini 2.5 Pro (175.96x)	GLM-4.5 (163.54x)	o4-mini (161.95x)
convolve_1d	GLM-4.5 (1.74x)	DeepSeek R1 (1.06x)	o4-mini (1.05x)	Gemini 2.5 Pro (1.03x)
correlate2d_full_fill	Claude Sonnet 4.5 (188.93x)	DeepSeek R1 (177.11x)	GLM-4.5 (135.10x)	Qwen3 Coder (133.14x)
correlate_1d	Gemini 2.5 Pro (1.09x)	GPT-5 Mini (1.07x)	GLM-4.5 (1.06x)	Claude Opus 4.1 (1.06x)
count_connected_components	DeepSeek R1 (6.04x)	Qwen3 Coder (4.93x)	GLM-4.5 (4.81x)	GPT-5 (4.39x)
count_riemann_zeta_zeros	Claude Sonnet 4.5 (1.01x)	GPT-5 Mini (1.00x)	gpt-oss-120b (1.00x)	Claude Opus 4.1 (1.00x)
cumulative_simpson_1d	GPT-5 (15.91x)	o4-mini (14.64x)	DeepSeek R1 (12.82x)	Claude Opus 4.1 (9.26x)
cumulative_simpson_multid	Qwen3 Coder (1.69x)	Claude Opus 4.1 (1.13x)	Claude Sonnet 4.5 (1.05x)	gpt-oss-120b (1.01x)
cvar_projection	GPT-5 (7.72x)	Claude Sonnet 4.5 (3.29x)	DeepSeek R1 (2.79x)	GLM-4.5 (2.56x)
cyclic_independent_set	GPT-5 (61.49x)	GPT-5 Pro (medium) (47.29x)	Claude Sonnet 4.5 (46.87x)	DeepSeek R1 (39.92x)
dct_type_I_scipy_fftpack	gpt-oss-120b (1.98x)	Gemini 2.5 Pro (1.21x)	o4-mini (1.07x)	Claude Sonnet 4.5 (1.05x)
delaunay	DeepSeek R1 (3.75x)	o4-mini (3.55x)	GPT-5 (3.53x)	GPT-5 Pro (medium) (3.51x)
dijkstra_from_indices	GPT-5 (1.88x)	GLM-4.5 (1.23x)	GPT-5 Mini (0.96x)	o4-mini (0.05x)
discrete_log	DeepSeek R1 (1.33x)	GPT-5 (1.08x)	GPT-5 Mini (1.07x)	Claude Sonnet 4.5 (1.01x)
dst_type_II_scipy_fftpack	gpt-oss-120b (2.04x)	o4-mini (1.85x)	GLM-4.5 (1.70x)	Qwen3 Coder (1.61x)
dynamic_assortment_planning	Claude Sonnet 4.5 (258.11x)	o4-mini (218.65x)	GPT-5 (82.13x)	DeepSeek R1 (48.51x)
earth_movers_distance	GPT-5 (1.04x)	o4-mini (1.02x)	GLM-4.5 (1.00x)	Qwen3 Coder (1.00x)
edge_expansion	DeepSeek R1 (28.80x)	o4-mini (26.62x)	gpt-oss-120b (22.96x)	Claude Opus 4 (1.06x)
eigenvalues_complex	Qwen3 Coder (1.49x)	Claude Opus 4.1 (1.49x)	GLM-4.5 (1.49x)	o4-mini (1.48x)
eigenvalues_real	GPT-5 (2.52x)	DeepSeek R1 (2.52x)	Claude Opus 4.1 (2.51x)	GLM-4.5 (2.51x)
eigenvectors_complex	o4-mini (1.04x)	GPT-5 (1.03x)	Qwen3 Coder (1.03x)	GLM-4.5 (1.02x)
eigenvectors_real	Claude Sonnet 4.5 (1.05x)	Gemini 2.5 Pro (1.04x)	GPT-5 Pro (medium) (1.02x)	GPT-5 Mini (1.02x)
elementwise_integration	o4-mini (1.00x)	gpt-oss-120b (1.00x)	GLM-4.5 (1.00x)	Claude Sonnet 4.5 (1.00x)
feedback_controller_design	o4-mini (343.02x)	DeepSeek R1 (334.31x)	gpt-oss-120b (160.50x)	Claude Sonnet 4.5 (82.02x)
fft_cmplx_scipy_fftpack	o4-mini (2.38x)	DeepSeek R1 (2.36x)	Claude Opus 4.1 (2.35x)	GLM-4.5 (2.30x)
fft_convolution	Claude Opus 4.1 (0.47x)	GPT-5 (0.42x)	GLM-4.5 (0.42x)	o4-mini (0.40x)
fft_real_scipy_fftpack	GLM-4.5 (1.63x)	gpt-oss-120b (1.45x)	DeepSeek R1 (1.40x)	Claude Sonnet 4.5 (1.31x)
firls	GLM-4.5 (1.01x)	Claude Opus 4 (1.01x)	GPT-5 (1.00x)	GPT-5 Pro (medium) (1.00x)
generalized_eigenvalues_complex	Claude Sonnet 4.5 (5.59x)	Claude Opus 4.1 (5.39x)	DeepSeek R1 (5.26x)	GPT-5 Mini (3.72x)
generalized_eigenvalues_real	Qwen3 Coder (3.24x)	GPT-5 (3.23x)	GLM-4.5 (3.18x)	DeepSeek R1 (3.13x)
generalized_eigenvectors_complex	DeepSeek R1 (3.36x)	Gemini 2.5 Pro (2.70x)	gpt-oss-120b (2.49x)	Qwen3 Coder (2.46x)
generalized_eigenvectors_real	Gemini 2.5 Pro (3.19x)	Claude Opus 4 (1.92x)	DeepSeek R1 (1.68x)	o4-mini (1.43x)
graph_coloring_assign	o4-mini (42.88x)	GPT-5 Pro (medium) (33.32x)	gpt-oss-120b (30.50x)	GPT-5 (29.92x)
graph_global_efficiency	Gemini 2.5 Pro (16.61x)	GPT-5 (16.37x)	Claude Opus 4.1 (15.85x)	DeepSeek R1 (15.65x)
graph_isomorphism	gpt-oss-120b (105.04x)	GLM-4.5 (91.03x)	Claude Sonnet 4.5 (85.93x)	Claude Opus 4.1 (80.10x)
graph_laplacian	GPT-5 (0.98x)	Claude Sonnet 4.5 (0.24x)	GLM-4.5 (0.19x)	DeepSeek R1 (0.19x)
group_lasso	Claude Sonnet 4.5 (1.01x)	Qwen3 Coder (1.01x)	GPT-5 (1.01x)	GLM-4.5 (1.00x)
gzip_compression	o4-mini (1.34x)	GPT-5 Mini (1.00x)	GPT-5 (1.00x)	gpt-oss-120b (1.00x)
integer_factorization	Claude Sonnet 4.5 (1.00x)	GPT-5 Pro (medium) (0.99x)	Claude Opus 4.1 (Fail)	Claude Opus 4 (Fail)
job_shop_scheduling	GLM-4.5 (3.33x)	Claude Sonnet 4.5 (2.86x)	Qwen3 Coder (2.18x)	gpt-oss-120b (1.96x)
kalman_filter	o4-mini (46.98x)	GPT-5 (32.26x)	DeepSeek R1 (15.76x)	Gemini 2.5 Pro (9.93x)
kcenters	GPT-5 Mini (10.16x)	GLM-4.5 (3.16x)	gpt-oss-120b (2.60x)	o4-mini (2.57x)
kd_tree	o4-mini (1.13x)	GPT-5 Mini (1.06x)	GLM-4.5 (1.05x)	GPT-5 (1.04x)
kernel_density_estimation	Claude Sonnet 4.5 (1.01x)	Claude Opus 4.1 (Fail)	Claude Opus 4 (Fail)	DeepSeek R1 (Fail)
kmeans	o4-mini (16.87x)	Claude Sonnet 4.5 (15.94x)	Claude Opus 4 (15.49x)	Gemini 2.5 Pro (15.25x)
ks_test_2samp	Claude Opus 4.1 (1.11x)	gpt-oss-120b (1.02x)	GPT-5 Mini (1.01x)	GPT-5 (1.00x)
l0_pruning	Claude Opus 4 (2.71x)	Gemini 2.5 Pro (2.48x)	Claude Opus 4.1 (1.42x)	GLM-4.5 (1.41x)
l1_pruning	o4-mini (17.69x)	DeepSeek R1 (1.85x)	Gemini 2.5 Pro (1.79x)	GLM-4.5 (1.73x)
lasso	Qwen3 Coder (1.73x)	DeepSeek R1 (1.57x)	o4-mini (1.18x)	GPT-5 (1.10x)
least_squares	DeepSeek R1 (2.32x)	GPT-5 (2.29x)	Claude Opus 4 (2.02x)	GPT-5 Pro (medium) (1.77x)
linear_system_solver	GPT-5 (1.13x)	o4-mini (1.12x)	Claude Opus 4.1 (1.11x)	GLM-4.5 (1.11x)
lp_box	GPT-5 (44.25x)	GPT-5 Mini (28.29x)	DeepSeek R1 (17.01x)	Claude Sonnet 4.5 (15.96x)
lp_centering	GPT-5 Pro (medium) (1.02x)	o4-mini (1.01x)	DeepSeek R1 (1.01x)	Claude Sonnet 4.5 (1.01x)
lp_mdp	o4-mini (865.71x)	GLM-4.5 (617.76x)	GPT-5 (416.84x)	DeepSeek R1 (369.78x)
lqr	GLM-4.5 (1.51x)	GPT-5 (1.34x)	GPT-5 Pro (medium) (1.32x)	DeepSeek R1 (1.25x)
lti_simulation	DeepSeek R1 (16.39x)	Gemini 2.5 Pro (2.05x)	o4-mini (1.15x)	GPT-5 (1.01x)
lu_factorization	Claude Opus 4 (1.20x)	Claude Opus 4.1 (1.01x)	Claude Sonnet 4.5 (1.00x)	o4-mini (1.00x)
lyapunov_stability	DeepSeek R1 (189.60x)	o4-mini (142.10x)	Claude Sonnet 4.5 (127.15x)	gpt-oss-120b (119.51x)
markowitz	Claude Sonnet 4.5 (1.62x)	Claude Opus 4.1 (1.04x)	Claude Opus 4 (0.98x)	DeepSeek R1 (Fail)
matrix_completion	Claude Opus 4.1 (1.01x)	GPT-5 Pro (medium) (1.01x)	Qwen3 Coder (1.01x)	Claude Opus 4 (1.00x)
matrix_exponential	GPT-5 (1.00x)	o4-mini (0.59x)	DeepSeek R1 (0.59x)	GPT-5 Mini (0.59x)
matrix_exponential_sparse	gpt-oss-120b (1.00x)	DeepSeek R1 (1.00x)	GPT-5 (0.98x)	Claude Opus 4.1 (Fail)
matrix_multiplication	GPT-5 (1.06x)	o4-mini (1.06x)	Claude Opus 4.1 (0.32x)	DeepSeek R1 (0.32x)
matrix_sqrt	Claude Sonnet 4.5 (1.05x)	o4-mini (1.04x)	Claude Opus 4.1 (1.00x)	GPT-5 (1.00x)
max_clique_cpsat	GPT-5 Pro (medium) (47.46x)	GPT-5 (47.20x)	Claude Sonnet 4.5 (46.32x)	gpt-oss-120b (30.20x)
max_common_subgraph	GPT-5 (79.46x)	GPT-5 Pro (medium) (72.96x)	o4-mini (46.79x)	GPT-5 Mini (38.91x)
max_flow_min_cost	Claude Opus 4 (14.34x)	Qwen3 Coder (13.78x)	GLM-4.5 (9.42x)	Claude Opus 4.1 (9.18x)
max_independent_set_cpsat	o4-mini (76.14x)	GPT-5 (20.68x)	GPT-5 Pro (medium) (12.29x)	GPT-5 Mini (8.46x)
max_weighted_independent_set	GPT-5 Mini (3.25x)	Claude Sonnet 4.5 (1.95x)	GLM-4.5 (1.85x)	o4-mini (1.55x)
min_dominating_set	GPT-5 Mini (6.87x)	gpt-oss-120b (2.61x)	GLM-4.5 (2.53x)	GPT-5 (1.92x)
min_weight_assignment	gpt-oss-120b (1.71x)	DeepSeek R1 (1.70x)	Claude Sonnet 4.5 (1.57x)	Claude Opus 4.1 (1.56x)
minimum_spanning_tree	GLM-4.5 (12.78x)	Claude Sonnet 4.5 (10.60x)	gpt-oss-120b (10.07x)	o4-mini (9.90x)
minimum_volume_ellipsoid	DeepSeek R1 (45.38x)	Gemini 2.5 Pro (16.00x)	o4-mini (6.65x)	Claude Sonnet 4.5 (1.05x)
multi_dim_knapsack	DeepSeek R1 (56.93x)	GPT-5 Mini (6.73x)	Gemini 2.5 Pro (5.50x)	Qwen3 Coder (4.24x)
nmf	Qwen3 Coder (1.42x)	Claude Opus 4 (1.22x)	DeepSeek R1 (1.16x)	Claude Sonnet 4.5 (1.16x)
ode_brusselator	GPT-5 (387.43x)	o4-mini (301.75x)	GPT-5 Mini (206.24x)	Claude Sonnet 4.5 (199.98x)
ode_fitzhughnagumo	GPT-5 (326.89x)	GPT-5 Mini (11.09x)	Claude Opus 4.1 (1.12x)	Qwen3 Coder (1.10x)
ode_hires	DeepSeek R1 (29.24x)	Gemini 2.5 Pro (25.75x)	GPT-5 (17.80x)	GPT-5 Pro (medium) (15.90x)
ode_hodgkinhuxley	GPT-5 Pro (medium) (235.44x)	o4-mini (165.94x)	Gemini 2.5 Pro (112.08x)	DeepSeek R1 (52.40x)
ode_lorenz96_nonchaotic	GLM-4.5 (3.15x)	DeepSeek R1 (2.86x)	GPT-5 (2.33x)	Claude Sonnet 4.5 (1.87x)
ode_lotkavolterra	GPT-5 (825.43x)	o4-mini (814.44x)	GPT-5 Pro (medium) (395.84x)	Gemini 2.5 Pro (53.56x)
ode_nbodyproblem	Claude Sonnet 4.5 (57.70x)	GPT-5 (54.39x)	o4-mini (54.21x)	Claude Opus 4.1 (50.61x)
ode_seirs	o4-mini (3084.39x)	GPT-5 (534.75x)	Gemini 2.5 Pro (43.75x)	Claude Sonnet 4.5 (13.37x)
ode_stiff_robertson	DeepSeek R1 (68.88x)	Qwen3 Coder (14.88x)	o4-mini (12.01x)	gpt-oss-120b (6.47x)
ode_stiff_vanderpol	o4-mini (2062.53x)	GPT-5 (127.92x)	DeepSeek R1 (90.93x)	GLM-4.5 (42.38x)
odr	o4-mini (1.01x)	GPT-5 (1.01x)	gpt-oss-120b (1.01x)	DeepSeek R1 (1.01x)
optimal_advertising	GPT-5 (1.36x)	GPT-5 Mini (1.31x)	DeepSeek R1 (1.29x)	GPT-5 Pro (medium) (1.24x)
outer_product	Claude Opus 4.1 (1.78x)	Qwen3 Coder (1.06x)	GLM-4.5 (1.03x)	DeepSeek R1 (1.02x)
pagerank	Gemini 2.5 Pro (30.97x)	GPT-5 (8.37x)	Claude Sonnet 4.5 (4.30x)	Claude Opus 4 (4.22x)
pca	DeepSeek R1 (4.15x)	gpt-oss-120b (3.89x)	GPT-5 (3.69x)	o4-mini (3.62x)
pde_burgers1d	DeepSeek R1 (4.17x)	Claude Sonnet 4.5 (4.15x)	Claude Opus 4 (4.03x)	GLM-4.5 (3.84x)
pde_heat1d	Claude Sonnet 4.5 (3.14x)	GLM-4.5 (3.09x)	Qwen3 Coder (2.95x)	GPT-5 (2.27x)
polynomial_mixed	o4-mini (99.78x)	DeepSeek R1 (4.32x)	Claude Opus 4.1 (1.05x)	GLM-4.5 (1.04x)
polynomial_real	GLM-4.5 (138.47x)	DeepSeek R1 (134.71x)	o4-mini (73.71x)	Claude Sonnet 4.5 (70.51x)
power_control	DeepSeek R1 (346.26x)	GLM-4.5 (328.28x)	Qwen3 Coder (307.32x)	o4-mini (304.84x)
procrustes	o4-mini (2.32x)	Qwen3 Coder (1.86x)	GPT-5 (1.84x)	DeepSeek R1 (1.03x)
psd_cone_projection	o4-mini (8.96x)	GPT-5 Pro (medium) (8.88x)	Claude Sonnet 4.5 (8.79x)	GPT-5 (8.77x)
qp	GPT-5 (1.89x)	Claude Sonnet 4.5 (1.79x)	Claude Opus 4.1 (1.74x)	Claude Opus 4 (1.70x)
qr_factorization	o4-mini (7.95x)	GPT-5 (7.83x)	GPT-5 Pro (medium) (7.27x)	Claude Opus 4 (1.17x)
quantile_regression	GLM-4.5 (1.69x)	Gemini 2.5 Pro (1.41x)	GPT-5 (1.20x)	Claude Sonnet 4.5 (1.18x)
queens_with_obstacles	GPT-5 (3.61x)	DeepSeek R1 (3.00x)	Gemini 2.5 Pro (2.87x)	gpt-oss-120b (2.82x)
queuing	Qwen3 Coder (1.19x)	gpt-oss-120b (1.12x)	GLM-4.5 (1.10x)	o4-mini (1.10x)
qz_factorization	o4-mini (1.73x)	GPT-5 (1.01x)	Claude Opus 4.1 (1.00x)	GLM-4.5 (1.00x)
randomized_svd	Claude Sonnet 4.5 (4.88x)	GPT-5 (4.80x)	GLM-4.5 (4.61x)	DeepSeek R1 (4.51x)
rbf_interpolation	o4-mini (1.02x)	Claude Sonnet 4.5 (1.00x)	GLM-4.5 (1.00x)	Qwen3 Coder (1.00x)
rectanglepacking	Gemini 2.5 Pro (2.29x)	Claude Sonnet 4.5 (2.04x)	GPT-5 Pro (medium) (2.02x)	GPT-5 (1.92x)
robust_kalman_filter	Gemini 2.5 Pro (8.63x)	GPT-5 Pro (medium) (7.19x)	o4-mini (7.05x)	Qwen3 Coder (6.64x)
robust_linear_program	Gemini 2.5 Pro (6.51x)	o4-mini (6.49x)	Claude Sonnet 4.5 (1.07x)	Qwen3 Coder (1.06x)
rocket_landing_optimization	DeepSeek R1 (1.63x)	Claude Sonnet 4.5 (1.61x)	GLM-4.5 (1.06x)	Gemini 2.5 Pro (1.03x)
rotate_2d	GPT-5 (1.22x)	GPT-5 Mini (1.00x)	o4-mini (0.24x)	DeepSeek R1 (0.23x)
set_cover	GPT-5 (50.62x)	o4-mini (29.74x)	DeepSeek R1 (6.70x)	gpt-oss-120b (3.53x)
set_cover_conflicts	Claude Sonnet 4.5 (6.58x)	GPT-5 Pro (medium) (5.77x)	o4-mini (5.59x)	GPT-5 (4.88x)
sha256_hashing	GPT-5 Pro (medium) (1.00x)	Claude Opus 4.1 (1.00x)	Claude Sonnet 4.5 (1.00x)	GLM-4.5 (1.00x)
shift_2d	GPT-5 (1.00x)	o4-mini (1.00x)	GPT-5 Pro (medium) (0.20x)	GLM-4.5 (0.20x)
shortest_path_dijkstra	Gemini 2.5 Pro (2.44x)	DeepSeek R1 (2.33x)	Qwen3 Coder (2.26x)	o4-mini (2.18x)
sinkhorn	Qwen3 Coder (2.25x)	Gemini 2.5 Pro (2.23x)	Claude Sonnet 4.5 (1.94x)	DeepSeek R1 (1.86x)
sparse_eigenvectors_complex	GPT-5 (1.01x)	GPT-5 Pro (medium) (1.01x)	DeepSeek R1 (1.00x)	Claude Sonnet 4.5 (1.00x)
sparse_lowest_eigenvalues_posdef	Qwen3 Coder (2.73x)	gpt-oss-120b (2.08x)	GLM-4.5 (2.07x)	o4-mini (1.89x)
sparse_lowest_eigenvectors_posdef	Qwen3 Coder (2.91x)	GLM-4.5 (2.55x)	DeepSeek R1 (2.47x)	gpt-oss-120b (2.28x)
sparse_pca	Qwen3 Coder (10.24x)	DeepSeek R1 (9.08x)	GPT-5 (6.40x)	Gemini 2.5 Pro (6.06x)
spectral_clustering	GLM-4.5 (19.39x)	gpt-oss-120b (17.19x)	DeepSeek R1 (13.51x)	Claude Opus 4.1 (10.53x)
stable_matching	o4-mini (1.73x)	GPT-5 Pro (medium) (1.67x)	GPT-5 (1.64x)	Qwen3 Coder (1.64x)
svd	o4-mini (1.62x)	GPT-5 Mini (1.61x)	DeepSeek R1 (1.02x)	gpt-oss-120b (1.02x)
svm	Claude Sonnet 4.5 (1.08x)	gpt-oss-120b (1.01x)	GLM-4.5 (1.00x)	Claude Opus 4 (1.00x)
sylvester_solver	GPT-5 (1.06x)	gpt-oss-120b (1.06x)	GPT-5 Mini (1.03x)	o4-mini (1.03x)
tensor_completion_3d	o4-mini (203.38x)	GLM-4.5 (79.10x)	Gemini 2.5 Pro (33.87x)	GPT-5 (29.79x)
toeplitz_solver	GPT-5 (1.00x)	o4-mini (1.00x)	GPT-5 Mini (1.00x)	Qwen3 Coder (1.00x)
tsp	GLM-4.5 (1.92x)	Claude Opus 4.1 (1.81x)	DeepSeek R1 (1.32x)	Claude Opus 4 (1.17x)
two_eigenvalues_around_0	o4-mini (1.92x)	GPT-5 Mini (1.83x)	Claude Opus 4.1 (1.80x)	Qwen3 Coder (1.79x)
unit_simplex_projection	o4-mini (3.61x)	DeepSeek R1 (3.53x)	GPT-5 (2.70x)	GLM-4.5 (2.69x)
upfirdn1d	DeepSeek R1 (1.13x)	GPT-5 (1.11x)	GLM-4.5 (1.11x)	gpt-oss-120b (0.99x)
vector_quantization	Qwen3 Coder (1.01x)	gpt-oss-120b (1.01x)	Claude Opus 4 (1.01x)	GPT-5 (1.01x)
vectorized_newton	Claude Opus 4.1 (Fail)	Claude Opus 4 (Fail)	Claude Sonnet 4.5 (Fail)	DeepSeek R1 (Fail)
vehicle_routing	Gemini 2.5 Pro (2.76x)	GLM-4.5 (2.15x)	Claude Opus 4 (1.40x)	DeepSeek R1 (1.22x)
vertex_cover	GPT-5 Pro (medium) (69.33x)	GPT-5 (68.29x)	GPT-5 Mini (49.16x)	Gemini 2.5 Pro (2.55x)
voronoi_diagram	GPT-5 (12.62x)	o4-mini (9.28x)	Gemini 2.5 Pro (3.35x)	Qwen3 Coder (2.49x)
wasserstein_dist	Qwen3 Coder (10.08x)	DeepSeek R1 (9.87x)	o4-mini (9.82x)	Claude Opus 4.1 (9.56x)
water_filling	o4-mini (514.52x)	Gemini 2.5 Pro (213.25x)	Claude Opus 4 (183.87x)	Claude Sonnet 4.5 (98.20x)
zoom_2d	GPT-5 (1.02x)	o4-mini (0.99x)	GPT-5 Pro (medium) (0.21x)	GPT-5 Mini (0.21x)
