Carview!

HOME
ABOUT
- RESULTS
- differences
- BENEFITS
- HISTORY
- TEAM
- LOCATION
- FACILITIES
- BANKING
- MEMBERSHIPS
- APPROVALS
- LICENCES
- SUPPLIERS
- SPONSORSHIPS
- MEDIA
- PRIVACY
AUCTIONS
SHIPPING
FEES
- TS REWARDS
TOOLS
guides
FAQ
CONTACT
- CONNECT

VEHICLES
BRAND
- JAPANESE CARS
  - DAIHATSU
  - EUNOS
  - FORD
  - HONDA
  - ISUZU
  - LEXUS
  - MAZDA
  - MITSUBISHI
  - MITSUOKA
  - NISSAN
  - SUBARU
  - SUZUKI
  - TOYOTA
- GERMAN CARS
- AMERICAN CARS
- BRITISH CARS
- ITALIAN CARS
- FRENCH CARS
- SWEDISH CARS
- KOREAN CARS
TYPE
- mobility
- VENDING
- instruction
- TAXIS
- AMBULANCES
- FIRE ENGINES
- HEARSES
- LIMOUSINES
- COMMERCIAL
CLASS
FUEL
TRUCKS
minitrucks
- DAIHATSU
- HONDA
- MAZDA
- MITSUBISHI
- NISSAN
- SUBARU
- SUZUKI
- DUMP
- CRANE
- CAMPER
- REFRIGERATED
- 4WD
- NEW
BUSES
MOTORHOMES
- YAHOO!
- RAKUTEN
- DEALER

PARTS
- FREE REPORT
- PARTS CONTAINERS
- PARTS SYSTEMS
- PARTS PROTECTION
- BODY SHELLS
- DISMANTLING
- ONLINE PARTS
- NEW PARTS
- INTERIOR PARTS
- EXTERIOR PARTS
  - BONNETS
  - BUMPERS
  - GRILLES
  - FENDERS
  - DOORS
  - TRUNKS
  - SPOILERS
  - LIGHTS
  - EMBLEMS
  - CAMERAS
- ENGINES
- TRANSMISSIONS
- WHEELS & TYRES
  - WHEELS
  - TYRES
CUTS
PERFORMANCE PARTS
TRUCK PARTS
MOTORBIKE PARTS
- MOTORBIKE ENGINES
- MOTORBIKE ACCESSORIES

MOTORBIKES
MARINE
FORKLIFTS
MACHINERY
AGRICULTURAL
OTHER
COUNTRY
- AUSTRALIA
- CANADA
- KENYA
- MYANMAR
- NEW ZEALAND
- PAKISTAN
- TANZANIA
- UNITED STATES

CARVIEW

MOTORHOMES

Select Language

HTTP/2 301 server: GitHub.com content-type: text/html location: https://cohenqu.github.io/rlad.github.io/ x-github-request-id: 43D4:2BC55:9981EE:AC7900:69535740 accept-ranges: bytes age: 0 date: Tue, 30 Dec 2025 04:38:25 GMT via: 1.1 varnish x-served-by: cache-bom-vanm7210076-BOM x-cache: MISS x-cache-hits: 0 x-timer: S1767069505.819675,VS0,VE199 vary: Accept-Encoding x-fastly-request-id: 6f967acc9cc668c6fb9c4684ba790102a3c09df7 content-length: 162 HTTP/2 200 server: GitHub.com content-type: text/html; charset=utf-8 last-modified: Fri, 03 Oct 2025 18:40:53 GMT access-control-allow-origin: * strict-transport-security: max-age=31556952 etag: W/"68e018b5-5694" expires: Tue, 30 Dec 2025 04:48:25 GMT cache-control: max-age=600 content-encoding: gzip x-proxy-cache: MISS x-github-request-id: 7C8B:234FE9:9AC2B9:ADBA2A:69535741 accept-ranges: bytes age: 0 date: Tue, 30 Dec 2025 04:38:25 GMT via: 1.1 varnish x-served-by: cache-bom-vanm7210076-BOM x-cache: MISS x-cache-hits: 0 x-timer: S1767069505.032378,VS0,VE218 vary: Accept-Encoding x-fastly-request-id: e35682c093c4ba979af442fae149ce2ce8443444 content-length: 5101 RLAD

Training LLMs to Discover Abstractions for Solving Reasoning Problems

Yuxiao Qu^1❖ , Anikait Singh^2❖, Yoonho Lee^2❖, Amrith Setlur¹ ,
Ruslan Salakhutdinov¹ , Chelsea Finn² , Aviral Kumar¹

¹Carnegie Mellon University ²Stanford University ^❖Equal Contribution

arXiv

Hugging Face Coming Soon

Standard reasoning vs. Reasoning abstractions. We depict the solution space as a graph of intermediate steps leading to correct or incorrect answers. (1) Standard reasoning explores this space along one sequential chain. (2) We generate textual abstractions by summarizing which intermediate steps led to which outcomes. (3) Such abstractions can be reused to guide reasoning more efficiently.

Abstract

Reasoning requires going beyond pattern matching or memorization of solutions to identify and implement "algorithmic procedures" that can be used to deduce answers to hard problems. Doing so requires reusing primitives, intermediate results, or procedures across multiple problems. While RL post-training on long chains of thought ultimately aims to uncover this kind of algorithmic behavior, the depth-first and "brute-force" nature of reasoning traces learned by these models suggests that this is far from a fulfilled promise. To address more effective reasoning, we introduce reasoning abstractions: concise natural language descriptions of procedural and factual knowledge that guide the model toward learning successful reasoning. We train models to be capable of proposing several useful abstractions given a problem, followed by RL training that incentivizes building a solution while using the information provided by these abstractions. This results in a two-player RL training paradigm, abbreviated as RLAD, that jointly trains an abstraction generator and an abstraction-conditioned solution generator. This setup effectively enables structured exploration, decouples learning signals of abstraction proposal and solution generation, and improves generalization to harder problems. We also show that spending more test-time compute into generating abstractions is more beneficial for performance than generating more solutions at large inference-time budgets, illustrating the role of abstractions in guiding global exploration.

Reasoning Abstractions and Why They Are Useful

Solving hard reasoning problems requires more than lengthening chains of thought — it requires reusable insights.

Reasoning abstractions are short natural-language descriptions that capture:

Procedural knowledge (e.g., “apply the quadratic formula in modular arithmetic”).
Factual knowledge (e.g., "a number has an inverse mod m only if gcd(x, m) = 1").
Cautionary patterns (e.g., "avoid assuming a denominator is invertible without checking").

These abstractions summarize what works and what fails across multiple solution attempts. When provided to LLMs:

They act like exam hints, guiding the model toward more promising strategies.
They improve exploration by broadening the search space beyond sequential brute force.
They can generalize across problems — helping models recognize shared substructures or common pitfalls.

Empirically, conditioning on abstractions boosts accuracy and pass@k across math reasoning, ARC program synthesis, and even non-math domains like legal reasoning and healthcare.

Figure: Examples of good reasoning abstractions in non-math domains. Adding the abstraction to the prompt of GPT-4o-mini consistently improves performance on unseen instances.

RLAD Framework

RLAD jointly trains:

Abstraction Generator – proposes problem-specific abstractions.
Solution Generator – learns to solve problems by leveraging abstractions.

Training proceeds in two phases:

Warm-start with supervised fine-tuning on abstraction – solution pairs from stronger models.
Reinforcement learning where abstractions are rewarded if they improve the success rate of solution generation.

Experimental Results

Main Performance Results on Math Reasoning Benchmarks

Approach	AIME 2025			DeepScaleR [Hard]			AMC 2023
	w/o abs (avg)	w/ abs (avg)	w/ abs (best)	w/o abs (avg)	w/ abs (avg)	w/ abs (best)	w/o abs (avg)	w/ abs (avg)	w/ abs (best)
Qwen-3-1.7B	33.75	36.25	40.00	20.21	22.14	32.50	86.41	78.01	84.53
+ DAPO	37.92	34.90	39.79	21.67	21.88	33.54	86.41	81.99	88.44
+ RLAD	38.04	42.45	48.33	23.54	24.84	35.54	87.25	88.35	91.72

Table: Accuracy on math reasoning benchmarks. RLAD achieves consistent gains in both abstraction-conditioned and w/o abstraction settings across AIME 2025, DeepScaleR Hard, and AMC 2023. We report performance without abstractions, with abstractions (pass@1 with 16 samples), and the best abstraction (pass@16).

A typical example of a reasoning abstraction proposed by our abstraction generator.

Figure: In the solution, we see references to the abstraction and keywords from the abstraction being used meaningfully in the reasoning trace of the solution generator model.

Corresponding Author: Yuxiao Qu
This page was built using the Academic Project Page Template which was adopted from the Nerfies project page.
This website is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

HOME
ABOUT
AUCTIONS
SHIPPING
FEES
TOOLS
HOW
FAQ
CONTACT

Original Source | Taken Source