rLLM

🚀 Reinforcement Learning for Language Agents🌟

rLLM is an open-source framework for post-training language agents via reinforcement learning. With rLLM, you can easily build your custom agents and environments, train them with reinforcement learning, and deploy them for real-world workloads.

Releases 📰

[2025/12/11] We release rLLM v0.2.1 which comes with support for Tinker backend, LoRA and VLM training, and support for Eval Protocol. We also bumped our verl backend to v0.6.1. [SDK Blogpost]

[2025/10/16] rLLM v0.2 is now officially released! We introduce AgentWorkflowEngine for training over arbitrary agentic programs. It also comes integrated with the official verl-0.5.0, featuring support for Megatron training. Check out this blog post for more.

[2025/07/01] We release DeepSWE-Preview, a 32B software engineering agent (SWE) trained with purely RL that achieves 59% on SWEBench-Verified with test-time scaling,(42.2% Pass@1), topping the SWEBench leaderboard for open-weight models.

[2025/04/08] We release DeepCoder-14B-Preview, a 14B coding model that achieves an impressive 60.6% Pass@1 accuracy on LiveCodeBench (+8% improvement), matching the performance of o3-mini-2025-01-031 (Low) and o1-2024-12-17.

[2025/02/10] We release DeepScaleR-1.5B-Preview, a 1.5B model that surpasses O1-Preview and achieves 43.1% Pass@1 on AIME. We achieve this by iteratively scaling Deepseek's GRPO algorithm from 8K→16K->24K context length for thinking.

Getting Started 🎯

rLLM requires Python >= 3.10 (3.11 is needed if using tinker). You can install it either directly via pip or build from source.

There are three ways that you can install rLLM:

Approach A: Direct Installation

uv pip install "rllm[verl] @ git+https://github.com/rllm-org/rllm.git"

(or replace the verl above for tinker to install with tinker backend, see below for more details)

Approach B: Building from Source with `uv`

Step 1: Clone and Setup Environment

# Clone the repository
git clone https://github.com/rllm-org/rllm.git
cd rllm
# Create an uv environment
uv venv --python 3.11
source .venv/bin/activate

Step 2: Install rLLM with Training Backend

rLLM supports two training backends: verl and tinker. Choose one based on your needs.

Option I: Using verl as Training Backend

uv pip install -e .[verl]

Option II: Using tinker as Training Backend

# can add --torch-backend=cpu to train on CPU-only machines
uv pip install -e .[tinker]

Approach C: Installation with Docker 🐳

For a containerized setup, you can use Docker:

# Build the Docker image
docker build -t rllm .
# Create and start the container
docker create --runtime=nvidia --gpus all --net=host --shm-size="10g" --cap-add=SYS_ADMIN -v .:/workspace/rllm -v /tmp:/tmp --name rllm-container rllm sleep infinity
docker start rllm-container
# Enter the container
docker exec -it rllm-container bash

For more detailed installation guide, including using sglang for verl backend, please refer to our documentation.

Awesome Projects using rLLM 🔥

DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL
DeepCoder: A Fully Open-Source 14B Coder at O3-mini Level
DeepSWE: Training a Fully Open-sourced, State-of-the-Art Coding Agent by Scaling RL
Tongyi DeepResearch: A New Era of Open-Source AI Researchers
Terminal-Bench-RL: Training Long-Horizon Terminal Agents with Reinforcement Learning
Cogito, Ergo Ludo: An Agent that Learns to Play by Reasoning and Planning
PettingLLMs: Using On-Policy Reinforcement Learning for Stronger Multi-Agent System
Cut the Bill, Keep the Turns: Affordable Multi-Turn Search RL

Acknowledgements

Our work is done as part of Berkeley Sky Computing Lab. The rLLM team is generously supported by grants from Laude Institute, AWS, Hyperbolic, Fireworks AI, and Modal. We pay special thanks to Together AI for the research partnership and compute support.

Citation

@misc{rllm2025,
  title={rLLM: A Framework for Post-Training Language Agents},
  author={Sijun Tan and Michael Luo and Colin Cai and Tarun Venkat and Kyle Montgomery and Aaron Hao and Tianhao Wu and Arnav Balyan and Manan Roongta and Chenguang Wang and Li Erran Li and Raluca Ada Popa and Ion Stoica},
  year={2025},
  howpublished={\url{https://pretty-radio-b75.notion.site/rLLM-A-Framework-for-Post-Training-Language-Agents-21b81902c146819db63cd98a54ba5f31}},
  note={Notion Blog}
  year={2025}
}

You may also cite our prior work DeepScaleR, DeepCoder, and DeepSWE.

Name		Name	Last commit message	Last commit date
Latest commit History 1,547 Commits
.github/workflows		.github/workflows
docs		docs
examples		examples
rllm		rllm
scripts		scripts
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yaml		.readthedocs.yaml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
build_docs.sh		build_docs.sh
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

rLLM

Releases 📰

Getting Started 🎯

Approach A: Direct Installation

Approach B: Building from Source with `uv`

Approach C: Installation with Docker 🐳

Awesome Projects using rLLM 🔥

Acknowledgements

Citation

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors 41

Languages

License

rllm-org/rllm

Folders and files

Latest commit

History

Repository files navigation

rLLM

Releases 📰

Getting Started 🎯

Approach A: Direct Installation

Approach B: Building from Source with uv

Approach C: Installation with Docker 🐳

Awesome Projects using rLLM 🔥

Acknowledgements

Citation

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors 41

Languages

Approach B: Building from Source with `uv`

Packages