| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Wed, 06 Nov 2024 02:11:48 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"672ad064-5040"
expires: Mon, 29 Dec 2025 08:19:16 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 6B9E:2BC55:88097D:98D588:6952372C
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 08:09:17 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210023-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766995757.799041,VS0,VE207
vary: Accept-Encoding
x-fastly-request-id: 88e080ada6763262db7dd0e3c466c701da2dc706
content-length: 3555
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Featured on Hugging Face Daily Paper
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
ICRA 2024
Jianlan Luo1*,
Zheyuan Hu1*,
Charles Xu1,
You Liang Tan1,
Jacob Berg2,
Archit Sharma3,
Stefan Schaal4,
Chelsea Finn3,
Abhishek Gupta2,
Sergey Levine1
*Equal Contribution;
1Department of EECS, University of California, Berkeley;
2Department of Computer Science, University of Washington;
3Department of Computer Science, Stanford University;
4Intrinsic Innovation LLC
SERL is a ready-to-use software suite for robotic RL,
featuring sample efficient off-policy algorithms,
various reward specification methods, and advanced controller for popular robots.
It includes example tasks such as PCB assembly, cable routing, and reset-freeobject relocation.
Remarkably, it trains policies in just 25 to 50 minutes, outperforming
previous benchmarks with high success rates and robustness.
Uncut Training Process
SERL Successful Deployments
Peking Unversity Agibot Lab
Tsinghua University
ETH
Intrinsic LLC
Peking Unversity Agibot Lab
Peking Unversity Agibot Lab
Peking Unversity Agibot Lab
ETH
Have you used SERL successfully? Send us your videos to jianlanluo@berkeley.edu!
Zero-shot Robustness to Perturbations and Distractors
PCB Component Insertion
The agent successfully inserts the PCB component as trained.
The agent successfully inserts the PCB component after being blind-folded and the board moved.
The agent successfully inserts the PCB component despite multiple distractor objects on the PCB board.
The agent succeeds when the board is not fixed to the table.
The agent successfully inserts the PCB component into a the board in a different pose than training.
The agent successfully finds the holes in the board after it is moved.
The agent successfully finds the holes in the board after it is continuously moved.
Cable Routing
The agent successfully routes the cable as trained.
The agent generalizes to route the cable through an unseen clip pose.
The agent successfully routes the cable despite continuous perturbation to the clip.
The agent successfully routes the cable despite continuous perturbation to the clip.
The agent generalizes to route the cable through an unseen clip pose.
The agent generalizes to route the cable through an unseen clip pose.
Object Relocation
The agent relocates the object as trained.
The agent relocates the trained object despite multiple distractors in the bins.
The agent relocates the trained object despite a distractor in the scene.
The agent generalizes to relocate a different object than trained.
The agent completes the task as trained.
BibTeX
@misc{luo2024serl,
title={SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning},
author={Jianlan Luo and Zheyuan Hu and Charles Xu and You Liang Tan and Jacob Berg and Archit Sharma and Stefan Schaal and Chelsea Finn and Abhishek Gupta and Sergey Levine},
year={2024},
eprint={2401.16013},
archivePrefix={arXiv},
primaryClass={cs.RO}
}