Justin Yu*, Letian Fu*, Huang Huang, Karim El-Refai, Rares Ambrus, Richard Cheng, Muhammad Zubair Irshad, Ken Goldberg
UC Berkeley, Toyota Research Institute
*Equal contribution
Abstract
Real Robot Rollouts
We train and evaluate two modern robot visuomotor policies (π0-FAST and Diffusion Policy) on either rendered data generated by Real2Render2Real alone or human-teleoperated data alone, across 5 manipulation tasks.
Performance Scaling
A comparative analysis of imitation-learning policies trained on R2R2R-generated data versus human teleoperation data, spanning 1,050 physical robot experiments, suggests that while real data is higher quality and more efficient per demonstration, R2R2R can scale trajectory diversity far beyond human teleoperation throughput, achieving competitive final performance with less collection effort.
Scan, Track, Render
The distinction we make between simulation and rendering is often a point of confusion:
When we refer to simulation, we mean the use of a physics engine to computationally model dynamic interactions. In contrast, rendering refers to generating visual data from a graphics engine.
Why No Dynamics Simulation?
In early experiments, we explored physics engines for real-to-sim-to-real data generation but found that with imperfect or unrefined real-to-sim assets, simulated dynamics often diverged from real-world behavior—especially in gripper-object interactions, where issues like interpenetration and unrealistic collisions were common. Still, we wanted to pursue scalable, high-quality data generation through computation. To that end, we use IsaacLab while disregarding its collision computation features, relying on it solely for photorealistic rendering. Object motion is grounded by distilling object-centric dynamics from real-world demonstration videos and object visual appearance is distilled from high-fidelity 3D reconstructions.
This paper is not a critique of physics engines or their role in robot manipulation, but rather a positive result: computational data generation can scale effectively even without yet simulating dynamics!
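To make the rendering-only design concrete, below is a minimal sketch of kinematic replay: object poses tracked from a real demonstration video and robot joint targets are set directly each frame and an image is rendered, with no physics stepping or collision resolution. The renderer interface and Frame container here are hypothetical stand-ins, not the IsaacLab API or the paper's code.

from dataclasses import dataclass
import numpy as np

@dataclass
class Frame:
    """One timestep of a rendered rollout (hypothetical container)."""
    rgb: np.ndarray
    robot_joint_positions: np.ndarray

def kinematic_replay(renderer, object_poses, robot_joint_trajectory):
    """Replay object and robot motion frame by frame, rendering only.

    object_poses come from tracking the real demonstration video and
    robot_joint_trajectory from planning/IK, so each frame is simply
    "set state, then render"; no dynamics are simulated.
    `renderer` is a hypothetical interface with set_object_pose(T),
    set_robot_joints(q), and render_rgb() methods.
    """
    frames = []
    for T_world_object, q in zip(object_poses, robot_joint_trajectory):
        renderer.set_object_pose(T_world_object)  # kinematic state update only
        renderer.set_robot_joints(q)
        frames.append(Frame(rgb=renderer.render_rgb(), robot_joint_positions=q))
    return frames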
Rendering More Embodiments
Part trajectories from a single demonstration can be retargeted across different robot embodiments.
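As an illustration, one simple way to retarget a tracked part trajectory is to compose it with a grasp frame on the part and an embodiment-specific flange offset, then hand the resulting end-effector targets to each robot's own inverse kinematics. This is a minimal sketch with hypothetical frame names, not the exact retargeting procedure from the paper.

import numpy as np

def retarget_part_trajectory(world_from_part, part_from_grasp, grasp_from_flange):
    """Map an object-part pose trajectory to end-effector targets for a new robot.

    world_from_part:   list of (4, 4) world-from-part transforms over time
    part_from_grasp:   (4, 4) transform locating the grasp frame on the part
    grasp_from_flange: (4, 4) embodiment-specific grasp-to-flange offset
    Returns world-from-flange targets to pass to that robot's inverse kinematics.
    """
    return [T_wp @ part_from_grasp @ grasp_from_flange for T_wp in world_from_part]

# Toy usage: a single part pose with identity grasp and flange offsets.
T = np.eye(4)
T[:3, 3] = [0.40, 0.00, 0.20]
ee_targets = retarget_part_trajectory([T], np.eye(4), np.eye(4))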
Domain Randomization
We randomize initial object poses, lighting, and camera poses to generate diverse synthetic rollouts for each object-task combination.
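A minimal sketch of what such scene randomization can look like is below; the specific parameter ranges and the returned dictionary keys are illustrative assumptions, not the values used in R2R2R.

import numpy as np
from scipy.spatial.transform import Rotation as R

rng = np.random.default_rng(seed=0)

def sample_scene_randomization():
    """Sample one randomized scene configuration (illustrative ranges only)."""
    # Initial object pose: position on the tabletop plus a random yaw.
    obj_xy = rng.uniform(low=[-0.10, -0.10], high=[0.10, 0.10])
    obj_quat = R.from_euler("z", rng.uniform(-np.pi, np.pi)).as_quat()  # xyzw

    # Camera pose: jitter the position and look-at point around nominal values.
    cam_position = np.array([0.60, 0.00, 0.50]) + rng.normal(scale=0.02, size=3)
    cam_lookat = np.array([0.00, 0.00, 0.10]) + rng.normal(scale=0.01, size=3)

    # Lighting: random intensity (arbitrary renderer units).
    light_intensity = rng.uniform(500.0, 1500.0)

    return {
        "object_position": np.array([obj_xy[0], obj_xy[1], 0.0]),
        "object_quat_xyzw": obj_quat,
        "camera_position": cam_position,
        "camera_lookat": cam_lookat,
        "light_intensity": light_intensity,
    }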
Trajectory Interpolation
From a single demonstration, R2R2R generates a distribution of plausible trajectories by interpolating 6-DoF part motion.
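The building block behind this can be sketched with standard 6-DoF interpolation: spherical linear interpolation (slerp) for orientation and linear interpolation for translation, which lets a single tracked part trajectory be resampled or warped toward new start and goal poses. The code below is a generic sketch of that building block, not the paper's exact trajectory-generation scheme.

import numpy as np
from scipy.spatial.transform import Rotation as R, Slerp

def interpolate_part_motion(times, positions, quats_xyzw, query_times):
    """Resample a 6-DoF part trajectory at new timestamps.

    times:        (N,) increasing timestamps of the tracked trajectory
    positions:    (N, 3) part positions
    quats_xyzw:   (N, 4) part orientations as xyzw quaternions
    query_times:  (M,) timestamps at which to sample the interpolated motion
    """
    slerp = Slerp(times, R.from_quat(quats_xyzw))            # orientation: slerp
    quat_interp = slerp(query_times).as_quat()
    pos_interp = np.stack(                                    # translation: linear
        [np.interp(query_times, times, positions[:, i]) for i in range(3)], axis=-1
    )
    return pos_interp, quat_interp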
Full Project Video
BibTeX
@misc{yu2025real2render2realscalingrobotdata,
title={Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware},
author={Justin Yu and Letian Fu and Huang Huang and Karim El-Refai and Rares Andrei Ambrus and Richard Cheng and Muhammad Zubair Irshad and Ken Goldberg},
year={2025},
eprint={2505.09601},
archivePrefix={arXiv},
primaryClass={cs.RO},
url={https://arxiv.org/abs/2505.09601},
}