HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Thu, 19 Sep 2024 17:44:07 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"66ec62e7-2321"
expires: Mon, 29 Dec 2025 02:33:22 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 6827:36A0B4:82AC30:92DD15:6951E61A
accept-ranges: bytes
age: 0
date: Mon, 29 Dec 2025 02:23:23 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210040-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766975003.830344,VS0,VE224
vary: Accept-Encoding
x-fastly-request-id: 142b76458d43db32f7811a4d60557ccf2c78dcbd
content-length: 2502
Kevin Lu
|
I have a new website: link (you should be redirected automatically).
|
|
|
Decision Transformer: Reinforcement Learning via Sequence Modeling
Lili Chen*,
Kevin Lu*,
Aravind Rajeswaran,
Kimin Lee,
Aditya Grover,
Michael Laskin,
Pieter Abbeel,
Aravind Srinivas*,
Igor Mordatch*
Neural Information Processing Systems (NeurIPS), 2021
Official:
arXiv /
website /
poster /
tweet /
code
Press:
The Batch article /
SyncedReview article /
The Gradient article /
Yannic Kilcher video /
Eindhoven RL seminar
|
|
|
Pretrained Transformers as Universal Computation Engines
Kevin Lu,
Aditya Grover,
Pieter Abbeel,
Igor Mordatch
AAAI Conference on Artificial Intelligence, 2022
Official:
arXiv /
blog /
poster /
tweet /
code
Press:
The Batch article /
VentureBeat article /
TWIML podcast /
Yannic Kilcher video
|
|
Back when I was a TA at Berkeley, I wrote a
study guide
for our course on Probability and Random Processes.
|
|