HTTP/2 200
server: GitHub.com
content-type: text/html; charset=utf-8
last-modified: Thu, 19 Sep 2024 17:44:07 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"66ec62e7-2321"
expires: Sun, 28 Dec 2025 23:11:46 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 6C56:3FD64F:80F848:90CFA0:6951B6DA
accept-ranges: bytes
date: Sun, 28 Dec 2025 23:01:46 GMT
via: 1.1 varnish
age: 0
x-served-by: cache-bom-vanm7210033-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1766962906.342478,VS0,VE237
vary: Accept-Encoding
x-fastly-request-id: c908eb479439fb78dc28347b840d504beb36618c
content-length: 2502
Kevin Lu
|
I have a new website: link (you should be redirected automatically).
|
|
|
Decision Transformer: Reinforcement Learning via Sequence Modeling
Lili Chen*,
Kevin Lu*,
Aravind Rajeswaran,
Kimin Lee,
Aditya Grover,
Michael Laskin,
Pieter Abbeel,
Aravind Srinivas*,
Igor Mordatch*
Neural Information Processing Systems (NeurIPS), 2021
Official:
arXiv /
website /
poster /
tweet /
code
Press:
The Batch article /
SyncedReview article /
The Gradient article /
Yannic Kilcher video /
Eindhoven RL seminar
|
|
|
Pretrained Transformers as Universal Computation Engines
Kevin Lu,
Aditya Grover,
Pieter Abbeel,
Igor Mordatch
AAAI Conference on Artificial Intelligence, 2022
Official:
arXiv /
blog /
poster /
tweet /
code
Press:
The Batch article /
VentureBeat article /
TWIML podcast /
Yannic Kilcher video
|
|
Back when I was a TA at Berkeley, I wrote a
study guide
for our course on Probability and Random Processes.
|
|