Fanjia Yan
Research
I'm generally interested in LLM agentic frameworks and function calling / tool usage. Some of my work is listed below:
Berkeley Function Calling Leaderboard
Fanjia Yan*, Huanzhi Mao*, Charlie Cheng-Jie Ji*, Ion Stoica, Joseph E. Gonzalez, Tianjun Zhang, Shishir G. Patil
project page / GitHub page
The first comprehensive evaluation of LLMs' ability to call functions and tools.
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code
Naman Jain, King Han, Alex Gu, Wen-Ding Li, Fanjia Yan, Tianjun Zhang, Sida Wang, Armando Solar-Lezama, Koushik Sen, Ion Stoica
project page / code / arXiv
LiveCodeBench collects problems from periodic contest platforms and uses them to build a holistic benchmark that evaluates code LLMs continuously over time across a variety of code-related scenarios.
OpenFunctions
Charlie Cheng-Jie Ji*, Huanzhi Mao*, Fanjia Yan*, Ion Stoica, Joseph E. Gonzalez, Shishir G. Patil, Tianjun Zhang
project page / Hugging Face page
A state-of-the-art, open-source, general-purpose function-calling model.