We introduce DependEval, a hierarchical benchmark for evaluating LLMs on repository-level code understanding.
DependEval comprises 2,683 curated repositories spanning 8 programming languages and evaluates models on three hierarchical tasks: Dependency Recognition, Repository Construction, and Multi-file Editing.
Our findings highlight key challenges in applying LLMs to large-scale development and lay the groundwork for future improvements in repository-level understanding.
## How to Run
```bash
# Implement your model in the `inference_func` inside run.py
# Then run the following commands for automatic inference and evaluation
conda create -n dependeval python=3.10 -y
conda activate dependeval
pip install -r requirements.txt
bash run.sh
```
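The actual signature of `inference_func` is defined in run.py; the sketch below is only an illustration of how a Hugging Face model could be plugged in, assuming the function takes a prompt string and returns the generated text (the model name and argument names are placeholders, not the repository's fixed interface).

```python
# Hypothetical sketch of an inference_func for run.py (adapt to the real signature).
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "deepseek-ai/deepseek-coder-6.7b-instruct"  # placeholder: any causal LM

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME, device_map="auto")

def inference_func(prompt: str) -> str:
    """Generate a completion for a single benchmark prompt."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=1024, do_sample=False)
    # Strip the echoed prompt so only the newly generated text is returned.
    return tokenizer.decode(
        outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
```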
## Citation
If you find DependEval useful, feel free to cite us:
```bibtex
@misc{du2025dependevalbenchmarkingllmsrepository,
      title={DependEval: Benchmarking LLMs for Repository Dependency Understanding},
      author={Junjia Du and Yadi Liu and Hongcheng Guo and Jiawei Wang and Haojian Huang and Yunyi Ni and Zhoujun Li},
      year={2025},
      eprint={2503.06689},
      archivePrefix={arXiv},
      primaryClass={cs.SE},
      url={https://arxiv.org/abs/2503.06689},
}
```