You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
To evaluate your model on ChartQAPro, follow the steps below:
1. Format Your Predictions
Save your model's predictions in a .json file that contains a list of dictionaries.
Each dictionary should include the following keys (first three keys taken from the original huggingface dataset):
"Answer": the ground truth answer
"Question Type": the type of the question (e.g., Factoid, MCQ, etc.)
This will print your model’s performance across different question types and the overall score, following the official evaluation metrics used in the paper. 📊
If you use ChartQAPro in your research, please cite:
@misc{masry2025chartqaprodiversechallengingbenchmark,
title={ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering},
author={Ahmed Masry and Mohammed Saidul Islam and Mahir Ahmed and Aayush Bajaj and Firoz Kabir and Aaryaman Kartha and Md Tahmid Rahman Laskar and Mizanur Rahman and Shadikur Rahman and Mehrad Shahmohammadi and Megh Thakkar and Md Rizwan Parvez and Enamul Hoque and Shafiq Joty},
year={2025},
eprint={2504.05506},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2504.05506},
}