You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We store all datasets, predictions, and results from the paper in a HuggingFace dataset repository. You can download the dataset from HuggingFace by running the following command:
In tutorials/tutorial.ipynb, we walk through how to use the Smoothie algorithm. The tutorial can be easily adapted for your use case given that you provide a .jsonl file with the dataset inputs, and several json files each containing a different model/prompt's generations.
If you are interested in the mathematical derivation of Smoothie, check out tutorials/algorithm.ipynb.
dataset_configs: Contains the configuration files for all single-task and multi-task datasets.
plots: Contains plots for the paper.
prompt_templates: Contains the prompt templates for all single-task and multi-task datasets.
replication_scripts: Contains bash scripts for running experiments in the paper.
src: Contains the source code for formatting datasets, getting generations, running routing methods, and evaluating results. The subfolder paper contains code for producing the tables and plots in the paper.
tables: Contains latex tables for the paper.
tutorials: Contains tutorials for using Smoothie.
Citation
If you use Smoothie in your work, please cite the following paper:
@misc{guha2024smoothielabelfreelanguage,
title={Smoothie: Label Free Language Model Routing},
author={Neel Guha and Mayee F. Chen and Trevor Chow and Ishan S. Khare and Christopher Ré},
year={2024},
eprint={2412.04692},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2412.04692},
}