You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Training script examples are provided in the advisor_models directory, along with templates for new environments. You will also need to have specified an OPENAI_API_KEY and WANDB_API_KEY in your environment.
Advisor Models
Overview
Customizing powerful, black-box models is a major challenge, with most practitioners typically limited to static prompting.
We propose a framework to train a small open-source “advisor” model to guide black-box models via feedback, optimizing the model to your specific environment, task, or users with RL.
We show that Advisor Models are highly effective for personalizing and adapting black-box models to specific environments. We additionally test the system in reasoning-intensive tasks, finding properties of the framework under which Advisor Models work best, and demonstrating the system robustness across models and environments.
Example
📜 License
Advisor Models is Apache 2.0 licensed, making it suitable for both academic and commercial use.
@article{asawa2025trainadvisorsteeringblackbox,
title={How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models},
author={Parth Asawa and Alan Zhu and Matei Zaharia and Alexandros G. Dimakis and Joseph E. Gonzalez},
year={2025},
journal={arXiv preprint arXiv:2510.02453},
}
About
How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models