SAT-LM

Code for SatLM: SATisfiability-Aided Language Models using Declarative Prompting (NeurIPS 2023).

Setup

Python version: python==3.8
Install requirements: pip install -r requirements.txt
Set your OpenAI API key: export KEY=yourkey
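
Before launching an experiment it can help to confirm that the key is actually visible to Python. The snippet below is a minimal sketch, not code from this repository: the helper name get_openai_key is hypothetical, and it only assumes that the key is exported under the environment variable KEY, as in the step above.

```python
import os


def get_openai_key(env=os.environ, var_name="KEY"):
    """Return the API key stored in `var_name`, or raise a clear error.

    `env` is any mapping of environment variables; it defaults to the
    real process environment. The helper is illustrative only.
    """
    key = env.get(var_name, "").strip()
    if not key:
        raise RuntimeError(
            f"Environment variable {var_name} is not set; "
            f"run `export {var_name}=yourkey` first."
        )
    return key


if __name__ == "__main__":
    # Only report that a key was found, without printing the key itself.
    get_openai_key()
    print("KEY is set")
```

Running it in the same shell that will run the experiment scripts shows whether the export took effect.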

Experiments

Preparation:
mkdir misc tmp

Since OpenAI will no longer support code-davinci-002, we provide cached outputs generated by code-davinci-002.

Run the following command to use the cached code-davinci-002 outputs:
unzip aux/cached_code-002_outputs.zip -d .

Experiments on Arithmetic Reasoning

GSM:
sh exp_scripts/gsm.sh test

GSM-system:
sh exp_scripts/gsm.sh system

Algebra:
sh exp_scripts/gsm.sh algebra

Experiments on Logical Reasoning

AR-LSAT:
sh exp_scripts/arlsat.sh

BoardgameQA:
sh exp_scripts/boarddp1.sh # depth 1
sh exp_scripts/boarddp2.sh # depth 2
sh exp_scripts/boarddp3.sh # depth 3

CLUTRR:
sh exp_scripts/clutrr.sh

ProofWriter:
sh exp_scripts/proofd5.sh

Prompts

The prompts used in our experiments are stored as JSON Lines files in manual_prompts/.
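
To inspect a prompt file, each line can be parsed as one JSON record. The following is a minimal sketch under stated assumptions: the helper names parse_jsonl and load_jsonl are hypothetical, the file name in the usage comment is a placeholder, and nothing is assumed about which fields the records contain.

```python
import json


def parse_jsonl(lines):
    """Parse an iterable of JSON Lines strings into a list of Python objects.

    Blank lines are skipped; every other line must be a complete JSON value.
    """
    records = []
    for line in lines:
        line = line.strip()
        if line:
            records.append(json.loads(line))
    return records


def load_jsonl(path):
    """Read a JSON Lines file from `path` and return its records."""
    with open(path, encoding="utf-8") as f:
        return parse_jsonl(f)


# Example usage (placeholder path; substitute a real file from manual_prompts/):
# records = load_jsonl("manual_prompts/<some_file>")
# print(len(records), "records loaded")
```

Printing the first record is usually enough to see how a given prompt file is laid out.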

Citation

@InProceedings{Ye-Et-Al:2023:SAT,
  title = {SatLM: Satisfiability-Aided Language Models Using Declarative Prompting},
  author = {Xi Ye and Qiaochu Chen and Isil Dillig and Greg Durrett},
  booktitle = {Proceedings of NeurIPS},
  year = {2023},
}
