Skip to content
/ GPO Public

"Robust Prompt Optimization for Large Language Models Against Distribution Shifts" in EMNLP 2023.

Notifications You must be signed in to change notification settings

li-moxin/GPO

Repository files navigation

GPO

Code for "Robust Prompt Optimization for Large Language Models Against Distribution Shifts" in EMNLP 2023. See the paper at this link. Check out the PPT in this repo.

Nov 27 2023: Updated GPO code and data. Our code is based on APE. We sincerely thank the APE authors!

TODO: update prompts and code for baselines.

Setup

After cloning the project, run the following steps to create the environment.

cd GPO
conda create -n gpo python=3.8
pip install -e .
cd experiments
mkdir results

Running the code

Firstly, remember to fill in your openai api key in experiments/run_gpo.py.

For example, to run GPO on Yelp and Flipkart,

python run_gpo.py yelp flipkart 6 6 1 36 0 0.6

The parameters are source_group, target_group, num_subsamples, num_demos, num_prompts_per_subsample, eval_num, seed, conf_threshold.

  • source_group: see all the data files at experiments/data/instruction_induction/raw/induce/
  • target_group: see all the data files at experiments/data/instruction_induction/raw/execute/
  • num_subsamples, num_demos, num_prompts_per_subsample, eval_num: come from APE. If not assigned, the code will automatically assign the default parameters in the paper.
  • seed: 0~4, for five times average.
  • conf_threshold: the consistency threshold. If not assigned, the code will automatically assign the default value in the paper.

The logs are saved at experiments/results.

Data

The datasets we used are stored under experiments/data/instruction_induction/raw/. In our experiments, the training and validation instances are randomly sampled from the entire original dataset. In this repo, we only keep the data used in our experiments under the this folder, where unused data are saved as None for saving space.

Contact

Kindly contact [email protected] for any issue, thank you!

Reference

If you find our work useful, kindly add the following citation. Thank you!

@inproceedings{
li2023robust,
title={Robust Prompt Optimization for Large Language Models Against Distribution Shifts},
author={Moxin Li, Wenjie Wang, Fuli Feng, Yixin Cao, Jizhi Zhang, Tat-Seng Chua},
booktitle={The 2023 Conference on Empirical Methods in Natural Language Processing},
year={2023},
url={https://openreview.net/forum?id=svUOik2Xu1}
}

About

"Robust Prompt Optimization for Large Language Models Against Distribution Shifts" in EMNLP 2023.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages