This is a modified version of the allenai/open-instruct repository used for the LLaMA-Pro project. This branch diverged from main at commit 9ebcb58 and implements the following features that are not present in that commit:
Tuning specific layers while freezing all other parameters.
Training with gradient checkpointing.
The rest of this README contains instructions for replicating the training of the LLaMA-Pro model.
More details can be found in the original open-instruct repository.
Replicating Training
Set up environment
We provide a requirements file that captures our training environment.
You can install all required packages via
pip install -r requirements.txt
Prepare data
You can use the following command to prepare the instruction dataset.
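The exact preparation command is not reproduced here. As a rough sketch, the upstream open-instruct repository prepares its instruction-tuning data with a script along the following lines; the script name and location are assumptions, so check the scripts shipped in this branch:

# Assumed script name from upstream open-instruct; adjust to the script in your checkout.
./scripts/prepare_train_data.sh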
Then, edit the provided .sh files to set paths to where checkpoints and data files are stored on your system. An example can be found in finetune_codealpaca.
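As a minimal sketch, the values you would typically point at your own locations look like the following; the variable names below are illustrative, not necessarily the ones used in finetune_codealpaca:

# Illustrative path variables; use the actual variable names defined in the provided .sh files.
MODEL_PATH=/path/to/pretrained/checkpoint
DATA_FILE=/path/to/processed/instruction_data.jsonl
OUTPUT_DIR=/path/to/save/finetuned/model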
Tip: If you want to train only specific layers, you can add an extend_layers argument in the script to specify which layers to train.
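For example, assuming extend_layers takes a list of layer indices (the exact argument format is an assumption; check the training script), the added flag might look like:

# Hypothetical usage: train only the listed layer indices, freezing everything else.
--extend_layers 4 9 14 19 24 29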
You can use the following command to run instruction tuning (finetuning a pretrained model to follow instructions):
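The command itself is not shown here. As a sketch, assuming the provided example script wraps open-instruct's training entry point, running the edited script would look like the following; the script path is an assumption based on the finetune_codealpaca example mentioned above:

# Assumed invocation of the provided example script; it is expected to launch the
# open-instruct finetuning entry point with the paths and arguments you edited above.
bash scripts/finetune_codealpaca.sh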