
llm-jp/instruct3


Instruct3

This repository contains scripts for supervised fine-tuning (SFT) and direct preference optimization (DPO) of LLM-jp models.

Preparation

Setup Environment

See install_cu118.sh for CUDA 11.8 and install_cu12.1.sh for CUDA 12.1.

Prepare Data

Run the following commands to download and preprocess the data (about 4 GB in total).

python preprocess_sft.py --dataset-dir /path/to/dataset

python preprocess_dpo.py --dataset-dir /path/to/dataset

Prepare Config File

Copy configs/base_template.yaml to configs/base.yaml.

cp configs/base_template.yaml configs/base.yaml

After copying, set the values work_dir and data_dir in configs/base.yaml: work_dir is the directory where model checkpoints and log files are stored, and data_dir is the directory where the input data files are stored. data_dir must be the same directory passed to the --dataset-dir option of the preprocessing scripts.
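For example, the relevant part of configs/base.yaml might look like the following (an illustrative excerpt; the paths are placeholders and any other keys in the template are left untouched):

```yaml
# configs/base.yaml (excerpt; paths are placeholders)
work_dir: /path/to/work_dir   # model checkpoints and log files are written here
data_dir: /path/to/dataset    # must match --dataset-dir used during preprocessing
```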

Checkpoint Conversion

Hugging Face -> NeMo

scripts/ckpt/convert_llama_hf_to_nemo.sh converts a Hugging Face checkpoint to a NeMo checkpoint. You may need to change the Slurm job-name before running the script.

sbatch scripts/ckpt/convert_llama_hf_to_nemo.sh ${INPUT_HF_NAME_OR_PATH} ${OUTPUT_NEMO_PATH} ${HPARAMS_FILE}

Sample Code

# convert llm-jp-3-1.8b
sbatch scripts/ckpt/convert_llama_hf_to_nemo.sh llm-jp/llm-jp-3-1.8b /path/to/checkpoints/hf-to-nemo/llm-jp--llm-jp-3-1.8b ./megatron_configs/llmjp3/1.8b-exp2.yaml
# convert llm-jp-3-3.7b
sbatch scripts/ckpt/convert_llama_hf_to_nemo.sh llm-jp/llm-jp-3-3.7b /path/to/checkpoints/hf-to-nemo/llm-jp--llm-jp-3-3.7b ./megatron_configs/llmjp3/3.7b-exp1.yaml
# convert llm-jp-3-7.2b
sbatch scripts/ckpt/convert_llama_hf_to_nemo.sh llm-jp/llm-jp-3-7.2b /path/to/checkpoints/hf-to-nemo/llm-jp--llm-jp-3-7.2b ./megatron_configs/llmjp3/7.2b-exp1.yaml
# convert llm-jp-3-13b
sbatch scripts/ckpt/convert_llama_hf_to_nemo.sh llm-jp/llm-jp-3-13b /path/to/checkpoints/hf-to-nemo/llm-jp--llm-jp-3-13b ./megatron_configs/llmjp3/13b-exp4.yaml

NeMo -> Hugging Face

scripts/ckpt/convert_llama_nemo_to_hf.sh converts a NeMo checkpoint back to a Hugging Face checkpoint. You may need to change the Slurm job-name before running the script.

Note: Use the absolute path for the input and output paths.

sbatch scripts/ckpt/convert_llama_nemo_to_hf.sh ${INPUT_NEMO_PATH} ${OUTPUT_HF_PATH}
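Since the script expects absolute paths, one way to resolve relative paths before submitting the job is a sketch like the following (assuming GNU coreutils realpath is available; the checkpoint paths are placeholders):

```shell
# Resolve possibly-relative paths to absolute ones before submitting the job.
# realpath -m canonicalizes the path even if it does not exist yet,
# which is useful for the output directory.
INPUT_NEMO_PATH=$(realpath -m ./checkpoints/hf-to-nemo/sft-model1)
OUTPUT_HF_PATH=$(realpath -m ./checkpoints/nemo-to-hf/sft-model1)
echo "$INPUT_NEMO_PATH"
echo "$OUTPUT_HF_PATH"
```

The resolved variables can then be passed directly to the sbatch command above.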

Sample Code

# convert `sft-model1` model
sbatch scripts/ckpt/convert_llama_nemo_to_hf.sh /path/to/checkpoints/hf-to-nemo/sft-model1 /path/to/checkpoints/nemo-to-hf/sft-model1

Supervised Fine-tuning

You may need to change the Slurm job-name before running the script.

sbatch scripts/train/sft_1.8b.sh ${INPUT_NEMO_PATH}

Sample Code

# 1.8B model with 2 nodes (16 GPUs)
sbatch --nodes 2 scripts/train/sft_1.8b.sh /path/to/checkpoints/hf-to-nemo/llm-jp--llm-jp-3-1.8b
# 7.2B model with 4 nodes (32 GPUs)
sbatch --nodes 4 scripts/train/sft_7.2b.sh /path/to/checkpoints/hf-to-nemo/llm-jp--llm-jp-3-7.2b

Direct Preference Optimization

You may need to change the Slurm job-name before running the script.

sbatch scripts/train/dpo_1.8b.sh ${INPUT_NEMO_PATH}

Sample Code

# 1.8B model with 2 nodes (16 GPUs)
sbatch --nodes 2 scripts/train/dpo_1.8b.sh /path/to/checkpoints/hf-to-nemo/sft-model1
# 7.2B model with 4 nodes (32 GPUs)
sbatch --nodes 4 scripts/train/dpo_7.2b.sh /path/to/checkpoints/hf-to-nemo/sft-model2