Fine-Tuning Llama 3.1 8B with Unsloth

Overview

This repository contains code and instructions for fine-tuning the latest Llama 3.1 8B model using Unsloth. The goal is to enhance the model's performance and customizability for specific use cases by leveraging supervised fine-tuning (SFT) techniques.

Introduction

Llama 3.1 offers state-of-the-art performance, and fine-tuning this model can provide better results for custom applications at a lower cost compared to using general-purpose LLMs. This project demonstrates how to fine-tune Llama 3.1 8B on Google Colab using Unsloth, focusing on QLoRA for efficient memory usage.

Fine-Tuning Techniques

Supervised Fine-Tuning (SFT)

SFT improves and customizes pre-trained LLMs by retraining them on a smaller dataset of instructions and answers. It transforms a basic model into an assistant capable of following instructions and answering questions.

Techniques Used

Full Fine-Tuning: Retrains all parameters of the model.
Low-Rank Adaptation (LoRA): Introduces small adapters at each targeted layer, reducing memory usage and training time.
Quantization-aware Low-Rank Adaptation (QLoRA): An extension of LoRA, providing greater memory savings with slightly longer training times.

Implementation

Prerequisites

To run this project, you will need the following libraries:

pip install "unsloth[colab-new] @ git+https://github.com/unslothai/unsloth.git"
pip install --no-deps "xformers<0.0.27" "trl<0.9.0" peft accelerate bitsandbytes

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
src		src
Llama_3_1_8b_finetunning_using_unsloth.ipynb		Llama_3_1_8b_finetunning_using_unsloth.ipynb
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fine-Tuning Llama 3.1 8B with Unsloth

Overview

Introduction

Fine-Tuning Techniques

Supervised Fine-Tuning (SFT)

Techniques Used

Implementation

Prerequisites

About

Releases

Packages

Languages

Habib-Rehmn/llama_3.1

Folders and files

Latest commit

History

Repository files navigation

Fine-Tuning Llama 3.1 8B with Unsloth

Overview

Introduction

Fine-Tuning Techniques

Supervised Fine-Tuning (SFT)

Techniques Used

Implementation

Prerequisites

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages