It was tough to get this working, but I think I've figured it out enough to share.
Here's a quick guide on how to set up LLaMA-Factory with support for Flash Attention 2 and Unsloth training on Windows. This setup uses an RTX 3060 12GB GPU, Windows 10, and CUDA 12.1.
Unsloth is an optimization library that claims up to a 2x training speedup with no trade-off in accuracy.
There's also a quick and dirty script to convert bulk raw text to a dataset file, and a short overview of the dataset setup. I also cover how to fix the error when loading the trained adapter in the Text Generation WebUI, caused by mismatched PEFT library versions. Rough sketches of the commands, the script, and the config fix are included below the timestamps.
[00:00] Intro & Topics: Installing LLaMA-Factory, Unsloth; Adding Datasets; Making Datasets; Training
[01:05] System Specs... Probably CUDA 12.1 only?
[01:27] System requirements; Microsoft Build Tools, etc.
[01:55] Creating the Conda environment and installing dependencies (commands sketched below)
[02:05] Install Clang
[02:27] Install Flash Attention 2
[02:44] Install LLaMA-Factory requirements
[03:01] Install LLaMA-Factory
[03:15] Reinstall Numpy; Install Triton for Windows
[03:42] Datasets (registration example below)
[05:10] Script to convert .txt to an Alpaca-format .json with a single text column (sketch below)
[05:42] Run training with Unsloth (example command below)
[06:10] Loading the LoRA adapter in the Text Generation WebUI (fixing config file errors; fix sketched below)
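
For reference, here's a rough sketch of the install sequence from the video. Treat it as a sketch, not a copy-paste recipe: the Python version, package pins, and the Triton package name are assumptions, and on Windows you'll usually want prebuilt wheels for flash-attn and Triton rather than compiling them yourself.

```
# Run in an Anaconda/Miniconda prompt; # lines are annotations, not commands.
conda create -n llama-factory python=3.11
conda activate llama-factory

# PyTorch built against CUDA 12.1
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121

# Flash Attention 2 -- a prebuilt wheel matching your Python/CUDA/torch combo
# is much easier than compiling it yourself with Build Tools + Clang
pip install flash-attn --no-build-isolation

# LLaMA-Factory and its requirements
git clone https://github.com/hiyouga/LLaMA-Factory.git
cd LLaMA-Factory
pip install -r requirements.txt
pip install -e .

# NumPy can get bumped to an incompatible version by the steps above;
# reinstalling a 1.x release is one way to settle it (assumption)
pip install "numpy<2"

# Triton has no official Windows build; install a community wheel
# (package name is an assumption -- use whichever build the video links)
pip install triton-windows

# Unsloth itself (the video may use a git URL or extras instead)
pip install unsloth
```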
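
On the dataset side, LLaMA-Factory only sees datasets registered in data/dataset_info.json. A single-text-column file like the one produced by the script below can be registered roughly like this; the dataset name and file name are placeholders, and the exact schema can differ between versions, so check data/README.md in your checkout:

```json
{
  "my_raw_text": {
    "file_name": "raw_corpus.json",
    "columns": {
      "prompt": "text"
    }
  }
}
```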
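
The conversion script from the video isn't reproduced here; this is a minimal sketch of the same idea: split a raw .txt file into chunks and write them out as JSON records with a single "text" column. The file names and chunk size are assumptions.

```python
import json
from pathlib import Path

SRC = Path("raw_corpus.txt")   # hypothetical input file
DST = Path("raw_corpus.json")  # output dataset file
CHUNK_CHARS = 2000             # rough chunk size; tune to your context length

def chunk_paragraphs(text: str, limit: int) -> list[str]:
    """Greedily pack paragraphs into chunks of at most `limit` characters
    (a single paragraph longer than the limit becomes its own chunk)."""
    chunks, current = [], ""
    for para in text.split("\n\n"):
        para = para.strip()
        if not para:
            continue
        if current and len(current) + len(para) + 2 > limit:
            chunks.append(current)
            current = para
        else:
            current = f"{current}\n\n{para}" if current else para
    if current:
        chunks.append(current)
    return chunks

text = SRC.read_text(encoding="utf-8")
records = [{"text": chunk} for chunk in chunk_paragraphs(text, CHUNK_CHARS)]
DST.write_text(json.dumps(records, indent=2, ensure_ascii=False), encoding="utf-8")
print(f"Wrote {len(records)} records to {DST}")
```

Packing by paragraph keeps chunks from breaking mid-thought; if your source text has no blank-line breaks, split on sentences or a fixed character count instead.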
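
Training can be driven from LLaMA-Factory's web UI, but if you'd rather use the CLI, the key piece is the use_unsloth flag. This is a hedged example, not the exact command from the video: the entry point varies between versions (older checkouts use src/train_bash.py, newer ones ship a llamafactory-cli command), and the model, template, targets, and output path are placeholders.

```
python src/train_bash.py ^
    --stage sft ^
    --do_train ^
    --model_name_or_path meta-llama/Llama-2-7b-hf ^
    --dataset my_raw_text ^
    --template default ^
    --finetuning_type lora ^
    --lora_target q_proj,v_proj ^
    --use_unsloth ^
    --output_dir saves/my-lora
```

The `^` is cmd.exe line continuation; in PowerShell use a backtick, or put everything on one line.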
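
Finally, the adapter-loading error in the Text Generation WebUI: a LoRA saved by a newer PEFT can write keys into adapter_config.json that the older PEFT bundled with the WebUI doesn't recognize, so loading fails with an "unexpected keyword argument" error. Upgrading PEFT in the WebUI's environment is one fix; the other is deleting the offending keys from the config file. A throwaway sketch of the latter, where the path and key list are examples (remove whichever keys your error message actually names):

```python
import json
from pathlib import Path

# Strip adapter_config.json keys an older PEFT rejects (key list is an example)
cfg_path = Path("loras/my-lora/adapter_config.json")  # hypothetical path
cfg = json.loads(cfg_path.read_text(encoding="utf-8"))
for key in ("use_dora", "use_rslora", "layer_replication", "loftq_config"):
    cfg.pop(key, None)
cfg_path.write_text(json.dumps(cfg, indent=2), encoding="utf-8")
```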