Fine tuning my first open LLM model
Disclaimer: This is a draft blogpost. I am dumping links, quotes and thoughts here until I'm actually ready to finalize the blogpost.
Table of Contents
Recommendations
Source: https://unsloth.ai/docs/get-started/fine-tuning-llms-guide
My Hardware and Software choices
- Using Macbook Air M5 32GB
- Why Unsloth?
- No particular reason, it's a decent option from reddit reviews and seems easy to get started.
- As recommended by Unsloth's docs, I will start with simplest setup first:
- I want to fine tune a small model first: Qwen 3.5 9B (around 8B as recommended by Unsloth)
- But it seems that Qwen 3.5 9B doesn't has Instruct, so maybe a GGUF+4-bit Qwen3 8B model instead which seems to have Instruct variant https://unsloth.ai/docs/get-started/unsloth-model-catalog#qwen-models
- I will try QLoRA first as recommended by Unsloth docs.
- Minimum 100 rows for fine tuning, but possibly upto 1000 rows if huggingface or Codex can give training data in ChatML or ShareGPT format needed for Instruct models.
- I want to fine tune a small model first: Qwen 3.5 9B (around 8B as recommended by Unsloth)
