Oops, I was going through the checklist and realised we missed this rather important feature 💀
This is what is required to fine-tune pretrained base models... Will ship this ASAP!!!
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

trainer = SFTTrainer(
    model="Qwen/Qwen3-0.6B-Base",  # pretrained base model
    args=SFTConfig(
        output_dir="Qwen3-0.6B-Instruct",
        # Reuse the chat template from this tokenizer (Hub model ID or local path)
        chat_template_path="HuggingFaceTB/SmolLM3-3B",
    ),
    train_dataset=load_dataset("trl-lib/Capybara", split="train"),
)
trainer.train()