Skip to content

Add custom chat template support for instructing tuning #62

@supreme-gg-gg

Description

@supreme-gg-gg

Oops was doing checklist and realised we missed this rather important feature 💀

This is what is required to fine tuned pretrained base models... Will ship this ASAP!!!

from trl import SFTConfig, SFTTrainer
from datasets import load_dataset

trainer = SFTTrainer(
    model="Qwen/Qwen3-0.6B-Base",
    args=SFTConfig(
        output_dir="Qwen3-0.6B-Instruct",
        chat_template_path="HuggingFaceTB/SmolLM3-3B",
    ),
    train_dataset=load_dataset("trl-lib/Capybara", split="train"),
)
trainer.train()

Metadata

Metadata

Assignees

No one assigned

    Labels

    trainingFine tuning related features

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions