Training with older v0 versus new v1 #867

AsmoKoskinen · 2025-03-17T07:21:19Z

"F5-TTS v1 base model with better training and inference performance."

I trained a Finnish model at home on an RTX 4070 Ti Super (16 GB). The dataset was about four days of Finnish speech (total duration of all wav files: 106:10:05.17, HH:MM:SS.ss). Training took approximately four days.

What are the benefits of retraining the model with v1? I might buy a 5090 (32 GB) one day, but I have no plans to train before that.

Answered by SWivid

SWivid · 2025-03-17T07:28:44Z

Maintainer

What are the benefits of retraining the model with v1?

"F5-TTS v1 base model with better training and inference performance." is in comparison with v0.
If you use the same settings (learning rate, max updates, total batch size), v1 converges faster and gives better WER/SIM results (lower word error rate, higher speaker similarity).
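
For readers unfamiliar with the two metrics, here is a minimal sketch of how they are commonly computed. It is not the evaluation code used by F5-TTS: `word_error_rate` and `cosine_similarity` are hypothetical helper names, the WER here is the plain word-level edit distance divided by the reference length, and SIM is assumed to be the cosine similarity between two speaker embeddings produced by some separate speaker-verification model.

```python
import math


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the processed reference prefix and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# One substitution (b -> x) and one deletion (d) against four reference words.
print(word_error_rate("a b c d", "a x c"))
```

In practice the hypothesis text would come from running a speech recognizer on the synthesized audio, so the measured WER also depends on the recognizer.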

3 replies

AsmoKoskinen Mar 17, 2025
Author

Thank you. When I get a faster GPU some day, I will do that again.

AsmoKoskinen Mar 21, 2025
Author

I started a new training run with the same Finnish dataset as before. It looks good after the first day, and it sounds good too.

RTX 4070 Ti Super.

{
  "exp_name": "F5TTS_v1_Base",
  "learning_rate": 0.0001,
  "batch_size_per_gpu": 2000,
  "batch_size_type": "frame",
  "max_samples": 96,
  "grad_accumulation_steps": 16,
  "max_grad_norm": 0.3,
  "epochs": 200,
  "num_warmup_updates": 3000,
  "save_per_updates": 10000,
  "keep_last_n_checkpoints": -1,
  "last_per_updates": 5000,
  "finetune": true,
  "file_checkpoint_train": "",
  "tokenizer_type": "pinyin",
  "tokenizer_file": "",
  "mixed_precision": "bf16",
  "logger": "tensorboard",
  "bnb_optimizer": true
}
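
To put the batch settings above in perspective, the following sketch estimates how much audio goes into one optimizer update. It assumes a single GPU (as in this thread) and that frame-based batching caps each batch at `batch_size_per_gpu` mel frames, so the result is an upper bound. The sample rate of 24 kHz and hop length of 256 are assumptions about the mel-spectrogram settings; check them against your own configuration. The helper names are hypothetical.

```python
# Values copied from the config above.
config = {
    "batch_size_per_gpu": 2000,      # frames per batch (batch_size_type is "frame")
    "grad_accumulation_steps": 16,
}

# Assumptions, not taken from the thread: verify against your mel-spectrogram settings.
ASSUMED_SAMPLE_RATE = 24_000  # Hz
ASSUMED_HOP_LENGTH = 256      # audio samples per mel frame


def frames_per_update(cfg: dict, num_gpus: int = 1) -> int:
    """Upper bound on mel frames contributing to one optimizer update."""
    return cfg["batch_size_per_gpu"] * cfg["grad_accumulation_steps"] * num_gpus


def frames_to_seconds(frames: int,
                      sample_rate: int = ASSUMED_SAMPLE_RATE,
                      hop_length: int = ASSUMED_HOP_LENGTH) -> float:
    """Convert a mel-frame count to seconds of audio."""
    return frames * hop_length / sample_rate


frames = frames_per_update(config)
print(f"{frames} frames per update, about {frames_to_seconds(frames):.0f} s of audio")
```

Raising `grad_accumulation_steps` is the usual way to reach a larger total batch size on a 16 GB card without increasing memory use per batch.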

AsmoKoskinen Mar 23, 2025
Author

Done. Three days, 153 epochs and 195 000 steps. Dataset was 106 hours.
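
The figures in this reply are enough for a rough throughput estimate. The sketch below treats "three days" as exactly 72 hours, which is an approximation, and takes the 200-epoch target from the posted config. `run_stats` is a hypothetical helper, and the projection assumes the speed would stay constant.

```python
def run_stats(total_steps: int, epochs_done: int, elapsed_hours: float) -> dict:
    """Derive simple per-epoch and per-hour rates from the run totals."""
    return {
        "steps_per_epoch": total_steps / epochs_done,
        "steps_per_hour": total_steps / elapsed_hours,
        "hours_per_epoch": elapsed_hours / epochs_done,
    }


# Totals reported in the thread; 72.0 hours is an approximation of "three days".
stats = run_stats(total_steps=195_000, epochs_done=153, elapsed_hours=72.0)

# Hypothetical projection: time needed to finish the configured 200 epochs.
target_epochs = 200
remaining_hours = (target_epochs - 153) * stats["hours_per_epoch"]

for name, value in stats.items():
    print(f"{name}: {value:.2f}")
print(f"remaining_hours_to_{target_epochs}_epochs: {remaining_hours:.1f}")
```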

AsmoKoskinen · 2025-03-23T08:25:02Z

Author

Claude says:

Summary of the Training Process

The training has progressed excellently and achieved its goals! Here's a summary of the final screenshots:

Training Status

Progress: 153/200 epochs (76.5% complete)
Total time: 3 days 0 hours 17 minutes
Step count: 195.1k
Current loss: ~0.627 (smoothed), individual values vary

Loss Curve Analysis

Loss has decreased steadily throughout the entire training from the initial ~0.67 to the current ~0.627
During the last day, the loss curve shows a particularly clear downward trend
The curve shows no signs of plateauing, suggesting the model could benefit from additional training

Individual Loss Values

The terminal view shows excellent individual values:

0.441, 0.466, 0.496, 0.497

These are significantly better than at the beginning of training

Conclusions

The training has been highly successful
The model's performance has improved consistently throughout the training
Reducing the learning rate was a good decision as it helped the model find lower loss values
The model might benefit from additional training as the loss curve continues to decline

I recommend testing different checkpoints (e.g., latest vs. epoch 100) to compare audio quality. This will show how much the audio quality has improved and which checkpoint produces the best results.
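
A note on reading the numbers in the summary above: the smoothed loss (~0.627) and the individual terminal values (0.441 to 0.497) are not directly comparable, because a smoothed curve lags behind recent values. The sketch below uses a plain exponential moving average to illustrate this. It is only in the spirit of a dashboard smoothing slider and is not TensorBoard's exact algorithm. The input series is a toy one built from numbers quoted in the summary, not the real training log, and `ema_smooth` is a hypothetical helper.

```python
def ema_smooth(values: list[float], weight: float = 0.9) -> list[float]:
    """Exponential moving average: s_t = weight * s_(t-1) + (1 - weight) * v_t."""
    if not 0.0 <= weight < 1.0:
        raise ValueError("weight must be in [0, 1)")
    if not values:
        return []
    smoothed = [values[0]]  # start the average at the first observation
    for value in values[1:]:
        smoothed.append(weight * smoothed[-1] + (1.0 - weight) * value)
    return smoothed


# Toy series: the initial loss followed by the four terminal values quoted above.
raw = [0.67, 0.441, 0.466, 0.496, 0.497]
smoothed = ema_smooth(raw, weight=0.9)

for r, s in zip(raw, smoothed):
    print(f"raw={r:.3f}  smoothed={s:.3f}")
```

With a high smoothing weight the smoothed value stays well above the latest raw values, which is why a handful of low individual readings does not by itself show that the average loss has dropped that far.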

3 replies

AsmoKoskinen Mar 23, 2025
Author

First two days:

{
  "exp_name": "F5TTS_v1_Base",
  "learning_rate": 0.0001,
  "batch_size_per_gpu": 2000,
  "batch_size_type": "frame",
  "max_samples": 96,
  "grad_accumulation_steps": 16,
  "max_grad_norm": 0.3,
  "epochs": 200,
  "num_warmup_updates": 3000,
  "save_per_updates": 10000,
  "keep_last_n_checkpoints": -1,
  "last_per_updates": 5000,
  "finetune": true,
  "file_checkpoint_train": "",
  "tokenizer_type": "pinyin",
  "tokenizer_file": "",
  "mixed_precision": "bf16",
  "logger": "tensorboard",
  "bnb_optimizer": true
}

Last day:

{
  "exp_name": "F5TTS_v1_Base",
  "learning_rate": 0.00005,
  "batch_size_per_gpu": 2000,
  "batch_size_type": "frame",
  "max_samples": 96,
  "grad_accumulation_steps": 16,
  "max_grad_norm": 0.3,
  "epochs": 200,
  "num_warmup_updates": 500,
  "save_per_updates": 5000,
  "keep_last_n_checkpoints": -1,
  "last_per_updates": 5000,
  "finetune": true,
  "file_checkpoint_train": "",
  "tokenizer_type": "pinyin",
  "tokenizer_file": "",
  "mixed_precision": "bf16",
  "logger": "tensorboard",
  "bnb_optimizer": true
}
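
Since the two configs above are long and nearly identical, a small diff makes the change between the phases easier to see. The sketch below copies both configs as Python dicts and reports only the keys whose values differ. `diff_configs` is a hypothetical helper written for this illustration.

```python
first_two_days = {
    "exp_name": "F5TTS_v1_Base", "learning_rate": 0.0001,
    "batch_size_per_gpu": 2000, "batch_size_type": "frame",
    "max_samples": 96, "grad_accumulation_steps": 16,
    "max_grad_norm": 0.3, "epochs": 200,
    "num_warmup_updates": 3000, "save_per_updates": 10000,
    "keep_last_n_checkpoints": -1, "last_per_updates": 5000,
    "finetune": True, "file_checkpoint_train": "",
    "tokenizer_type": "pinyin", "tokenizer_file": "",
    "mixed_precision": "bf16", "logger": "tensorboard",
    "bnb_optimizer": True,
}

last_day = {
    "exp_name": "F5TTS_v1_Base", "learning_rate": 0.00005,
    "batch_size_per_gpu": 2000, "batch_size_type": "frame",
    "max_samples": 96, "grad_accumulation_steps": 16,
    "max_grad_norm": 0.3, "epochs": 200,
    "num_warmup_updates": 500, "save_per_updates": 5000,
    "keep_last_n_checkpoints": -1, "last_per_updates": 5000,
    "finetune": True, "file_checkpoint_train": "",
    "tokenizer_type": "pinyin", "tokenizer_file": "",
    "mixed_precision": "bf16", "logger": "tensorboard",
    "bnb_optimizer": True,
}


def diff_configs(old: dict, new: dict) -> dict:
    """Map each key whose value changed to an (old_value, new_value) pair."""
    keys = sorted(old.keys() | new.keys())
    return {k: (old.get(k), new.get(k)) for k in keys if old.get(k) != new.get(k)}


changes = diff_configs(first_two_days, last_day)
for key, (before, after) in changes.items():
    print(f"{key}: {before} -> {after}")
```

Only three settings differ: the learning rate was halved, the warmup was shortened from 3000 to 500 updates, and checkpoints were saved every 5000 updates instead of every 10000.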

AsmoKoskinen Mar 23, 2025
Author

I will upload the files to Hugging Face (HF) and open a pull request.

AsmoKoskinen Mar 28, 2025
Author

I have added the new files to HF, but I don't think I'll open a new pull request, since there is already a link to the Finnish model on HF.
