You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Spun out of PR #86 after the bisect that closed #83 turned out to be only a partial fix.
Background
PR #86 reverted PR #70 (inter-word `<break time="50ms"/>` tags) and verified the revert resolves the Whisper WER regression on `sp_neu_a_0001` (intensity arc `[1,1,1,2,1]` — almost all I1):
The Hebrew text is byte-identical between the 04-15 reference and the revert branch (verified by `diff` — empty). So the residual sp_it WER gap is purely audio rendering, not text content. Length-ratio 0.709 is the same Whisper-silence-detector fingerprint as #83 (~28% of words missing).
What this means
There is a second TTS-side change in the post-2026-04-15 window that fires at I3+. PR #86's central claim of "sole material cause" was wrong; #86 lands as a partial fix that closes nothing automatically. This issue tracks the actual residual cause.
Suspects (sorted by prior, re-derived)
This list is broader than the original #83 list because the original list missed a candidate (#74).
Spun out of PR #86 after the bisect that closed #83 turned out to be only a partial fix.
Background
PR #86 reverted PR #70 (inter-word `<break time="50ms"/>` tags) and verified the revert resolves the Whisper WER regression on `sp_neu_a_0001` (intensity arc `[1,1,1,2,1]` — almost all I1):
But verifying the same revert on a high-intensity scene (`sp_it_a_0001`, intensity arc `[1,2,3,4,5,4,3]`) shows the regression persists:
The Hebrew text is byte-identical between the 04-15 reference and the revert branch (verified by `diff` — empty). So the residual sp_it WER gap is purely audio rendering, not text content. Length-ratio 0.709 is the same Whisper-silence-detector fingerprint as #83 (~28% of words missing).
What this means
There is a second TTS-side change in the post-2026-04-15 window that fires at I3+. PR #86's central claim of "sole material cause" was wrong; #86 lands as a partial fix that closes nothing automatically. This issue tracks the actual residual cause.
Suspects (sorted by prior, re-derived)
This list is broader than the original #83 list because the original list missed a candidate (#74).
Suggested investigation path
Reproduction (same harness as #83)
The 04-15 reference WAV exists at `data/m2a_wettest/agg_m_30-45_001/sp_it_a_0001_00.wav` (155.9 s, WER 0.056).
Things NOT to do
References