Can Zipformer's 130ms of padding be reduced further? #2089
uni-rini-sharon
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
The 13 frame padding for convolution and right context in zipformer is in effect adding 130ms of latency to the overall streaming ASR processing. Is there any way we can reduce this ? Has anyone tried training with causality or other methods to bring this down?
We see that the 13 frames constitute 7 frames for convolutional kernel functioning and 6 for zipformer's internal right context.
@csukuangfj @pkufool
Beta Was this translation helpful? Give feedback.
All reactions