Add silence skipping to remove long delays before and after speech #187

Open: mobad wants to merge 9 commits into base: compose

@mobad mobad commented Apr 18, 2024

When using this app in English, there is a delay of roughly 0.5 s after every sentence, which makes the TTS annoyingly slow.
This change adds a silence-skipping audio processor that runs before Sonic processing, which makes playback much nicer.
It's currently not optional, but it shouldn't be too hard to make it configurable.
There are also a lot of other changes, such as increasing the minimum API level, upgrading ExoPlayer, and fixing streaming Opus decoding on the non-ExoPlayer path, along with some other minor changes. I don't really expect this PR to be used as is, but it can serve as a reference for future changes.
The InputStreamMediaDataSource changes are particularly important, though, as streaming Opus decoding is broken without them.
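
For readers unfamiliar with the idea, silence skipping at the start and end of an utterance can be sketched roughly as below. This is a minimal, self-contained illustration on raw 16-bit PCM samples, not the code in this PR: the class name `SilenceTrimmer`, the method `trim`, and the amplitude threshold are all hypothetical, and a real audio processor would work on streamed buffers rather than one complete array.

```java
import java.util.Arrays;

/** Hypothetical helper: trims leading and trailing near-silence from 16-bit PCM samples. */
final class SilenceTrimmer {
    private SilenceTrimmer() {}

    /**
     * Returns a copy of {@code samples} without the leading and trailing runs of
     * samples whose absolute amplitude is at or below {@code threshold}.
     * Quiet samples between louder ones are kept, so pauses inside speech survive.
     */
    static short[] trim(short[] samples, int threshold) {
        // Skip quiet samples at the front.
        int start = 0;
        while (start < samples.length && Math.abs(samples[start]) <= threshold) {
            start++;
        }
        // Skip quiet samples at the back, never crossing the start index.
        int end = samples.length;
        while (end > start && Math.abs(samples[end - 1]) <= threshold) {
            end--;
        }
        return Arrays.copyOfRange(samples, start, end);
    }
}
```

For example, `SilenceTrimmer.trim(new short[] {0, 3, 1200, -900, 1}, 10)` returns `{1200, -900}`. The threshold of 10 is only an assumption for the example; a usable value depends on the noise floor of the TTS output.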
