Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is there a maximum limit of characters for a single inference? #10

Open
AznamirWoW opened this issue Feb 11, 2025 · 5 comments
Open

Is there a maximum limit of characters for a single inference? #10

AznamirWoW opened this issue Feb 11, 2025 · 5 comments

Comments

@AznamirWoW
Copy link

Trying to run TTS on <200 characters seems fine, but on a larger test (~1600 characters) the result is a random mix of phrases in a random order.

@sekstini
Copy link
Collaborator

Yeah, the model was trained on clips of at most 30 seconds, so long prompts can be out of distribution. The way we handle this on the website is by chunking the text and generating audio for the chunks one at a time.

@6630507
Copy link

6630507 commented Feb 11, 2025

In the gradio interface, I get a loud and constant buzz like a sawtooth wave for anything after :30 sec.

Also happens after :30 sec using the same text in the sample python code on the project page.

@FurkanGozukara
Copy link

Yeah, the model was trained on clips of at most 30 seconds, so long prompts can be out of distribution. The way we handle this on the website is by chunking the text and generating audio for the chunks one at a time.

can you add this auto chunking to the gradio please ty

@spaghettiSystems
Copy link
Contributor

we may add it to the public inference code at some point but it's not a priority. you're invited to contribute it however

@InconsolableCellist
Copy link

I have a PR for this here

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants