You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was wondering whether llama-tts doesn't support streaming (and instead requires generating the entire audio file from the text) due to technical limitations or if it simply hasn't been implemented yet.
I mean, in the same codebase we have support for text to text (large language models) and for text to speech (llama-tts) so since llama.cpp supports streaming for text generation...maybe we can pass that stream to the tts and get an audio stream?
I'm trying to understand if this would be feasible/possible.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
I was wondering whether llama-tts doesn't support streaming (and instead requires generating the entire audio file from the text) due to technical limitations or if it simply hasn't been implemented yet.
I mean, in the same codebase we have support for text to text (large language models) and for text to speech (llama-tts) so since llama.cpp supports streaming for text generation...maybe we can pass that stream to the tts and get an audio stream?
I'm trying to understand if this would be feasible/possible.
Beta Was this translation helpful? Give feedback.
All reactions