Instructions to use kyutai/pocket-tts-without-voice-cloning with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Pocket-TTS
How to use kyutai/pocket-tts-without-voice-cloning with Pocket-TTS:
from pocket_tts import TTSModel import scipy.io.wavfile tts_model = TTSModel.load_model("kyutai/pocket-tts-without-voice-cloning") voice_state = tts_model.get_state_for_audio_prompt( "hf://kyutai/tts-voices/alba-mackenna/casual.wav" ) audio = tts_model.generate_audio(voice_state, "Hello world, this is a test.") # Audio is a 1D torch tensor containing PCM data. scipy.io.wavfile.write("output.wav", tts_model.sample_rate, audio.numpy()) - Notebooks
- Google Colab
- Kaggle
Advantage of without-voice-cloning
#2
by TomSchelsen - opened
Hi,
I failed to find what advantage without-voice-cloning provides over the plain pocket-tts model. The READMEs and the repository size seem identical. Could you indicate why one should choose this repo/model over the plain pocket-tts one ? Many thanks