fixed outetts docs

Concedo 2025-04-07 21:31:43 +08:00
parent 6e42e673c6
commit a3f7de7142
2 changed files with 2 additions and 2 deletions

@@ -15,7 +15,7 @@ However, it can be slightly challenging to set it up.
 You can also create your own cloned speaker voice.
 To do that, you will need Python 3.10+ with the `outetts` package installed. You can install it with this commands:
 ```
-pip install outetts --no-deps
+pip install outetts==0.3.3 --no-deps
 pip install scipy einops pyyaml huggingface-hub encodec matplotlib transformers pytorch-lightning tensorboardX soundfile numpy jsonargparse torchcrepe librosa inflect loguru polars natsort tqdm sounddevice mecab-python3 unidic-lite uroman openai-whisper pygame accelerate
 ```
 Then after that, obtain a wav file containing a sample of the voice you want to clone.
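
Note on the pin: this change fixes the documented install at `outetts==0.3.3`. A minimal sketch for confirming that an environment matches the pin, using only the standard library; the helper name `check_pinned` is hypothetical and not part of this repository or of `outetts`:

```python
from importlib import metadata


def check_pinned(package: str, expected: str) -> str:
    """Return 'ok', 'mismatch', or 'missing' for an installed package.

    Hypothetical helper for illustration only.
    """
    try:
        installed = metadata.version(package)
    except metadata.PackageNotFoundError:
        # The distribution is not installed in this environment.
        return "missing"
    return "ok" if installed == expected else "mismatch"


if __name__ == "__main__":
    # 0.3.3 is the version pinned by the install command in this diff.
    print("outetts:", check_pinned("outetts", "0.3.3"))
```

A result of `mismatch` would suggest reinstalling with the pinned command shown in the docs.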

@@ -9,7 +9,7 @@ model_config = outetts.HFModelConfig_v1(
 interface = outetts.InterfaceHF(model_version="0.2", cfg=model_config)
 speaker = interface.create_speaker(
-audio_path="/path/to/audio.wav",
+audio_path="input_audio.wav",
 # If transcript is not provided, it will be automatically transcribed using Whisper
 transcript=None, # Set to None to use Whisper for transcription