fixed outetts docs

This commit is contained in:
Concedo 2025-04-07 21:31:43 +08:00
parent 6e42e673c6
commit a3f7de7142
2 changed files with 2 additions and 2 deletions

View file

@ -15,7 +15,7 @@ However, it can be slightly challenging to set it up.
You can also create your own cloned speaker voice.
To do that, you will need Python 3.10+ with the `outetts` package installed. You can install it with this commands:
```
pip install outetts --no-deps
pip install outetts==0.3.3 --no-deps
pip install scipy einops pyyaml huggingface-hub encodec matplotlib transformers pytorch-lightning tensorboardX soundfile numpy jsonargparse torchcrepe librosa inflect loguru polars natsort tqdm sounddevice mecab-python3 unidic-lite uroman openai-whisper pygame accelerate
```
Then after that, obtain a wav file containing a sample of the voice you want to clone.

View file

@ -9,7 +9,7 @@ model_config = outetts.HFModelConfig_v1(
interface = outetts.InterfaceHF(model_version="0.2", cfg=model_config)
speaker = interface.create_speaker(
audio_path="/path/to/audio.wav",
audio_path="input_audio.wav",
# If transcript is not provided, it will be automatically transcribed using Whisper
transcript=None, # Set to None to use Whisper for transcription