mirror of https://github.com/LostRuins/koboldcpp.git
synced 2025-09-07 15:49:05 +00:00
fixed outetts docs
parent 6e42e673c6
commit a3f7de7142
2 changed files with 2 additions and 2 deletions
@@ -15,7 +15,7 @@ However, it can be slightly challenging to set it up.
 You can also create your own cloned speaker voice.
 To do that, you will need Python 3.10+ with the `outetts` package installed. You can install it with this commands:
 ```
-pip install outetts --no-deps
+pip install outetts==0.3.3 --no-deps
 pip install scipy einops pyyaml huggingface-hub encodec matplotlib transformers pytorch-lightning tensorboardX soundfile numpy jsonargparse torchcrepe librosa inflect loguru polars natsort tqdm sounddevice mecab-python3 unidic-lite uroman openai-whisper pygame accelerate
 ```
 Then after that, obtain a wav file containing a sample of the voice you want to clone.
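
Since the install command now pins `outetts==0.3.3`, it can help to confirm that the active environment actually has that version before running the voice-cloning script. The following is a minimal sketch using only the standard library; `installed_version` and `check_pin` are hypothetical helper names chosen for illustration and are not part of outetts or KoboldCpp.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional

# Version pinned by the updated install command in this commit.
EXPECTED_OUTETTS = "0.3.3"


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def check_pin(package: str, expected: str) -> str:
    """Describe whether `package` is installed at exactly the `expected` version."""
    found = installed_version(package)
    if found is None:
        return f"{package} is not installed"
    if found != expected:
        return f"{package} {found} is installed, expected {expected}"
    return f"{package} {expected} is installed"


if __name__ == "__main__":
    print(check_pin("outetts", EXPECTED_OUTETTS))
```

Because the first command uses `--no-deps`, pip does not pull in dependencies for `outetts`; this check only covers the `outetts` version itself, not the packages listed in the second command.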
@@ -9,7 +9,7 @@ model_config = outetts.HFModelConfig_v1(
 interface = outetts.InterfaceHF(model_version="0.2", cfg=model_config)
 
 speaker = interface.create_speaker(
-    audio_path="/path/to/audio.wav",
+    audio_path="input_audio.wav",
 
     # If transcript is not provided, it will be automatically transcribed using Whisper
     transcript=None,  # Set to None to use Whisper for transcription
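
The new `audio_path="input_audio.wav"` is a relative path, so it is looked up in whatever directory the script is run from. A quick sanity check on the file before handing it to `create_speaker` can catch a missing or unreadable sample early. This is a rough sketch using the standard-library `wave` module; `describe_wav` is a hypothetical helper, not an outetts function, and it only handles PCM WAV files that `wave` can read.

```python
import wave
from pathlib import Path


def describe_wav(path):
    """Return basic properties of a PCM WAV file.

    Raises FileNotFoundError if the file is missing and wave.Error if the
    standard-library reader cannot parse it (for example, non-PCM encodings).
    """
    # Resolve relative paths against the current working directory.
    wav_path = Path(path).expanduser().resolve()
    if not wav_path.is_file():
        raise FileNotFoundError(f"No audio file at {wav_path}")
    with wave.open(str(wav_path), "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "path": str(wav_path),
            "channels": wav.getnchannels(),
            "sample_rate": rate,
            "duration_s": frames / rate if rate else 0.0,
        }
```

Calling `describe_wav("input_audio.wav")` from the directory that holds the sample returns its channel count, sample rate, and duration in seconds, which is enough to confirm that the script will find the intended file.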