French Text to Speech Voice (siwis)

Voice and vocoder models for larynx based on the SIWIS.

Used in Rhasspy in the rhasspy-tts-larynx-hermes service.

Usage

$ larynx \
    --model /path/to/tts-checkpoint.pth.tar \
    --vocoder-model /path/to/vocoder-checkpoint.pth.tar \
    --output-file /path/to/output.wav \
    'Merci beaucoup!'

Docker

Run a web server at http://localhost:5002

$ docker run -it -p 5002:5002 \
    --device /dev/snd:/dev/snd \
    rhasspy/larynx:fr-siwis-1

Endpoints:

/api/tts - returns WAV audio for text
- GET with ?text=...
- POST with text body
/api/phonemize - returns phonemes for text
- GET with ?text=...
- POST with text body
/process - compatibility endpoint to emulate MaryTTS
- GET with ?INPUT_TEXT=...

Model Details

Type: Glow-TTS
Sample rate: 22050 Hz
Frequency range: 0-8000 Hz

See configuration for details.

Vocoder Details

Type: Multi-band MelGAN
Sample rate: 22050 Hz
Frequency range: 0-8000 Hz

See configuration for details.

Files

Some files are split into multiple parts so that they can be uploaded to GitHub. This is done with the split command:

split -d -b 25M FILE FILE.part-

They can be recombined simply with:

cat FILE.part-* > FILE

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
samples		samples
vocoder		vocoder
LICENSE		LICENSE
README.md		README.md
checkpoint_270000.pth.tar.gz.part-00		checkpoint_270000.pth.tar.gz.part-00
checkpoint_270000.pth.tar.gz.part-01		checkpoint_270000.pth.tar.gz.part-01
checkpoint_270000.pth.tar.gz.part-02		checkpoint_270000.pth.tar.gz.part-02
checkpoint_270000.pth.tar.gz.part-03		checkpoint_270000.pth.tar.gz.part-03
checkpoint_270000.pth.tar.gz.part-04		checkpoint_270000.pth.tar.gz.part-04
checkpoint_270000.pth.tar.gz.part-05		checkpoint_270000.pth.tar.gz.part-05
checkpoint_270000.pth.tar.gz.part-06		checkpoint_270000.pth.tar.gz.part-06
checkpoint_270000.pth.tar.gz.part-07		checkpoint_270000.pth.tar.gz.part-07
checkpoint_270000.pth.tar.gz.part-08		checkpoint_270000.pth.tar.gz.part-08
checkpoint_270000.pth.tar.gz.part-09		checkpoint_270000.pth.tar.gz.part-09
checkpoint_270000.pth.tar.gz.part-10		checkpoint_270000.pth.tar.gz.part-10
config.json		config.json
coverage.json		coverage.json
profile.yml		profile.yml
scale_stats.npy		scale_stats.npy

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

French Text to Speech Voice (siwis)

Usage

Docker

Model Details

Vocoder Details

Files

About

Releases

Packages

License

rhasspy/fr_larynx-siwis

Folders and files

Latest commit

History

Repository files navigation

French Text to Speech Voice (siwis)

Usage

Docker

Model Details

Vocoder Details

Files

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages