Now we have published the accompanying articles both in English and in Russian on habr.com:
- Russian - https://habr.com/ru/post/549480/
- English - https://habr.com/ru/post/549482/
📎 Speakers
10 voices (each available in 16 kHz and 8 kHz):
- 6 Russian voices;
- 1 English voice;
- 1 German voice, 1 Spanish voice, 1 French voice;
📎 Why is this Different?
- One-line usage;
- A large library of voices;
- A fully end-to-end pipeline;
- Naturally sounding speech;
- No GPU or training required;
- Minimalism and lack of dependencies;
- Faster than real-time on one CPU thread (!!!);
- Support for
16kHz and 8kHz out of the box;Please upvote, like, share and try!
📎 Bot
Someone even made a bot with it for Ksenia, but it is still buggy - @silero_tts_bot.
it bugs out when you add stress with
+ and changes its location in a weird way, but something like this works я ко+тик ко+тик ко+тик, пуши+стый живо+тик! (stress should be before letters, but it works this way, idk why).