New TTS Models for Minority Languages of the CIS / Russia In | Data Science by ODS.ai 🦜

New TTS Models for Minority Languages of the CIS / Russia

In collaboration with the community, we created totally unique models for the languages of the peoples of Russia / the CIS:

- Bashkir (aigul_v2)
- Kalmyk (erdni_v2)
- Tatar (dilyara_v2)
- Uzbek (dilnavoz_v2)

We also tried to create the Ukrainian voice, but the data we had (sourced from audiobooks) was not very good (all other voices were created from recordings).

Some models sound almost perfect, some a bit worse. Typically this boils down to how speakers can provide steady consistent recordings.

We used anywhere from 1 hour to 6 hours of recordings to create each voice.

These models obviously do not include automated stress and have the same major caveats as other v2 models (i.e. best used with batch size 1 on 2-4 CPU threads).

Link

- https://github.com/snakers4/silero-models#text-to-speech

Data Science by ODS.ai 🦜

💩 51.75K
Technologies

First Telegram Data Science channel. Covering all technical and popular staff about anything related to Data Science: AI, Big Data, Machine Learning, Statistics, general Math and the applications of f...

Join
▲ Vote (1)

New TTS Models for Minority Languages of the CIS / Russia In | Data Science by ODS.ai 🦜

Login