Get Mystery Box with random crypto!

New TTS Models for Minority Languages of the CIS / Russia In | Data Science by ODS.ai 🦜

New TTS Models for Minority Languages of the CIS / Russia

In
collaboration with the community, we created totally unique models for the languages of the peoples of Russia / the CIS:

- Bashkir (aigul_v2)
- Kalmyk (erdni_v2)
- Tatar (dilyara_v2)
- Uzbek (dilnavoz_v2)

We also tried to create the Ukrainian voice, but the data we had (sourced from audiobooks) was not very good (all other voices were created from recordings).

Some models sound almost perfect, some a bit worse. Typically this boils down to how speakers can provide steady consistent recordings.

We used anywhere from 1 hour to 6 hours of recordings to create each voice.

These models obviously do not include automated stress and have the same major caveats as other v2 models (i.e. best used with batch size 1 on 2-4 CPU threads).

Link

- https://github.com/snakers4/silero-models#text-to-speech