Speech-to-Text models for Congolese dialect of Swahili Language. Contains acoustic and language models to be used with deepspeech based ASR.
Acoustic model was trained on Gamayun audio mini-kit and TICO-19 testset using Coqui STT v0.10.0a13.
Total train size: 8.93 (mini-kit) + 3.27 (TICO-19 testset) = 12.2 hours
Dev size: 0.49 hours (mini-kit)
Test size: 1.71 hours (TICO-19 devset)
Contains two language models (scorers):
- General purpose language model (swc-general.scorer) is trained on a 37.7M word mixed Swahili text corpus
- Commands language model (swc-commands.scorer) is trained on 12 commands (numbers from 1 to 10 and yes/no) which are listed in `vocab-commands.txt`.
Recognition accuracy:
Developer: Alp Öktem
Disclaimer: This model is not tested in production and is provided as-is without any warranty.