I just found this.
This is huge!
As a german, I use thorsten medium as he simply made the best dataset.
Mixing english with german, speaking numbers, single letters, pausing without a “.” but just a linebreak, all those can be essential.
And… it is nearly perfect! And all local!
This is crazy!
eSpeak can finally go to rest!
Thorsten high is silly haha. Emotional is also not meant for TTS more for research I think.
I think thorsten made the only good model in German, I really want to make my own one! Or get some famous people on board?