That is crazy. Any way I can start using this soon? I have a backlog of articles I’d love to listen to.
TortoiseTTS might be the closest: https://github.com/neonbjb/tortoise-tts
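It's a few-shot, multi-speaker model, so you only need 3-4 short clips of a speaker to add a new voice. Strictly speaking there's no training step: the clips are used as reference audio at generation time.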
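For the article backlog, here is a rough sketch of how that could look in Python. It assumes the `tortoise` package and `torchaudio` are installed, and that a few reference clips sit in a voice folder called `my_voice` (a made-up name). The Tortoise calls follow the usage shown in the project's README as I remember it, so check them against the version you install. `split_into_chunks` and `synthesize_article` are my own hypothetical helpers, not part of the library.

```python
import re


def split_into_chunks(text, max_chars=200):
    """Group whole sentences into chunks of roughly max_chars characters.

    Hypothetical helper: short inputs tend to be easier for TTS models,
    so long articles are fed in piece by piece.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            # Adding this sentence would overflow, so close the chunk.
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def synthesize_article(text, voice="my_voice", out_prefix="article"):
    """Write one WAV file per chunk. Assumes the API shown in the README."""
    # Imported lazily so the chunking helper works without Tortoise installed.
    import torchaudio
    from tortoise.api import TextToSpeech
    from tortoise.utils.audio import load_voice

    tts = TextToSpeech()
    # "my_voice" is an assumed folder name holding the reference clips.
    voice_samples, conditioning_latents = load_voice(voice)
    for i, chunk in enumerate(split_into_chunks(text)):
        audio = tts.tts_with_preset(
            chunk,
            voice_samples=voice_samples,
            conditioning_latents=conditioning_latents,
            preset="fast",
        )
        # The README example saves output at a 24 kHz sample rate.
        torchaudio.save(f"{out_prefix}_{i:03d}.wav", audio.squeeze(0).cpu(), 24000)
```

Calling `synthesize_article(open("article.txt").read())` would then write `article_000.wav`, `article_001.wav`, and so on. Expect it to be slow without a GPU.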