Amazing work!
Curious how you cloned the voices - Tortoise? I've previously tried Herzog, but couldn't quite train the German accent...
I haven't tried Tortoise, thanks for pointing me to it. The voices were cloned by fine-tuning a VITS model with coqui.ai. I used about two hours of speech for each speaker. With more time and resources, I'm certain it's possible to make those voices considerably better.
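For anyone collecting their own data, here is a minimal sketch for checking how much speech a speaker's folder actually contains before fine-tuning. It is not part of Coqui; it uses only the Python standard library, and the function name `total_speech_hours` and the one-folder-per-speaker layout of uncompressed WAV clips are assumptions made for illustration.

```python
import wave
from pathlib import Path


def total_speech_hours(wav_dir):
    """Sum the duration of every .wav file under wav_dir, in hours.

    Hypothetical helper: assumes uncompressed PCM WAV clips, which is
    what Python's built-in wave module can read.
    """
    total_seconds = 0.0
    for path in sorted(Path(wav_dir).rglob("*.wav")):
        with wave.open(str(path), "rb") as wav:
            # duration of one clip = frame count / sample rate
            total_seconds += wav.getnframes() / wav.getframerate()
    return total_seconds / 3600.0
```

Calling `total_speech_hours("speakers/herzog")` (a made-up path) returns a float; a value near 2.0 would match the roughly two hours per speaker mentioned above. Note that it counts raw clip length, so silence and noise are included.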
Can I get an invite link?
No need to be invited. Between their GitHub[1] page and the documentation[2], you'll find everything you need to get started.
[1] https://github.com/coqui-ai/TTS
[2] https://tts.readthedocs.io/en/latest/