What does HackerNews think of tortoise-tts-fast?

Fast TorToiSe inference (5x or your money back!)

Language: Jupyter Notebook

The rules for live DJ sets is very different from copyright rules that limit published music. This is the same as when artists use samples of meme audio, which otherwise would be copyrighted. Guetta has a long very large body of work and deserves the benefit of the doubt when it comes to the question of Eminem okaying it. He very clearly wants there to be a discussion about this. Celebrity impersonators for singing have been a thing for a long time. This isn’t about one artist in particular.

I was playing with Tortoise TTS and was genuinely surprised by how good it is with just a few minutes of clean audio. It didn’t take me hours to train or fine tune, the generation step is sort of long, 5-7 minutes for 30 seconds, but it feels really similar to stable diffusion where you do quick test with slow samples and iterations to find a decent seed, and then you let it do a more complete regeneration. It’s zero shot generation that ran on my laptop 2070 max q and i7 10750h. It’s not perfect but it’s believable when layered with music.

https://github.com/152334H/tortoise-tts-fast