What does HackerNews think of TTS?

πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python

#15 in Python
For offline/local TTS, Coqui TTS [0] is quite good. It's essentially a continuation of Mozilla's TTS engine that Mozilla stopped working on ~2 years ago (and IIRC it's largely the same team that worked on Mozilla TTS).

[0] https://github.com/coqui-ai/TTS

https://coqui.ai/

https://github.com/coqui-ai/TTS

I can never remember the name but always google: incessant loud chirp of the invasive frog

I wanted to make a human-like reading feature for our language-learning software. Training a model isn't too hard using something like https://github.com/coqui-ai/TTS.

The weak link was the available free/open datasets. You needed a single speaker with a pleasant voice, 20hrs+ material from varied sources, recorded in a good recording enviroment with a good mic etc. For English, the go-to was LJSpeech, which doesn't fulfill all these requirements. I say 'was', as I haven't followed developments recently.

Last year we decided to make our own dataset with a Irish woman, Jenny. She has a soft Irish lilt.

Never got around around to training the model, but I will upload the raw audio and prompts here in a few hours (need to pay my internet bill in town..):

https://github.com/dioco-group/jenny-tts-dataset/blob/main/R...

No need to be invited. Between their GitHub[1] page and the documentation[2], you'll find everything you need to get started.

[1] https://github.com/coqui-ai/TTS [2] https://tts.readthedocs.io/en/latest/

Looks like they've got some decently popular open source repos - https://github.com/coqui-ai/TTS for example - so I wouldn't be surprised at all if Google and others used that, even if only for their own research and comparison. That's my charitable interpretation anyway.
Hasn't been linked yet so: https://github.com/coqui-ai/TTS Derived from Mozilla's TTS project. Big fan.
I agree so much, that I've started learning ML to make a decent opensource many-languages TTS working on smartphones.

But really, the situation is pretty good, with a lot of code and dataset available as opensource. Notably, if you're not constrained to smartphones and the like, you can run on your computer quite a number of modern models, see for instance https://github.com/coqui-ai/TTS/ (which itself contains many different models).

The work that needs to be done is """just""" to turn those models into something suitable for smartphones (which will most likely include re-training), and to plug them back into Android's TTS API.

Cool article, I do like the extensibility provided via the unix philosophy. Another thing you can do nowadays is use off-the-shelf deep neural networks to do tts eg

https://github.com/NVIDIA/tacotron2

https://github.com/mozilla/TTS

https://github.com/CorentinJ/Real-Time-Voice-Cloning

https://github.com/coqui-ai/TTS

They're not all easy to setup however

https://github.com/coqui-ai/TTS the continuation of Mozilla TTS produces quite nice results I you pick the right models
This may be off-topic but: What's the relationship between Coqui (an OSS TTS startup) https://coqui.ai/about and Mozilla? I recall that the project at one point was called mozilla/TTS (https://github.com/mozilla/TTS/) and now I see that has a fork in the startup's own repo (https://github.com/coqui-ai/TTS). Presumably Common Voice is used to train mozilla/TTS and other OSS TTS solutions?
Hi I'm not blind but wanna just dump some links here I bookmarked related to accessibility. Might be useful.

[1] The actively developed version of Mozilla TTS, named coqui-TTS. My understanding is that the original team was let go from Mozilla and they formed coqui.

https://github.com/coqui-ai/TTS

They are also on Element Matrix:

https://matrix.to/#/#coqui-ai_TTS:gitter.im

[2] FOSS automated accessibility testing engine for websites and other HTML-based user interfaces:

https://github.com/dequelabs/axe-core

[3] Emacspeak, developed by someone who was blind since childhood:

https://en.wikipedia.org/wiki/Emacspeak

[4] UK government websites are famous for being accessible. They have design guidelines:

https://design-system.service.gov.uk/

[5] Similar system for the US govt.

https://designsystem.digital.gov/

[6] Mozilla MDN learn accessibility:

https://developer.mozilla.org/en-US/docs/Web/Accessibility

Check out Coqui TTS where we continue the work.

https://github.com/coqui-ai/TTS

Mozilla TTS is not maintained anymore (at least ATM).

Disclaimer: I've created both of the projects.

I think the creator of the project has moved on and it is not maintained anymore. But they have https://github.com/coqui-ai/TTS