What does HackerNews think of TTS?

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python

#3 in Deep learning

#15 in Python

QuiLLMan: Voice chat with Vicuna-13B | Apr 2023

Coqui also looks interesting.

https://github.com/coqui-ai/TTS

Support for it was recently added to vocode:

https://github.com/vocodedev/vocode-python/pull/56

Ask HN: What's the best TTS engine you've heard? | Mar 2023

For offline/local TTS, Coqui TTS [0] is quite good. It's essentially a continuation of Mozilla's TTS engine that Mozilla stopped working on ~2 years ago (and IIRC it's largely the same team that worked on Mozilla TTS).

[0] https://github.com/coqui-ai/TTS

Ask HN: Best open-source text-to-speech tools? | Mar 2023

https://coqui.ai/

https://github.com/coqui-ai/TTS

I can never remember the name but always google: incessant loud chirp of the invasive frog

This Voice Doesn't Exist – Generative Voice AI | Jan 2023

Expand Context ↕

Effectively superseded by https://github.com/coqui-ai/TTS

Apple Books digital narration | Jan 2023

Expand Context ↕

I wanted to make a human-like reading feature for our language-learning software. Training a model isn't too hard using something like https://github.com/coqui-ai/TTS.

The weak link was the available free/open datasets. You needed a single speaker with a pleasant voice, 20hrs+ material from varied sources, recorded in a good recording enviroment with a good mic etc. For English, the go-to was LJSpeech, which doesn't fulfill all these requirements. I say 'was', as I haven't followed developments recently.

Last year we decided to make our own dataset with a Irish woman, Jenny. She has a soft Irish lilt.

Never got around around to training the model, but I will upload the raw audio and prompts here in a few hours (need to pay my internet bill in town..):

https://github.com/dioco-group/jenny-tts-dataset/blob/main/R...

An AI generated, never-ending discussion between Werner Herzog and Slavoj ŽIžek | Nov 2022

Expand Context ↕

No need to be invited. Between their GitHub[1] page and the documentation[2], you'll find everything you need to get started.

[1] https://github.com/coqui-ai/TTS [2] https://tts.readthedocs.io/en/latest/

https://news.ycombinator.com/item?id=33037821 | Sep 2022

From the Coqui TTS project <https://github.com/coqui-ai/TTS>

Ask HN: Best free software to read text aloud | Aug 2022

https://github.com/coqui-ai/TTS

Show HN: Voice Clones for Creators | Jun 2022

Expand Context ↕

Looks like they've got some decently popular open source repos - https://github.com/coqui-ai/TTS for example - so I wouldn't be surprised at all if Google and others used that, even if only for their own research and comparison. That's my charitable interpretation anyway.

Show HN: Voice Clones for Creators | Jun 2022

Hasn't been linked yet so: https://github.com/coqui-ai/TTS Derived from Mozilla's TTS project. Big fan.

NaturalSpeech: End-to-end text to speech synthesis with human-level quality | May 2022

Expand Context ↕

Coqui: https://github.com/coqui-ai/TTS

NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality | May 2022

I agree so much, that I've started learning ML to make a decent opensource many-languages TTS working on smartphones.

But really, the situation is pretty good, with a lot of code and dataset available as opensource. Notably, if you're not constrained to smartphones and the like, you can run on your computer quite a number of modern models, see for instance https://github.com/coqui-ai/TTS/ (which itself contains many different models).

The work that needs to be done is """just""" to turn those models into something suitable for smartphones (which will most likely include re-training), and to plug them back into Android's TTS API.

Speech Synthesis on Linux (2020) | Sep 2021

Cool article, I do like the extensibility provided via the unix philosophy. Another thing you can do nowadays is use off-the-shelf deep neural networks to do tts eg

https://github.com/NVIDIA/tacotron2

https://github.com/mozilla/TTS

https://github.com/CorentinJ/Real-Time-Voice-Cloning

https://github.com/coqui-ai/TTS

They're not all easy to setup however

Speech Synthesis on Linux (2020) | Sep 2021

https://github.com/coqui-ai/TTS the continuation of Mozilla TTS produces quite nice results I you pick the right models

Mozilla Common Voice Adds 16 New Languages and 4,600 New Hours of Speech | Aug 2021

This may be off-topic but: What's the relationship between Coqui (an OSS TTS startup) https://coqui.ai/about and Mozilla? I recall that the project at one point was called mozilla/TTS (https://github.com/mozilla/TTS/) and now I see that has a fork in the startup's own repo (https://github.com/coqui-ai/TTS). Presumably Common Voice is used to train mozilla/TTS and other OSS TTS solutions?

WebAIM Screen Reader User Survey #9 | May 2021

Expand Context ↕

Hi I'm not blind but wanna just dump some links here I bookmarked related to accessibility. Might be useful.

[1] The actively developed version of Mozilla TTS, named coqui-TTS. My understanding is that the original team was let go from Mozilla and they formed coqui.

https://github.com/coqui-ai/TTS

They are also on Element Matrix:

https://matrix.to/#/#coqui-ai_TTS:gitter.im

[2] FOSS automated accessibility testing engine for websites and other HTML-based user interfaces:

https://github.com/dequelabs/axe-core

[3] Emacspeak, developed by someone who was blind since childhood:

https://en.wikipedia.org/wiki/Emacspeak

[4] UK government websites are famous for being accessible. They have design guidelines:

https://design-system.service.gov.uk/

[5] Similar system for the US govt.

https://designsystem.digital.gov/

[6] Mozilla MDN learn accessibility: