Going along with this: What are the latest and greatest open source speech-to-text models and/or tools out there?
Would love to hear from experienced practitioners and a bit of detail on the experience.
Thanks HN community!
NVidia NeMo: https://github.com/NVIDIA/NeMo