I've been using whisper to get transcripts from my local radio stations. I know it's out of scope for the original project but I hope someone can build a streaming input around it in the future. I currently pipe in and save 10 minute chunks that get sent off for processing.

whisper.cpp (https://github.com/ggerganov/whisper.cpp) supports streaming!