I’ve been using Speech Note (github link) for months, but it often gets things wildly wrong.
I thought it was my mic, so I got one that’s crystal clear. I also tried a ton of different models; some are slower and some are faster, but their accuracy is usually pretty similar.
But I still spend a lot of time editing the results, and I wonder if there’s something I should be doing differently to get better output.
On other speech-to-text platforms (like FUTO Keyboard on Android), the results are fast and very accurate. I have a hard time believing that Speech Note can’t be as good.
Can any other users share their experience?
I haven’t used Speech Note, but I’ve been using Whisper with great success. I run it via Docker.