3
4 Comments

Most accurate ML Transcription API?

I'm building something that involves some speech-to-text transcription; so far I've implemented it with Google Cloud, but I'm pretty underwhelmed with the accuracy (50% or so from a Zoom call).

Anyone have experience with other offerings and some suggestions about what to try next?

Which ML-based speech-to-text transcription service is most accurate?
  1. Google Cloud
  2. AWS
  3. Microsoft Azure
  4. Symbl.ai
  5. Other
Vote
  1. 1

    I did a project that transcribed voicemails using AWS Transcribe. The results were far from stellar so I made the transcripts editable to allow users to fix transcription errors.

    1. 1

      How was the speaker diarization?

      What was your source audio like? And what would you estimate accuracy % to be?

      Thank you!

      1. 1

        It was only a single speaker in each voicemail. The input was a voicemail so the quality was phone call quality. Accuracy was pry ~60%.

        I don't think the tech is there yet so you're either gonna need to pay for human-assisted transcriptions or add some kind of self-editing interface. Or just settle for 60% accuracy.

        1. 1

          Right on, thanks again!

Trending on Indie Hackers
Getting first 908 Paid Signups by Spending $353 ONLY. 24 comments I talked to 8 SaaS founders, these are the most common SaaS tools they use 20 comments What are your cold outreach conversion rates? Top 3 Metrics And Benchmarks To Track 19 comments How I Sourced 60% of Customers From Linkedin, Organically 12 comments Hero Section Copywriting Framework that Converts 3x 12 comments Join our AI video tool demo, get a cool video back! 12 comments