Not getting desired result with speech-to-text API

FIT_LIFE · March 25, 2022, 2:16am

Howdy. Im trying to use speech-to-text API on an audio file that is 02:22 long and 1,4 MB.

Im using options like so:

const filename = "audio.mp3";
  const encoding = "mp3";
  const sampleRateHertz = 16000;
  const languageCode = "en-US";
  const model = "phone_call";

  const config = {
    encoding: encoding,
    sampleRateHertz: sampleRateHertz,
    languageCode: languageCode,
    model: model,
    enableAutomaticPunctuation: true,
  };

But my result is not that great tbh. Many words are very bad match of voice.
Anyone know any possible way to improve the speech-to-text?

Im trying to transcribe a audio file from a conversation between 2 people.
The general audio quality is great and clear.

Topic		Replies	Views
How to detect words in long audio file? General Discussion help_request	1	553	February 25, 2023
Fine-tuning speech to text model General Discussion datasets , help_request	1	1436	November 30, 2021
[Voice Recognition] How can I use the model? General Discussion models , help_request	1	661	October 6, 2021
Simple audio recognition: Recognizing keywords \| TensorFlow Core General Discussion models , help_request , tfcore	8	3687	December 28, 2022
Tflite accuracy decreased General Discussion models , datasets , help_request	5	1182	August 4, 2021

Not getting desired result with speech-to-text API

Related topics