Speech to text hangs (infintie load)

anonymous · January 22, 2023, 3:06pm

I uploaded a mp3 file (1.07 min) to transcribe using the video model, and the transcription is stuck, no errors, just infinite loading

This is the file https://drive.google.com/file/d/1QLucfAwJZXxSAOKSIWoPtIbljSM9pvsl/view?usp=sharing

UPDATE: It failed with “Error running recognize request. Too many retries, giving up.”

Poala_Tenorio · January 23, 2023, 10:47pm

May I know the documentation you are using in transcribing your audio?

anonymous · January 24, 2023, 9:52am

In the case above I’m sending the file to transcription using the UI in google cloud console

But I did it after noticing the issue by sending the same file using the Node.js api, following the docs here: https://cloud.google.com/speech-to-text/docs

Poala_Tenorio · January 24, 2023, 8:43pm

I was able to fetch the transcription in JSON format using gcloud CLI. I used Asynchronous Speech Recognition for transcribing an audio file that is longer than a minute. But I converted your audio file from Mp3 to FLAC since the process can’t be completed(based on my replication) when I used Mp3.

These are the commands that I used:

gcloud ml speech recognize-long-running ‘gs://bucket-name/audio.flac’ --language-code=‘en-US’ --async --audio-channel-count=2 --separate-channel-recognition> > gcloud ml speech operations describe [name]

This is a snippet of my output:
[upload|U3SRL4Nam0rC33oE49ZTuw==]

anonymous · January 25, 2023, 8:34am

So the issue is the file is an mp3 and not flac? should I also re-encode the file?

if it just a container change, then it is weird that mp3 fails and flac doesn’t because it is the same encoded data inside

also, I can’t access the output link you shared, it requires a google employee account

anonymous · June 13, 2023, 10:06am

@Poala_Tenorio it happens again, I sent a transcription long process and didn’t get any results (should be in a bucket)

this is the process id I received from the api: 7346104001135815711

is there a way I can check the status of this id?

anonymous · June 13, 2023, 10:14am

I manged to get the latest status of the process 7346104001135815711

received: code 13, Too many retries, giving up.

any way to see why is this happening?

Topic		Replies	Views
1 hour audio file transcription task running for more than 6 hours AI APIs speech-to-text	1	14	January 25, 2024
Calling speech-to-text suddenly giving me bad transcripts ( starting 2022-Dec-1) AI APIs speech-to-text	2	8	December 10, 2022
Understanding Processing Queue and Max Processing Time for Asynchronous Speech-to-Text in GCP AI APIs speech-to-text	1	33	September 5, 2024

Speech to text hangs (infintie load)

AI Suggested topics