I am using the Gemini API to generate transcripts for publicly available audio (not under copyright). Up until this point I have had no issues across a large dataset. However, for one particular audio file I keep getting the RECITATION finish reason, even after multiple retries. If the audio has already been transcribed elsewhere on the internet, a similar transcription may well appear in the training set, but it does not seem logical for the model to keep rejecting the output on that basis.
Is anybody else having a similar issue? And for anyone from Google: is there a best practice for communicating to the model that this is a common audio file?
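
For context, here is a minimal sketch of the retry pattern described above. It is not the actual code from this project: `generate` is a hypothetical callable standing in for the Gemini API request, and it is assumed to return the finish reason as a plain string together with the transcript text (or `None` when nothing was returned). The only detail taken from the API itself is the finish reason name `RECITATION`.

```python
from typing import Callable, List, Optional, Tuple

# Finish reason name reported by the Gemini API when output is blocked for recitation.
RECITATION = "RECITATION"


def transcribe_with_retries(
    generate: Callable[[], Tuple[str, Optional[str]]],  # hypothetical wrapper around the API call
    max_attempts: int = 3,
) -> Tuple[Optional[str], List[str]]:
    """Call `generate` up to `max_attempts` times.

    Returns (transcript, finish_reasons_seen). The transcript is None if
    every attempt finished with RECITATION or returned no text.
    """
    reasons: List[str] = []
    for _ in range(max_attempts):
        finish_reason, text = generate()
        reasons.append(finish_reason)
        # Accept the first attempt that was not blocked and actually has text.
        if finish_reason != RECITATION and text:
            return text, reasons
    return None, reasons
```

With the problematic file, every attempt comes back blocked, so a loop like this ends with no transcript and a list of `RECITATION` reasons only. Logging that list per file at least makes it easy to see which inputs are consistently affected.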