I found that when asking to transcribe noisy audio this finish reason often comes up if temperature =0. Upon closer inspection it seems to interpret repeated characters like “na na na…” or “a a a a…” that go for too long as “Recitation”! Obviously this kind of repeated pattern is in the training data verbatim many times.
My solution was to increase the temperature to 1 for transcriptions.
HTH someone.