MEDASR: Audio Length

kamalkraj · February 13, 2026, 3:10pm

What is the recommended audio length for MedASR transcription? Or is it recommended to use a 20 s sliding window with a 2 s overlap for all audio?

Nireeksha_K_A · February 19, 2026, 10:53am

Hi @kamalkraj ,
Yes, that’s a solid approach. It is generally recommended to split long-form audio into ~20-second chunks with a 2-second overlap (15–20 s chunks with a 2–3 s stride also work well).
The overlap ensures words spanning chunk boundaries are captured fully and helps maintain transcription continuity across segments.
Also ensure the audio is resampled to 16 kHz mono before processing.
Thank you!
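
For anyone who wants to see the windowing spelled out, here is a minimal sketch of how the chunk boundaries could be computed for a 20 s window with a 2 s overlap at 16 kHz. The function name `chunk_bounds` and its parameters are made up for illustration and are not part of MedASR; the sketch assumes the audio has already been resampled to 16 kHz mono, and it does not cover merging the transcripts in the overlapping regions.

```python
def chunk_bounds(num_samples, sample_rate=16_000, window_s=20.0, overlap_s=2.0):
    """Return (start, end) sample indices of overlapping chunks.

    Hypothetical helper: consecutive chunks share `overlap_s` seconds of
    audio, and the last chunk is shortened to end at the final sample.
    """
    window = int(window_s * sample_rate)
    overlap = int(overlap_s * sample_rate)
    if window <= 0 or not 0 <= overlap < window:
        raise ValueError("need window > 0 and 0 <= overlap < window")

    hop = window - overlap  # distance between consecutive chunk starts
    bounds = []
    start = 0
    while start < num_samples:
        end = min(start + window, num_samples)
        bounds.append((start, end))
        if end == num_samples:
            break  # reached the end of the audio
        start += hop
    return bounds


# 50 s of 16 kHz audio -> three chunks, each sharing 2 s with its neighbour
print(chunk_bounds(50 * 16_000))
# [(0, 320000), (288000, 608000), (576000, 800000)]
```

Each `(start, end)` pair can then be used to slice the sample array before passing the chunk to the model. Audio shorter than one window simply yields a single chunk.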
