Hitting input token limits that are way lower than advertised in Gemini 2.0

Hey folks,

I’m trying to send about 40k tokens, far less than the 1M permitted for Gemini 2.0, and it seems to break with the following websocket exception:

E       websockets.exceptions.ConnectionClosedError: received 1007 (invalid frame payload data) Request trace id: ffa37544583b21f9, [ORIGINAL ERROR] generic::invalid_argument: Input request contains (44599) tokens, whic; then sent 1007 (invalid frame payload data) Request trace id: ffa37544583b21f9, [ORIGINAL ERROR] generic::invalid_argument: Input request contains (44599) tokens, whic

The code works: with the same logic and less content, I get exactly what I want.

The logic is fairly simple: it expects audio output, which we then stream and play as needed. Code below. Any thoughts as to why this is failing?

import numpy as np
import sounddevice as sd
from google import genai

config = genai.types.LiveConnectConfig(
    response_modalities=["AUDIO"],
    system_instruction=genai.types.Content(
        parts=[genai.types.Part(text=system_prompt)]
    ),
    generation_config=genai.types.GenerationConfig(
        temperature=self.settings.genai_model_temperature,
        max_output_tokens=8192,
    ),
    speech_config=genai.types.SpeechConfig(
        voice_config=genai.types.VoiceConfig(
            prebuilt_voice_config=genai.types.PrebuiltVoiceConfig(
                voice_name=VOICES[0]
            ),
        ),
    ),
)

async with self.client.aio.live.connect(
    model=self.settings.genai_model_name, config=config
) as session:
    # Send the whole prompt as a single turn, then collect the streamed audio chunks.
    await session.send(combined_prompt, end_of_turn=True)
    audio_data = []

    async for response in session.receive():
        if not response.server_content.turn_complete:
            for part in response.server_content.model_turn.parts:
                if part.inline_data and part.inline_data.data:
                    audio_data.append(
                        np.frombuffer(part.inline_data.data, dtype="int16")
                    )


# Play back the concatenated audio (24 kHz mono, 16-bit PCM).
with sd.OutputStream(samplerate=24000, channels=1, dtype="int16") as stream:
    stream.write(np.concatenate(audio_data))
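
For reference, a quick way to sanity-check the size of combined_prompt before connecting. This is only a rough sketch using the standard count_tokens endpoint; I'm assuming the live API tokenizes the input in roughly the same way, which may not be exact:

# Rough sketch: measure combined_prompt with the non-live count_tokens endpoint
# before connecting. Assumes the live API tokenizes similarly (unverified).
token_check = self.client.models.count_tokens(
    model=self.settings.genai_model_name,
    contents=combined_prompt,
)
print(f"combined_prompt is roughly {token_check.total_tokens} tokens")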


The goal is to improve the audio for CustomPod, as the audio Gemini 2.0 produces is incredible.

Audio generation is only available for “early access”, as noted in the experimental model details.

Regardless, have you been able to run the example notebook?

Yes, the notebook works, and the code above does as well for a smaller input size.

It’s only when I send it a larger amount of content that I get that error message, consistently.

Is that part of the early access limitations?

received 1007 (invalid frame payload data) Request trace id: fc1a2c4181400b5d, [ORIGINAL ERROR] generic::invalid_argument: Input request contains (95200) tokens, whic; then sent 1007 (invalid frame payload data) Request trace id: fc1a2c4181400b5d, [ORIGINAL ERROR] generic::invalid_argument: Input request contains (95200) tokens, whic

I get this same error, but the website mentions that:

The following rate limits apply:

  • 3 concurrent sessions per API key
  • 4M tokens per minute

Is there any workaround?
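
The only thing I can think of trying is splitting the prompt across several send() calls inside one turn, marking only the last one as end of turn. I have no idea whether the limit applies per message or to the accumulated turn, so this is just a rough sketch (chunk_chars is an arbitrary size I picked, not tied to any documented limit):

# Rough, untested sketch: send the prompt in pieces within a single turn,
# with end_of_turn=True only on the last piece. Assumes the `session` and
# `combined_prompt` from the code earlier in the thread.
chunk_chars = 20_000
chunks = [
    combined_prompt[i : i + chunk_chars]
    for i in range(0, len(combined_prompt), chunk_chars)
]
for i, chunk in enumerate(chunks):
    await session.send(chunk, end_of_turn=(i == len(chunks) - 1))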
