Hi all, I’m trying to implement screen sharing using the Live API in an Electron app I’m building. I’m using Python as a backend and JavaScript on the front end. I’m essentially following the example code on the Google AI Studio website, in the streaming section: https://aistudio.google.com/live
This is the specific bit:
```python
def _get_screen(self):
    sct = mss.mss()
    monitor = sct.monitors[0]
    i = sct.grab(monitor)

    mime_type = "image/jpeg"
    # mss gives raw pixels; serialize to PNG first, then re-encode as JPEG below
    image_bytes = mss.tools.to_png(i.rgb, i.size)
    img = PIL.Image.open(io.BytesIO(image_bytes))
    img.thumbnail([1024, 1024])

    image_io = io.BytesIO()
    img.save(image_io, format="jpeg")
    image_io.seek(0)
    image_bytes = image_io.read()

    return {"mime_type": mime_type, "data": base64.b64encode(image_bytes).decode()}

async def get_screen(self):
    while True:
        frame = await asyncio.to_thread(self._get_screen)
        if frame is None:
            break
        await asyncio.sleep(1.0)
        await self.out_queue.put(frame)
```
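For context, `get_screen` just feeds `self.out_queue`, which a separate send task drains, as in the cookbook example. Here is a stripped-down, stdlib-only sketch of that producer/queue shape, with a hypothetical `fake_grab` standing in for the mss/PIL capture:

```python
import asyncio
import base64

def fake_grab():
    # Hypothetical stand-in for the mss/PIL capture; returns the same dict
    # shape as _get_screen: a mime type plus base64-encoded image bytes.
    jpeg_soi = b"\xff\xd8\xff"  # JPEG Start-Of-Image marker
    return {"mime_type": "image/jpeg", "data": base64.b64encode(jpeg_soi).decode()}

async def capture(out_queue, n_frames):
    # Same pattern as get_screen: grab off the event loop thread, then enqueue.
    for _ in range(n_frames):
        frame = await asyncio.to_thread(fake_grab)
        await out_queue.put(frame)

async def main():
    out_queue = asyncio.Queue()
    await capture(out_queue, 3)
    return [out_queue.get_nowait() for _ in range(3)]

frames = asyncio.run(main())
print(len(frames), frames[0]["mime_type"])  # → 3 image/jpeg
```

Note the `data` field is a base64 `str`, not raw bytes, which matches the `{"mime_type": ..., "data": ...}` dict the real code builds.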
I’m getting the following error:
```
+---------------- 1 ----------------
| Traceback (most recent call last):
|   File "/Users/…/Documents/app/widget/backend/multimedia_service.py", line 241, in receive_audio
|     async for response in turn:
|   File "/opt/anaconda3/envs/widget-python/lib/python3.11/site-packages/google/genai/live.py", line 416, in receive
|     while result := await self._receive():
|                     ^^^^^^^^^^^^^^^^^^^^^
|   File "/opt/anaconda3/envs/widget-python/lib/python3.11/site-packages/google/genai/live.py", line 497, in _receive
|     raw_response = await self._ws.recv(decode=False)
|                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|   File "/opt/anaconda3/envs/widget-python/lib/python3.11/site-packages/websockets/asyncio/connection.py", line 322, in recv
|     raise self.protocol.close_exc from self.recv_exc
| websockets.exceptions.ConnectionClosedError: received 1007 (invalid frame payload data) Request contains an invalid argument.; then sent 1007 (invalid frame payload data) Request contains an invalid argument.
```
I have now narrowed the error to:
```
ValueError('Unsupported input type "<class 'google.genai.types.Content'>" or input content "parts=[Part(video_metadata=None, thought=None, inline_data=Blob(display_name=None, data=b'/9j/4AAQSkZJRgABAQAAAQABAAD/2wBDAAY…
```
It seems that Gemini does not recognize how the images are encoded and is closing the websocket. What could be causing this? What am I missing with regard to how the images should be encoded?
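For what it’s worth, the `b'/9j/'` prefix in that blob is just base64 for the JPEG Start-Of-Image marker, so the image bytes themselves appear to be well-formed JPEG. A quick stdlib check:

```python
import base64

# The blob in the error starts with b'/9j/'; decoding it yields the JPEG
# Start-Of-Image marker 0xFFD8, so the image data itself looks valid.
prefix = base64.b64decode(b"/9j/")
assert prefix.startswith(b"\xff\xd8")
print(prefix.hex())  # → ffd8ff
```

So my current suspicion is that the problem is in how the frame is wrapped (the `Content` type mentioned in the `ValueError`) rather than in the JPEG encoding itself, but I’d appreciate confirmation.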