Latency in web search

Albano_Lucas · October 30, 2025, 3:24am

Good afternoon. I’m having trouble with my project: I can’t reduce the latency of web searches. I’m currently using the Gemini 2.5 Flash model. The system is built with LangGraph: an orchestrator agent routes decisions and tool usage, and a final node synthesizes the response. Could someone suggest techniques or methods to reduce latency?

Pannaga_J · November 21, 2025, 6:20am

Hi @Albano_Lucas, apologies for the late response.
Could you please try the following approaches? If you are already using any of them, let me know whether they helped.

1. Use Gemini’s Grounding with Google Search, or make sure any external search tool runs asynchronously so that network delays don’t block the main process.

2. Move all static system instructions to the top of your prompt. Gemini 2.5 models cache repeated prompt prefixes implicitly, so subsequent requests that share the same long prefix can be processed faster.

3. Configure LangGraph to execute multiple searches or tasks in parallel branches instead of running them sequentially. This can reduce the overall processing time.

4. In the UI, stream status updates (like “Searching the web…”) to manage user expectations and reduce the perceived wait.

5. Use examples (few-shot prompting) instead of complex instructions for the final answer generation, so the model can produce the summary faster with less “thinking” time.
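
As a rough illustration of points 1, 3 and 4, here is a minimal Python sketch that runs several searches concurrently and emits status messages while waiting. It uses only `asyncio`; `fake_search` is a hypothetical stand-in for whatever search tool you actually call, and `search_all` and `on_status` are names made up for this example. It is not LangGraph or Gemini API code, just the general pattern.

```python
import asyncio
import time


async def fake_search(query: str, delay: float = 0.2) -> str:
    """Hypothetical stand-in for a real search call; sleeps to mimic network latency."""
    await asyncio.sleep(delay)
    return f"results for {query!r}"


async def search_all(queries, on_status=print):
    """Run all searches concurrently and report progress through on_status."""
    on_status(f"Searching the web ({len(queries)} queries)...")
    # gather() schedules every coroutine at once, so the total wait is roughly
    # the slowest single search rather than the sum of all of them.
    results = await asyncio.gather(*(fake_search(q) for q in queries))
    on_status("Synthesizing answer...")
    return dict(zip(queries, results))


def run_demo():
    """Time three concurrent searches (assumed example queries)."""
    queries = ["gemini latency", "langgraph parallel branches", "context caching"]
    start = time.perf_counter()
    results = asyncio.run(search_all(queries))
    return results, time.perf_counter() - start
```

Calling `run_demo()` should take about as long as one simulated search instead of three. In a real app, `on_status` could push the messages to your UI stream, and the same fan-out idea would map to parallel branches in your graph.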

Thanks
