Any chance of a model with a 2M-token context window again?

Although a 1M-token context window is enough for most projects, analyses of large files can benefit greatly from a larger one.

For example, we ran a pilot summarizing old psychological/medical records. The usual workflow is to feed the model one record (from the same patient) at a time, and then do the manipulation and processing at the database/agent/embeddings level. But for feeding a full patient file into the model for summarization, analysis, or background information, 1M is not enough (I find the average file runs 1.5M–1.8M tokens).
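To make the one-record-at-a-time workflow concrete, here is a minimal sketch of how a full patient file might be split into batches that each fit under a model's context limit. Everything here is hypothetical: the `estimate_tokens` heuristic (~4 characters per token) stands in for a real tokenizer, and the record sizes are made up for illustration.

```python
# Hypothetical sketch: greedily pack a patient's records into batches
# that each stay under a model's context window.

CONTEXT_LIMIT = 1_000_000  # e.g. a 1M-token model


def estimate_tokens(text: str) -> int:
    # Crude approximation: roughly 4 characters per token for English text.
    # A real pipeline would use the provider's tokenizer instead.
    return len(text) // 4


def batch_records(records: list[str], limit: int = CONTEXT_LIMIT) -> list[list[str]]:
    """Greedily pack records into batches whose total stays under `limit` tokens."""
    batches: list[list[str]] = []
    current: list[str] = []
    used = 0
    for rec in records:
        cost = estimate_tokens(rec)
        if current and used + cost > limit:
            batches.append(current)
            current, used = [], 0
        current.append(rec)
        used += cost
    if current:
        batches.append(current)
    return batches


# A ~1.5M-token file (ten records of ~150k tokens each) cannot fit in one
# 1M-token pass, so it ends up split across two batches:
records = ["x" * 600_000] * 10
print(len(batch_records(records)))  # → 2
```

The point of the sketch is the pain it illustrates: each batch needs its own model call, and the summaries then have to be merged downstream, whereas a 2M-token window would allow the whole file in a single pass.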

Gemini 2.0 Pro (IMO) had a 2M-token context window, but it's not available anymore.

In my opinion, a model with such a capability, even if it isn't the main one, could open up a wide variety of new use cases.

I would really appreciate it if the development teams looked into this.


Totally support this. And to add: some Grok and Llama versions today seem to have context windows of 2M tokens or even more.
