Context Window & Learning

I have a database of information that I'd like Gemini to use. When I send requests to Gemini, it draws on this database to formulate a response. This works well; however, after some time the context window fills up and the database information is truncated, which prevents Gemini from retrieving the necessary information. I then have to create a new chat and input the database again, which is tedious.

How can I get around this issue of the database information being truncated?

Hi @MarkEX, welcome to the forum.

Could you confirm whether you are passing the same database information in each subsequent call? Also, which model are you currently using? You might want to consider gemini-1.5-pro, which supports a context window of up to 2 million tokens; that could be beneficial for your use case.

In the meantime, you may also want to explore our Document Q&A with Vector Database Cookbook for a more scalable solution.
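
In case it helps, here is a minimal sketch of the retrieval idea behind that cookbook, using the google-generativeai Python SDK: embed the book once up front, then pull only the most relevant passages into each prompt so the context window stays small. The documents, question, and API key are placeholders.

```python
import numpy as np
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")  # replace with your key

# Placeholder entries standing in for your book of psychological terms.
documents = [
    "Cognitive dissonance: discomfort from holding conflicting beliefs.",
    "Confirmation bias: favouring information that confirms prior beliefs.",
    "Operant conditioning: learning driven by rewards and punishments.",
]

# Embed each document once, instead of resending the whole book per chat.
doc_embeddings = [
    genai.embed_content(
        model="models/text-embedding-004",
        content=doc,
        task_type="retrieval_document",
    )["embedding"]
    for doc in documents
]

def retrieve(query: str, top_k: int = 1) -> list[str]:
    """Return the top_k documents most similar to the query."""
    q = genai.embed_content(
        model="models/text-embedding-004",
        content=query,
        task_type="retrieval_query",
    )["embedding"]
    # Dot product as a similarity proxy over the embedding vectors.
    scores = np.array(doc_embeddings) @ np.array(q)
    return [documents[i] for i in np.argsort(scores)[::-1][:top_k]]

# Only the retrieved passage goes into the prompt, not the full database.
question = "What is confirmation bias?"
context = "\n".join(retrieve(question))
model = genai.GenerativeModel("gemini-1.5-pro")
response = model.generate_content(
    f"Answer using this context:\n{context}\n\nQuestion: {question}"
)
print(response.text)
```

With this pattern, the prompt size stays roughly constant no matter how large the book gets, which sidesteps the truncation problem entirely.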

Ok - thanks for the reply.

Each chat uses the same database information. The requests are dynamic, but they all relate specifically to the database, in this case a book of psychological terms.

What about Content Caching?

Yes, you can also use explicit context caching, which stores the book server-side so you don't re-send it with every request. Let me know if it solves your issue.
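
Here is a minimal sketch with the google-generativeai Python SDK, assuming your database lives in a single text file (the file name, display name, TTL, and API key are placeholders):

```python
import datetime
import google.generativeai as genai
from google.generativeai import caching

genai.configure(api_key="YOUR_API_KEY")  # replace with your key

# Read the database once; "book.txt" is a placeholder for your file.
with open("book.txt") as f:
    book_text = f.read()

# Cache the book server-side so it isn't re-sent with every request.
cache = caching.CachedContent.create(
    model="models/gemini-1.5-pro-001",  # caching requires a versioned model
    display_name="psych-terms-book",
    system_instruction=(
        "Answer questions using the cached book of psychological terms."
    ),
    contents=[book_text],
    ttl=datetime.timedelta(hours=1),  # how long the cache lives
)

# Build a model bound to the cache; each call only sends the new question.
model = genai.GenerativeModel.from_cached_content(cached_content=cache)
response = model.generate_content("Define cognitive dissonance.")
print(response.text)
```

Note that explicit caching has a minimum input size and a per-hour storage cost, so check the current pricing and limits in the docs before relying on it.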

Thanks