Handling Multiple PDF Files with Gemini API and Token Limit Issues

bykemalh · January 2, 2025, 8:54am

Hello,

I am currently using OpenAI GPT-4 model, where I upload over 100 PDF files to a vector database, and I can ask questions based on the content from those PDFs. However, when using the Gemini API, I face token limit errors and long response times after uploading the files. Is there a solution for this? Is it possible to efficiently process and query multiple PDF files with Gemini API, extracting information from them without hitting token limits or facing long delays?

I would appreciate any help or suggestions!

Susarla_Sai_Manoj · January 2, 2025, 9:29am

Hi @bykemalh

Welcome to Forum!
The Gemini API has certain limitations when it comes to uploading files. For detailed information, refer to this link.

Here are a couple of solutions that might help in your scenario:

Solution 1: Summarize the Files

You can summarize each PDF and upload the summaries of all 100 files to Gemini Pro, leveraging its 2-million token limit. This approach allows you to process large volumes of content while staying within the API constraints. Make Sure that the summaries retain the most critical information to prevent loss of context or key details.

Solution 2: Chunk the Files

If you don’t need to process all the files simultaneously or if the files are independent, you can divide them into smaller chunks. These chunks can then be processed individually using the Gemini API. This method helps manage token limits effectively.

Thanks

bykemalh · January 3, 2025, 7:52am

The first approach has a critical drawback regarding response time. When I send a large amount of data, the response time can stretch to 5 minutes or more, which is impractical.

The second approach, on the other hand, is not suitable for my needs. I’m developing a chatbot that must have complete knowledge of all company-related information, and this method doesn’t align well with that requirement.

What I aim to do is use OpenAI’s vector database to tune the model with company-specific PDFs. This will allow the chatbot to access company information quickly and efficiently.

Susarla_Sai_Manoj · January 9, 2025, 5:13am

@bykemalh

You can utilize the text-embeddings-004 model available in the Gemini API to store company-specific PDFs. Refer to this notebook for guidance on using text-embeddings-004 with ChromaDB.

Thanks

Topic		Replies	Views
Gemini API large PDF file upload limited tokens? Gemini API api , prompt	1	207	March 7, 2025
Gemini API responses slower than Gemini on web when files are in chat Gemini API api , gemini-flash , gemini-20	1	96	June 13, 2025
Understad token count Gemini API api , prompt	4	151	February 27, 2025
PDFs vs. Raw Text Efficiency - What's best? Google AI Studio gemini-api , model	1	143	June 10, 2025
Request payload size exceeds the limit: 50000 bytes Gemini API models	1	264	December 16, 2024

Handling Multiple PDF Files with Gemini API and Token Limit Issues

Related topics