Gemini Fine-Tuned Model - Document Processing Error and Capability Inquiry

zak · April 3, 2025, 10:22am

Hello,

I am encountering an error when prompting a fine-tuned model with a PDF for analysis, both in AI Studio and using the Gemini API. The entire chat session ceases to function after I prompt it with a document.

I would like to know if a Gemini fine-tuned model has document understanding capabilities. I have been unable to find this information in the official documentation.

Please advise.

Thank you,
Zak

Govind_Keshari · April 3, 2025, 10:34am

Hi @zak,

Thanks for flagging. I am getting the same error in AI Studio. I will get back to you on this.

Govind_Keshari · April 4, 2025, 9:48am

Hey @zak,

Currently, tuning only supports text input-output pairs. Once tuned, the model is text-in, text-out. A tuned model doesn’t support image input. A PDF may contain images, so in effect PDFs are not supported either, which is why you are getting this error.

Thanks.

zak · April 6, 2025, 3:40pm

Hello @Govind_Keshari,

Thank you for the clarification. I understand that the fine-tuned Gemini model only supports text input-output pairs.

Using AI Studio and the Gemini API, I have noticed the lack of support for a “system prompt” with fine-tuned models. While the documentation (Gemini Model Tuning Limitations) mentions the absence of chat-style multi-turn conversations, it doesn’t explicitly address the use of a system prompt to guide the model’s behavior.

The absence of a system prompt would prevent the fine-tuned model from effectively utilizing RAG (Retrieval-Augmented Generation) or processing text extracted from PDF documents, as these techniques typically rely on providing contextual instructions via a system prompt.

Could you please confirm whether system prompts are indeed unsupported for fine-tuned Gemini models? If so, are there any alternative strategies recommended for incorporating contextual information or instructions when using RAG or PDF-derived text with a fine-tuned model?

Thanks,
Zak

Govind_Keshari · April 7, 2025, 6:16am

Hey @zak,

Unfortunately, system instructions are also not supported with tuned models. This is not mentioned in the docs either. Here are some of the limitations.
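
One possible workaround, offered as an assumption rather than something confirmed in this thread: since a tuned model accepts only a single text input, the instructions and any retrieved or PDF-extracted text can be folded into that one prompt string. The sketch below is plain Python and makes no API calls; `build_prompt`, its parameters, and the section labels are hypothetical names chosen for illustration, and the PDF text is assumed to have been extracted beforehand with a separate tool.

```python
def build_prompt(instructions: str, context_chunks: list[str], question: str) -> str:
    """Fold instructions, context, and the question into one plain-text prompt.

    Hypothetical helper: the section labels are an illustrative layout,
    not a format required by the Gemini API.
    """
    # Drop empty chunks first so the context numbering has no gaps.
    chunks = [c.strip() for c in context_chunks if c.strip()]
    context = "\n\n".join(
        f"[Context {i}]\n{chunk}" for i, chunk in enumerate(chunks, start=1)
    )
    sections = [
        f"Instructions:\n{instructions.strip()}",
        f"Context:\n{context}" if context else "",
        f"Question:\n{question.strip()}",
    ]
    # Skip the context section entirely when nothing was retrieved.
    return "\n\n".join(s for s in sections if s)


prompt = build_prompt(
    "Answer only from the context.",
    ["Text extracted from page 1.", "Text extracted from page 2."],
    "What does page 2 say?",
)
print(prompt)
```

The resulting string would then be sent as the only text input to the tuned model. Whether the model follows inlined instructions reliably is untested here; it may help if the tuning examples used the same layout.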
