Hello
Should prompt caching impact output generation (as opposed to latency and cost)? If so, how?
I’ve been testing:
- gemini-1.5-pro-001 (with caching)
- gemini-1.5-pro (without caching)
on the same inputs. Is a large difference in outputs expected between the two, or is the large delta I’m seeing more likely due to the way I’ve implemented it?
More context:
- The difference in outputs appears when switching between gemini-1.5-pro and gemini-1.5-pro-001
- Prompt caching implementation: (1) I have many PDFs already parsed into JSONs, i.e. metadata plus the full text contents; (2) instead of passing this through the ‘contents’ param, I pass it all through the system_instruction param; (3) the input is one large prompt rather than several smaller ones, i.e. a single prompt containing the task, an explanation of the context given, the context itself, the desired output structure, and further instructions
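For reference, here’s a minimal sketch of the two packaging strategies described above. The helper names (`build_single_prompt`, `build_split_contents`) and the document values are made up for illustration; no SDK is called. The split version just mirrors the list-of-parts shape the Gemini `contents` param expects, while the single version concatenates everything into one string as you’d pass to `system_instruction`:

```python
import json

# Hypothetical parsed-PDF JSON (metadata + full text), stand-in for real data.
doc = {
    "metadata": {"title": "Example PDF", "pages": 3},
    "text": "Full parsed text of the PDF...",
}

TASK = "Summarise the document."
CONTEXT_NOTE = "The context below is a parsed PDF (metadata + full text)."
OUTPUT_SPEC = "Return a JSON object with keys 'summary' and 'citations'."

def build_single_prompt(doc: dict) -> str:
    """Everything joined into one big string (the system_instruction approach)."""
    return "\n\n".join([TASK, CONTEXT_NOTE, json.dumps(doc), OUTPUT_SPEC])

def build_split_contents(doc: dict) -> list:
    """Instructions and context kept as separate parts (the contents approach)."""
    return [
        {"role": "user", "parts": [TASK, CONTEXT_NOTE]},
        {"role": "user", "parts": [json.dumps(doc)]},
        {"role": "user", "parts": [OUTPUT_SPEC]},
    ]

single = build_single_prompt(doc)
split = build_split_contents(doc)
```

Either way the model sees the same text; the difference is whether the structure (roles, part boundaries) is preserved for the API or flattened into one blob.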
More questions:
Q1: What’s the difference (if any) between passing the context through the ‘contents’ param vs the system_instruction param?
Q2: What’s the difference (if any) between splitting the prompt (i.e. instructions) and context (i.e. the different files) into separate parts vs passing it all together in one variable?
Thank you!