I have very specific data, unique to the countries and regions I'm processing, and I made a mistake by including some empty strings when generating my JSONL files for batch processing. I'm also using structured responses with a JSON schema per request.
As an example:
JSONL line one might be something like
Prompt: “Do XYZ on this data {data inserted here}”
Schema: {…my data schema}
JSONL line two, with the empty string, might be
Prompt: "Do XYZ on this data {""}"
Schema: {…my data schema}
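Concretely, I believe each batch line ends up looking roughly like this (heavily simplified: the custom_id values, model field, and overall request shape are my assumptions about the standard Batch API format, the schema is elided, and line two's content is just the prompt with nothing interpolated):

```jsonl
{"custom_id": "line-1", "method": "POST", "url": "/v1/chat/completions", "body": {"model": "...", "messages": [{"role": "user", "content": "Do XYZ on this data {data inserted here}"}], "response_format": {"type": "json_schema", "json_schema": "...my data schema"}}}
{"custom_id": "line-2", "method": "POST", "url": "/v1/chat/completions", "body": {"model": "...", "messages": [{"role": "user", "content": "Do XYZ on this data "}], "response_format": {"type": "json_schema", "json_schema": "...my data schema"}}}
```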
In the responses I get back, the key for line one has the data you would expect given the input, while the key for line two (the empty string) contains a hallucination that looks like completely legitimate data for that specific country.
For instance, I'm working with data from Brazil, so all the names of people in the data are common in Brazil. If I pass in an empty string, I would never have expected it to hallucinate names and even legitimate locations in Brazil, since there should be no context about Brazil when I literally passed in nothing about it. The only way this makes sense to me is if the bulk data is being processed in a manner where the individual requests share memory.
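For now, the only guard I can think of on my end is to validate each record and skip blanks when generating the file. A minimal sketch of the idea in Python (assuming the standard Batch API request shape; write_batch_file, MODEL_NAME, and the custom_id format are placeholders, not my real pipeline):

```python
import json

def write_batch_file(records, path):
    # Skip records that are empty or whitespace-only so no "blank" request
    # ever reaches the batch; return the indices I skipped so I can audit them.
    skipped = []
    with open(path, "w", encoding="utf-8") as f:
        for i, data in enumerate(records):
            if not (data or "").strip():
                skipped.append(i)
                continue
            line = {
                "custom_id": f"line-{i}",
                "method": "POST",
                "url": "/v1/chat/completions",
                "body": {
                    "model": "MODEL_NAME",  # placeholder
                    "messages": [
                        {"role": "user", "content": f"Do XYZ on this data {data}"}
                    ],
                    # response_format with my JSON schema would go here
                },
            }
            f.write(json.dumps(line, ensure_ascii=False) + "\n")
    return skipped
```

Returning the skipped indices lets me re-source or drop those records explicitly instead of silently losing them.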
But that only prevents bad inputs going forward. Is there any other way I can mitigate this?