Data Extraction Accuracy Issues from Documents due to Image Orientation and OCR

Vitaly_Pasternak · July 11, 2025, 12:31pm

I’m encountering recurring errors during structured data extraction when the source document images have incorrect orientation (skewed or rotated).

These errors are not related to LLM logic or prompt instructions. When I manually align the images to the correct orientation before processing, the errors disappear.

This suggests that the core issue lies in the image pre-processing and/or OCR stage, rather than the LLM’s text interpretation. The LLM model receives text that is already distorted or incorrectly structured by the OCR, making accurate data extraction impossible, even with detailed instructions in the prompt.

I’d prefer not to integrate a third-party OCR service/library before interacting with the API. Is this something Gemini developers can address?
I am currently using gemini-2.5-flash.

Topic		Replies	Views
Inconsistent Data Extraction and Skipped Content Using Gemini API Models Gemini API gemini-15 , api , models	1	95	June 12, 2025
Invoice extractor using gemini pro Gemini API	2	199	May 28, 2024
Gemini 2.0 flash - 1.5 pro Struggles with Basic Task Execution Gemini API gemini-15 , api , models	1	87	May 19, 2025
Is Gemini 1.5 Pro OCR better than 2.0 Flash OCR? Gemini API api , gemini-flash	4	228	May 10, 2025
Issue with Extracting Data from scanned image PDF in Gemini AI – Checkbox Responses Not Reading Correctly Google AI Studio ai	2	147	June 3, 2025

Data Extraction Accuracy Issues from Documents due to Image Orientation and OCR

Related topics