Bad process of delaings with PDFs

Oo_Fa · May 30, 2024, 9:29am

In the process of learning PDF files, AI does not handle many symbols well.
Some files are recognized by OCR and may contain some symbols and pictures.
AI will produce various misunderstandings during the recognition learning process.
For example, a Chinese music theory book.

Any progress to make it not a toy ?

bedros · June 3, 2024, 7:14am

Last time I checked, Gemini does not look through images in a PDF file and really just extracts the raw text content (in the case of AI Studio)

Oo_Fa · June 3, 2024, 2:48pm

I think so, and the problem is some OCR symbols couldn’t be extracted correctly, it may contain spaces, wrong line breaks. etc…
I guess working files processing is very needed if AI being man’s helper

Mrinal_Ghosh · September 9, 2025, 9:10am

Hi @Oo_Fa ,

Welcome to the Forum!!
Our apologies for the delayed response. We’d appreciate it if you could test this with the new Gemini models and report back whether the issue remains.

Topic		Replies	Views
Gemini 2.0 and PDF OCR Fine-tuning Google AI Studio ai-studio , fine-tuning , gemini-flash	1	419	June 12, 2025
Document learning? Gemini API	4	329	May 3, 2024
Lectura de documentos en pdf Google AI Studio ai-studio , feedback	1	95	June 20, 2025
Issue with Extracting Data from scanned image PDF in Gemini AI – Checkbox Responses Not Reading Correctly Google AI Studio ai	2	344	June 3, 2025
"The current model doesn't support files of this type." I'm getting error Google AI Studio model	1	2440	November 21, 2024

Bad process of delaings with PDFs

Related topics