Need text coordinates of the extracted value from the Original document uploaded

OrangiaNebula · September 11, 2024, 6:27am

Welcome to the forum.

If I understood correctly, you are supplying the document as an image and you want the model to answer with a precise bounding box for the text (the model properly identifies and returns the text, but the bounding box it returns is not precise enough). If that is the case, then also check for any answers to this post - Bounding Box Alignment Problems and Image Rescaling

Hope that helps.

Topic		Replies	Views
Issues with the Accuracy of Object Coordinates Detected by Gemini 1.5 in Images Gemini API gemini-15	6	328	June 10, 2024
Bounding Box for text in a document (Flash 2.0) Gemini API models	1	94	December 30, 2024
How to optimize graphic coordinates General Discussion models , android , tflite , help_request , java	7	1542	September 15, 2021
Help on calculating bounding vox General Discussion models , help_request	1	401	April 4, 2023
Bounding Box Alignment Problems and Image Rescaling Gemini API vision	0	391	September 10, 2024

Need text coordinates of the extracted value from the Original document uploaded

Related topics