Is there a way to use AI just to do something in especific?

rengow · January 7, 2025, 2:18pm

Hi. I will be short. I upload an invoice image and a prompt to Gemini with a script from Python and receive a JSON schema given in the prompt from that. How can I train the model in future on cases like, for example, “It’s not a “5”, it’s a “$” symbol” so I don’t need to change the prompt?.
Thank you!

OrangiaNebula · January 7, 2025, 6:44pm

Welcome to the forum.

One approach you might try is to give the model a few samples with the answer you expect to get in the prompt, before the image you want it to really process. So, your prompt would have a sequence of Part, with the last Part holding the image you want handled, and the model will complete the sequence by doing what it saw getting done two-three times before. The few-shot prompting technique is quite effective. Obviously, use samples that illustrate the edge cases you care about, such as the 5 vs $ case you described.

Hope that helps.

Topic		Replies	Views
Image Classification TensorFlow models	2	41	February 11, 2025
Training Gemini to write in a specific style Gemini API model-training	3	333	December 18, 2024
AI endpoint for Image generation which allows image+prompt -> image Gemini API api , gemini-api	6	187	October 7, 2024
How to improve gemini-1.5-flash output accuracy on images Gemini API gemini-15 , model	3	120	September 12, 2024
Creating Bot to Answer frequently ask questions and also able to do chitchat Gemini API gemini-15 , models	5	166	November 28, 2024

Is there a way to use AI just to do something in especific?

Related topics