I would like to create annotated image datasets to use on Ultralytics HUB.
Also, I would like to make some apps to use with the Stream Realtime feature.
We can all achieve the ASI-Godsend sooner like this.
This is available today. See the “live” examples in the cookbook.
I’m not sure which feature in AI Studio you think will accomplish this.
Most, if not all, of the code required to get the bounding boxes is in this cookbook: Google Colab
I didn’t check whether the output matches what Ultralytics datasets expect. Note that the Gemini box_2d format puts the y coordinates first: “Just be careful, the y coordinates are first, x ones afterwards contrary to common usage” (quoting directly from the cookbook). Your app can flip the coordinates if necessary.
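As a sketch of that flip: the cookbook’s box_2d values are ordered [y_min, x_min, y_max, x_max] and normalized to 0–1000, while Ultralytics/YOLO label files expect `class_id x_center y_center width height` normalized to 0–1. The conversion below assumes that layout; the class_id mapping is a placeholder you’d supply yourself.

```python
def box_2d_to_yolo(box_2d, class_id):
    """Convert a Gemini-style [y_min, x_min, y_max, x_max] box (0-1000 scale)
    into a YOLO-format label line (all values normalized to 0-1)."""
    y_min, x_min, y_max, x_max = (v / 1000 for v in box_2d)  # swap to x-first
    x_center = (x_min + x_max) / 2
    y_center = (y_min + y_max) / 2
    width = x_max - x_min
    height = y_max - y_min
    return f"{class_id} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}"

# Example: a box spanning y 100-300, x 200-600 on the 0-1000 grid
print(box_2d_to_yolo([100, 200, 300, 600], 0))
# → "0 0.400000 0.200000 0.400000 0.200000"
```

Double-check the scale and ordering against the cookbook output before trusting this for a real dataset.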
I’ll update my answer: the code you would need to automatically classify objects in, say, 500 images is in fact in the cookbook; you would only need to supply an outer loop. The model isn’t there yet, though. It does OK on images with few objects in them. The recommended system instruction limits the number of bounding boxes to 25; I think a better limit is about 10. Give Gemini 2.0 Flash Experimental too many objects, and the quality degrades.
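The “outer loop” I mean is nothing more than iterating the cookbook’s per-image call over a folder. A minimal sketch, where `get_boxes` is a hypothetical stand-in for the cookbook code that sends one image to the model and parses the returned boxes:

```python
from pathlib import Path

def get_boxes(image_path):
    # Placeholder for the cookbook's per-image Gemini call: send the image,
    # parse the JSON box_2d list from the response. Stubbed so the loop runs.
    return []

def annotate_folder(folder, pattern="*.jpg"):
    """Run the per-image annotation over every image in a folder."""
    results = {}
    for image_path in sorted(Path(folder).glob(pattern)):
        results[image_path.name] = get_boxes(image_path)
    return results
```

For 500 images you would also want basic rate limiting and retries around the model call, which the sketch omits.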
A picture is worth a thousand words, they say. This is what the model came up with:
There are 13 cupcakes in the sample image. The model generated 12 bounding boxes with labeled descriptions, so it missed one. One bounding box does not enclose the object it is supposed to represent at all (the googly-eyed cupcake in the bottom row); it drifted off to the side. Another only partially encloses its object and is half off. That leaves 9 or 10 reasonably good bounding boxes. So you would either need to regenerate until you get a good set (which means manual supervision) or accept that your training data will contain inaccurate entries.