Exploring Multi-Modal AI: Insights from Recent Tests on AI Studio

Hi,

Today, I tested multi-modal inputs on the model and have some observations and feedback to report.

The experiment involved testing AI Studio and the Gemini 1.5 Pro model's vision and logic capabilities.

Here’s how it looks:

The model works, but here are the things that immediately stood out:

  • The unsafe content warning appeared on every output with the default safety settings.
    [Screenshot: 2024-04-26, 11.44.51 AM]

  • Two out of three outputs were cut off mid-sentence as the model began to describe the animals. I couldn't find any visible setting for max output tokens, but the Get Code button reveals it's set to 8192.

  • In the last output, I initially thought some characters were incorrectly encoded, but it turned out the emojis simply aren't being rendered.

  • Towards the end, I also noticed that there's no way to add model messages, which means we can't try few-shot prompting.
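For reference, here's a minimal sketch of what the Get Code output corresponds to, written as a plain request body mirroring the Gemini `generateContent` REST API. The prompt text and the specific categories/thresholds shown are illustrative assumptions; only the 8192 token cap is taken from the Get Code button above.

```python
import json

# Hypothetical request body in the shape of the Gemini generateContent API.
# maxOutputTokens matches the 8192 value revealed by the Get Code button;
# relaxing the safety thresholds is one way to avoid the default warning.
request_body = {
    "contents": [
        {"role": "user", "parts": [{"text": "Describe the animals in this image."}]}
    ],
    "generationConfig": {
        "maxOutputTokens": 8192,  # cap on the response length (not user-adjustable in the UI)
    },
    "safetySettings": [
        # Illustrative: raise the block threshold for two categories.
        {"category": "HARM_CATEGORY_DANGEROUS_CONTENT", "threshold": "BLOCK_ONLY_HIGH"},
        {"category": "HARM_CATEGORY_HARASSMENT", "threshold": "BLOCK_ONLY_HIGH"},
    ],
}

print(json.dumps(request_body, indent=2))
```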
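To make the last point concrete, this is roughly what adding model messages would enable: a few-shot prompt is just an alternating list of `user` and `model` turns in the `contents` array, with the model's turns supplying the example answers. The animal/sound pairs here are made-up placeholders.

```python
# Sketch of a few-shot conversation in the Gemini API's "contents" format.
# The "model"-role turns are the piece AI Studio currently doesn't let us add.
few_shot_contents = [
    {"role": "user", "parts": [{"text": "Animal: cat"}]},
    {"role": "model", "parts": [{"text": "Sound: meow"}]},
    {"role": "user", "parts": [{"text": "Animal: dog"}]},
    {"role": "model", "parts": [{"text": "Sound: woof"}]},
    {"role": "user", "parts": [{"text": "Animal: cow"}]},  # the turn the model should complete
]

roles = [turn["role"] for turn in few_shot_contents]
print(roles)  # ['user', 'model', 'user', 'model', 'user']
```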


Flagged to the team. Thank you!
