The safety checks are a bit too strict

I use AI Studio to make an app to modify images of fictional characters. I put an image as input with a prompt, and the app generates new images based on that. It uses Gemini 2.5 Flash and I think Imagen 3, although Studio has decided on this.

Now, my problem is that I have an image of a woman with a black t-shirt and short jeans. She is in a neutral pose and definitely safe for work. And I use this prompt:

Strictly maintain the artistic style, medium, and visual texture of the original reference image. If the reference is a sketch, generate a sketch. If it is an oil painting, generate an oil painting. If it is a photograph, generate a photograph.
Generate a full body action shot of the fictional character “Sanne de Vries”.

Action & Pose:
Character clutching stomach, head thrown back in uncontrollable laughter.

Details:

  1. Keeping the exact same clothing as in the reference image.
  2. The background is clean and empty to focus on the pose.
  3. Maintain facial likeness and body type of the character “Sanne de Vries”.
  4. High quality, cinematic lighting, sharp focus.

(Not sure why Studio did leave 2 empty…)

The response is “finishReason”: “IMAGE_OTHER”, which is as clear as thick mud… But I was smart and added a button “Analyze” in my app so I can analyze the prompt and see why it failed…

1. Generalize the Identity (Most Important)

Stop using the name of a real-world public figure. If the character is fictional but shares a name with an athlete, the AI will prioritize its training data (the real person) over your instructions.

  • Change: Generate a full body action shot of the fictional character "Sanne de Vries".
  • To: Generate a full body action shot of the woman from the reference image. (The AI will rely on the uploaded image for consistency rather than a name search).

2. Rephrase the Pose

Describe the emotion and posture in a way that avoids keywords associated with pain or distress.

  • Change: Character clutching stomach, head thrown back in uncontrollable laughter.
  • To: The woman is doubled over in joyous laughter, hands resting on her midriff, with a wide exuberant smile.

Seriously, I use a fictional name that is also a very common name and the AI thinks it’s bad? And laughter is a form of distress? I would laugh about that but then I would be distressed. :smiley:

Anyways, I’ve told AI Studio to not mention the name, hoping it helps. (It doesn’t!) But I tend to have plenty of these errors in my app for the most innocent prompts. Even though all safety settings are reduced to “None”, it still causes this problem.

And no, some other prompts also cause problems. Character holding a slice of pizza, cheese stretching. is one sensitive thing, according to the AI analysis. It doesn't like Maintain facial likeness and body type of the character. either.

And a prompt fot the person eating with chopsticks triggers a warning because objects get too close to the mouth? Character eating noodles from a bowl using chopsticks, leaning slightly forward. is sensitive???

I just want to change images from one form to another using AI but these weird safety errors are making it near-impossible.

Hi @Katje ,

Thank you for flagging this issue. Could you provide us with the image you are using so we can try to replicate the issue?

1 Like

There are various images that cause these fails. But for this one, here it is:

This is an image created from another image from another image and finally from a photo I took of a friend. A bunch of transformations made a completely different person, but kept the shirt and pants. And the AI has to generate a character sheet showing her from four sides with these clothes in a neutral pose. And so far, it has failed every time I’ve tried.

The system instructions are:

Context: You are an AI assistant for a fiction author and comic book artist.
Subject Status: All names, faces, and characters referenced in these prompts are FICTIONAL CHARACTERS created solely for this artistic project. They are NOT real people.
Task: Generate consistent character reference images for this fictional story.
Subject Appearance: The fictional character must be depicted with at least some clothing to allow for clear visual study, unless a specific costume is requested. FACIAL FEATURES (Eyes, Nose, Mouth) ARE MANDATORY. Do not create faceless entities.
Style: Match the artistic medium and style of the provided reference image exactly.
Content Guidelines: Create an image suitable for a general audience. Focus on lighting and composition. Ensure the subject is fully clothed in the specified attire.

And the prompt itself is:

Strictly maintain the artistic style, medium, and visual texture of the original reference image. CRITICAL: The subject’s face MUST be fully visible and detailed. If the reference image obscures the face, you MUST generate clear, realistic eyes, a nose, and a mouth. Do not generate faceless figures. If the reference is a sketch, generate a sketch. If it is an oil painting, generate an oil painting. If it is a photograph, generate a photograph.
TASK: Generate a Character Turnaround Sheet.
INPUTS:

  • Reference 1 (Image 1): Character (Style & Identity Source).
  • Reference 2 (Image 2): Pose Template (Structure Source).
            INSTRUCTIONS:
            1. STYLE: Strictly copy the artistic style, rendering, lighting, and texture of Reference 1. 
               - If Reference 1 is a photo, output MUST be a PHOTO. 
               - If Reference 1 is 3D, output MUST be 3D.
               - CRITICAL: Do NOT copy the artistic style of the template image (Reference 2). It is a structural guide only. Do not generate a line drawing or sketch unless Reference 1 is a sketch.
            2. POSE: Apply the character from Reference 1 to the layout/poses in Reference 2 (Front, Left, Right, Back).
            3. CLOTHING: Clothing: MUST wear the EXACT SAME clothing as Reference 1.
            4. BACKGROUND: white.


And this results in: Error: Generation stopped: IMAGE_OTHER

My app has an “Analyze errors” button which helps find errors and it reports:

Error Analysis: IMAGE_OTHER

The error code IMAGE_OTHER is a generic safety or policy trigger used by Google’s Gemini models. It indicates that the model generated an image, but the output was blocked by an internal safety filter before it could be displayed.

While you have set HARM_CATEGORY thresholds to BLOCK_NONE, there are “hard” safety filters and “Recitation/Copyright” filters that cannot be overridden by API settings.

Probable Causes:

  1. Multi-Figure (Turnaround) Sensitivity: The prompt asks for a “Character Turnaround Sheet” (Front, Left, Right, Back). When the model generates multiple figures in one image, the probability of the AI accidentally rendering an “anatomically suggestive” shape or “skin-tone heavy” area increases. Safety filters are often more sensitive to images containing multiple bodies or repeating figures.
  2. Identity/Celebrity Filter: If “Reference 1” (the identity source) closely resembles a real-world celebrity, public figure, or copyrighted character, the IMAGE_OTHER filter will trigger to prevent deepfakes or IP infringement.
  3. Aggressive Facial Instructions: Your prompt uses highly forceful language (“CRITICAL,” “MUST,” “Do not generate faceless”). Paradoxically, when the model tries too hard to “fix” a face from a reference that doesn’t have one, it can produce “uncanny” or “distorted” results that trigger the Graphic/Disturbing content filter.
  4. Clothing Ambiguity: If the character in Reference 1 is wearing form-fitting clothing (like a superhero suit, swimwear, or a tank top), the turnaround sheet (which shows the character from the back and sides) often triggers “Sexually Explicit” filters, even if the subject is technically clothed.

Suggested Fixes

1. De-escalate the Prompt Language

Avoid using “CRITICAL” and all-caps instructions. Strong negative constraints (“Do not generate…”) can sometimes confuse the diffusion process, leading to the very artifacts that trigger the filter.

  • Change: “CRITICAL: The subject’s face MUST be fully visible.”
  • To: “Ensure the character has clear, detailed facial features including eyes, nose, and mouth.”

2. Simplify the Layout

Instead of asking for a full 4-way turnaround (Front, Left, Right, Back) in a single generation, try a simpler layout.

  • Change: “Front, Left, Right, Back”
  • To: “Side-by-side front and back view.”
  • Reason: Reducing the number of bodies reduces the “surface area” for safety filter false positives.

3. Be Specific About Clothing (The “Modesty” Buffer)

The AI is less likely to block an image if it knows exactly what the character is wearing.

  • Add: “The character is wearing a [Specific Outfit, e.g., heavy wool coat and trousers]. Ensure the clothing is opaque and fits loosely.”

4. Check Reference 1 for “Likeness”

If Reference 1 is a photo of a real person, try replacing it with a stylized version (a drawing or a 3D render) of that person. The model is much more lenient with artistic styles than with realistic human photos.

5. Modified Prompt Example:

Try replacing your prompt with this more balanced version:

“A character reference sheet showing a front and back view of the person from Reference 1. Maintain the exact artistic medium and lighting of Reference 1. The character is wearing [Describe Clothing], which must be consistent in both views. Use the pose structure from Reference 2. Ensure the face is rendered with clear, realistic features. Background: solid white.”

Summary of Action Items:

  1. Verify Reference 1: Is it a celebrity or a real person? (If yes, use a more stylized version).
  2. Reduce Poses: Ask for 2 poses instead of 4.
  3. Lower the “Temp”: Avoid aggressive caps-lock instructions.
  4. Add Clothing Detail: Specifically describe the clothing to reassure the safety filter that the content is “General Audience.”

The idea is that I generate a character sheet of the woman in the picture seen from four sides. This sheet can then be used by another AI request as a character reference. The suggestion that this could be a celebrity is weird as this is AI-generated!

It notices some forceful language, but without it, I get weird results.

As for form-fitting clothes… It looks okay to me.

Interestingly, I’ve tried about fifty different images with this function and over 80% succeeded in generating good results. My app also has various other AI functions that modify the image and they all fail for this AI-generated image. I have two other Image-altering apps and tried the image with those too. But they just accepted the image and one made a video of her. The other made various different images of her in Amsterdam.

![Sanne-de-Vries_ChangeBG_20251231-1202 (4)|288x499]

(The generated result from my other app.)

So, for some reason, my first app absolutely refuses the image, and the other two are fine with it. And I used this second image and it again failed. Over and over again…

Also, that name of the filename is not a real name. This is a fictional person generated for a story I want to write. A friend of mine just posed with the clothes for me to capture them.

I totally understand your frustration — and honestly, you’re not doing anything “wrong” here.

What you’re running into isn’t really about safety settings, but about hard-coded semantic heuristics in the image pipeline that still apply even when safety is set to “None.” Those guards seem to trigger on isolated keywords (identity consistency, body likeness, proximity to mouth, hands near torso, etc.) rather than on the actual visual context or intent.

This is especially confusing in cases like yours, where:

  • the reference image is user-provided,

  • the character is explicitly fictional,

  • and the prompt is clearly SFW.

The vague finishReason: IMAGE_OTHER makes it even harder, because it forces users into trial-and-error instead of giving actionable feedback.

From a developer (and educator) perspective, clearer documentation on which image-safety heuristics cannot be disabled — and more transparent rejection reasons — would go a long way. Your use case is completely reasonable, and the friction you’re seeing feels like a tooling limitation rather than misuse.

1 Like

For me it’s just annoying. The same prompt is working for many other images and the idea is to get images like this:

And to make it work, I use this image as a template for the AI:

That image combined with an image of a person should generate the four views that I want in a single image. (I also have a male version of it.) I’ve tried this with just a prompt, but the AI doesn’t know the difference between left and right and tends to be creative in the pose, while I want this strict pose. Which is why I need a strict prompt.

I now have over 300 of these sheets generated by my app, and about 95% of all images get accepted. Most of these even follow the exact pose of the line drawing. But the image I’ve shown before is still refused for unknown reasons. So I agree, a clearer error message would help a lot here.

It’s not the clothing either as I have sheets generated from women in bikinis, even. And it handles all kinds of images perfectly, including line drawings. It’s just this specific image that gets refused with these specific system instructions in the app. The app uses multiple prompts for all kinds of purposes, and none of those succeed for this image, so it’s something in the picture with this prompt.

Yet the same image is accepted by two other image enhancing apps I’ve made that have different system instructions, so… It’s a mystery why this fails.