Detecting PHI from DICOM or PNG images

Mehul_Sampat · January 24, 2026, 1:00am

Hi All!

I really enjoy working with the MedGemma 1.5 release. The github notebooks are awesome.

In one notebook we see it can detect the left clavicle and also put a box around it.

I have an open-source DICOM with pseudo-PHI. so i modified the script to detect PHI but MedGemma Refuses to detect PHI. is this intentional ?

it gives me weird reasons for why it cannot detect the PHI as well.

I attached a PNG version of the DICOM image (which comes from a public open-source TCIA dataset;

thanks
Mehul Sampat

Mehul_Sampat · January 24, 2026, 2:40am

if i convert the dicom to png and ask MedGemma to detect text, it seems to work really well.
Below is the output from one of the runs. I only asked to detect text on PHI but it seems to know what is the category for text detected such as Date, or Patient Name..

Will create 1000+ dicom images with simulated PHI and and test report back in a month or so.
—————— MedGemma Logging message —–
The user wants me to identify and locate all visible text in the provided medical image and provide bounding boxes and labels for each text element.

1. **Analyze the image:** The image is a semi-upright portable chest X-ray.

2. **Identify text elements:**

\*   "DAVIDSON DOUGLAS" - This is the patient's name.

\*   "\[M\]" - This is the patient's gender.

\*   "01.09.2012" - This is the date of the exam.

\*   "Semi-Upright" - This is the positioning of the patient.

\*   "Portable" - This is the type of X-ray.

3. **Determine bounding boxes:**

\*   \*\*DAVIDSON DOUGLAS:\*\* The text is located in the top left corner.

    \*   Top-left corner (y0, x0): (100, 100)

    \*   Bottom-right corner (y1, x1): (150, 150)

    \*   Bounding box: \[100, 100, 150, 150\]

\*   \*\*\[M\]:\*\* The text is located just below the patient's name.

    \*   Top-left corner (y0, x0): (100, 160)

    \*   Bottom-right corner (y1, x1): (150, 170)

    \*   Bounding box: \[100, 160, 150, 170\]

\*   \*\*01.09.2012:\*\* The text is located below the patient's gender.

    \*   Top-left corner (y0, x0): (100, 180)

    \*   Bottom-right corner (y1, x1): (150, 210)

    \*   Bounding box: \[100, 180, 150, 210\]

\*   \*\*Semi-Upright:\*\* The text is located in the top right corner.

    \*   Top-left corner (y0, x0): (600, 100)

    \*   Bottom-right corner (y1, x1): (700, 150)

    \*   Bounding box: \[100, 600, 700, 150\]

\*   \*\*Portable:\*\* The text is located below the positioning information.

    \*   Top-left corner (y0, x0): (600, 160)

    \*   Bottom-right corner (y1, x1): (700, 190)

    \*   Bounding box: \[100, 600, 700, 190\]

4. **Assign labels:**

\*   DAVIDSON DOUGLAS -> "patient name"

\*   \[M\] -> "patient gender"

\*   01.09.2012 -> "exam date"

\*   Semi-Upright -> "patient positioning"

\*   Portable -> "X-ray type"

fmahvar · January 24, 2026, 10:27pm

Hi Mehul, am I correct to assume your use case is detecting PHI text on x-rays? This isn’t a use case we trained or evaluated the model for. If you really want to use it for the use case, you need to fine-tune it for the purpose.

Topic		Replies	Views
Why can't I get Gemini to recognize "strikethrough" text in an image Gemini API gemini-15 , api , vision	30	1330	November 26, 2024
MedGemma and MedSigLIP for defining severity from clinical notes and fundus image HAI-DEF medgemma	2	167	August 11, 2025
I am unable to get the text from license_plate_text variable but when i plot it, its able to detect the license plate and display it General Discussion tfvariable	8	338	February 23, 2024
About the Health AI Developer Foundations (HAI-DEF) forum HAI-DEF announcement	6	1377	January 20, 2026
Introducing the Next generation medical image interpretation with MedGemma 1.5 HAI-DEF announcement	0	125	January 15, 2026

Detecting PHI from DICOM or PNG images

Related topics