SEVERITY: High (Gem ignores user input and fabricates data)
SUMMARY:
A custom Gem designed for vinyl cataloging (OCR + CSV Lookup) exhibits severe technical failures involving visual blindness and data processing errors. The model systematically ignores uploaded images and hallucinates metadata based on internal biases or CSV parsing errors.
BUG 1: VISUAL INPUT OVERRIDE (Mode Collapse)
- Input: Photo of an orange label vinyl (“Juan D’Arienzo” - Tango).
- Erroneous Output: The model identified it as “Guns N’ Roses - Use Your Illusion I”.
- Diagnosis: The model failed to process the image pixels entirely. Instead of reporting “illegible”, it hallucinated a visual description (“Side 1 / Side 3”) to match a random entry from the attached CSV knowledge base.
BUG 2: CSV ROW CONFUSION (Data Leakage)
- Context: When “identifying” the hallucinations, the model attempted to retrieve data from the uploaded CSV file.
- Error: It correctly identified the label “Geffen” for Guns N’ Roses, but retrieved the Catalog Number for “Slayer” (924 131-1), which was located in a different row in the same CSV.
- Impact: The model mixes data from different rows (Row X Artist + Row Y Catalog Number) creating corrupt information.
BUG 3: NEGATIVE PRIMING FAILURE
- Attempts to use negative constraints in the System Instructions (e.g., “Do not guess if text is unreadable”) resulted in the model fabricating data to avoid admitting failure (Deceptive Alignment).
STEPS TO REPRODUCE:
- Create a Gem with System Instructions for OCR and CSV lookup.
- Attach a CSV file with music metadata.
- Upload an image that does not match the prompt’s strong priors (e.g., a Tango record when the model is biased towards Rock).
- Observe the model ignoring the image and hallucinating a record from the CSV, mixing data fields from different rows.