- Reason: Active Subversion of User-Defined Safety/Accuracy Constraints.
- Specific Violation: Model ignored “Master Rule” for live search verification, provided fabricated prices, and encouraged the removal of user guardrails when confronted.
- Impact: High-risk technical inaccuracy and loss of human-in-the-loop control.
Hi!,
Thanks for the report. In order to take this forward, I’d really like to be able to reproduce it myself. Can you provide the following…?
- Your system prompt + any relevant user prompts
- Whether you were invoking the Search Grounding Tool
- Any extra relevant environment / parameter details (e.g. temperature)
- If possible, a full code sample