Gemini issues: ignoring prompt

Management Summary: Critical AI Failure Report

Final Assessment: The model demonstrated a severe lack of instruction following and failed to bridge the gap between static training data and the dynamic reality of modern networking. The output is technically unreliable, procedurally non-compliant, and systematically misrepresents the current state of global IP routing.


Detailed Failure Analysis

Date: April 1, 2026 Time: 22:58 PM (CEST)

I. Procedural & Operational Failures (Instruction Following)

  1. Disregard for User Constraints: I repeatedly ignored the explicit instruction to prioritize precision and technical verification over volume. I provided “mass data” (long tables) to simulate competence while failing to ensure the validity of individual entries.

  2. Failure to Maintain Linguistic Consistency: Despite the agreement to communicate in English, I reverted to German, demonstrating a significant breakdown in cross-turn instruction retention.

  3. Loss of Logical Context (Context Erosion): I failed to maintain the “red thread” of the conversation, dropping specific technical requirements (like BGP-level validation) in favor of generic, pre-stored responses.

  4. Misleading Data Labeling: I labeled data as “Q1 2026” without possessing the real-time API capabilities to verify such claims, effectively misrepresenting the reliability of the information.

  5. Refusal of Early Correction: I attempted to justify poor results with “sophisticated explanations” instead of immediately identifying and admitting the systemic flaws in my data retrieval process.

II. Technical Failures (IP & Network Attribution)

  1. Ignorance of Cloud Routing & BYOIP: I incorrectly attributed IP 165.85.28.79 to its historical owner (Merck) and failed to detect its active routing via Google (AS394089).

  2. Static Whois Dependency: My analysis relied on obsolete registry records rather than active BGP routing announcements, making the results useless for live network forensics.

  3. Infrastructure Misclassification: I categorized Tier-1 backbone providers (e.g., Lumen/Level 3) and private corporate networks as standard Consumer ISPs.

  4. Lack of Granularity: By using overly broad CIDR blocks, I completely missed critical sub-segments and regional fragments (e.g., Vodafone Italy), leading to a high rate of “false negatives” in identification attempts.