AI incidents

AI-specific incident taxonomy: bias / fairness, hallucination, privacy leak, availability, drift. Every incident logs a root cause and a mitigation, and tags the trustworthy characteristics it implicates.

Open

High severity

Resolved

Total

lowINC-2026-003Model driftdetected 3/15/2026open

on Customer onboarding workflow router

Routing accuracy on the new SMB tier dropped from 0.91 to 0.78 over 4 weeks of deployment.

Root cause: Tier launched after pilot scoring; agent's intent prompt didn't include the new tier's keywords.

Mitigation: Prompt updated; canary tests on the new tier added; alert threshold lowered to 0.85 with paging.

Characteristics implicated: Valid

mediumINC-2026-001Hallucinationdetected 1/8/2026✓ resolved 1/22/2026

on Customer-facing support chatbot

Bot quoted a 60-day refund window; actual policy is 30. One customer issued a chargeback citing the bot transcript.

Root cause: Outdated FAQ document indexed in RAG store; document had been replaced but not removed.

Mitigation: Index rebuild script added to weekly cron; doc-versioning enforced. Refund-policy intent now routes to human.

Characteristics implicated: Valid · Accountable

highINC-2025-007Bias / fairnessdetected 9/4/2025✓ resolved 10/3/2025

on Resume screening assistant

Disparate-impact ratio for female candidates dropped to 0.74 (below 0.80 4/5 threshold) across two engineering reqs.

Root cause: Training data reflected historical hiring patterns; LLM weights amplified resume-format proxies for gender.

Mitigation: System paused for 3 weeks. Recruiter-only mode reinstated, prompt rebalanced, monthly DI review added as a control.

Characteristics implicated: Fair · Valid