Skip to main content

AI/ML / Multi Agent Refarch / Threats / DEV

Confident hallucination and fabricated facts

CCC.MARefArc.TH16

Lacking ground truth and faced with ambiguous prompts or helpfulness-biased tuning, the model fabricates plausible but false facts, figures, or citations, presented with high fluency that makes errors hard to catch and likely to be acted upon.

Related Capabilities

IDTitleDescription
CCC.MARefArc.CP16Model-interaction zero-trust guardrailsEnforces authentication and authorization for every inference request and applies input validation against prompt injection, output filtering and redaction, access control, rate limits, and cost management before and after model execution.
CCC.MARefArc.CP22Runtime protectionMonitors agent actions and model outputs during execution to detect unsafe, non-compliant, or anomalous behavior, enforcing constraints, blocking disallowed actions, or triggering escalation.
CCC.MARefArc.CP02Human-in-the-loop output reviewApplication-embedded controls that allow users to review, approve, or modify agent outputs before they are executed or shared.

Related Controls

IDTitleDescription
CCC.MARefArc.CN03System Acceptance TestingValidate agents, models, and end-to-end workflows against accuracy, robustness, bias, drift, and compliance criteria before promotion to production, and re-validate after material changes.
CCC.MARefArc.CN04Data Quality and ClassificationAssess the quality of, and assign classification and sensitivity labels to, all data used for grounding, training, and fine-tuning, and enforce handling rules derived from those labels throughout the Knowledge and LLM layers.
CCC.MARefArc.CN20Citations and Source Traceability for AI-Generated InformationAttach citations and source traceability to AI-generated information so that outputs can be verified against retrieved sources and decisions can be explained.
CCC.MARefArc.CN21Automated Evaluation Using LLM-as-a-JudgeUse automated model-based evaluation in the Evaluation Layer to assess output quality, grounding, bias, and policy compliance at scale.

External Mappings

FrameworkIDRemarks
air-vecAIR-OP-004-01
air-vecAIR-OP-004-02
air-vecAIR-OP-004-03
air-vecAIR-OP-004-04