Skip to main content

AI/ML / Multi Agent Refarch / Threats / DEV

Silent model version, prompt, and deployment drift

CCC.MARefArc.TH19

Providers silently retrain, re-prompt, or re-architect models, or change deployment and API defaults, shifting behaviour even when inputs are unchanged; without version pinning in the model registry this breaks reproducibility and validated behaviour.

Related Capabilities

IDTitleDescription
CCC.MARefArc.CP14Approved-model registry and lifecycleCatalog of approved models with metadata, version information, configuration parameters, and usage constraints, ensuring agents access only models meeting organizational, regulatory, and security standards.

Related Controls

IDTitleDescription
CCC.MARefArc.CN03System Acceptance TestingValidate agents, models, and end-to-end workflows against accuracy, robustness, bias, drift, and compliance criteria before promotion to production, and re-validate after material changes.
CCC.MARefArc.CN07AI Model Version PinningPin and record explicit model versions in the Model Registry so that model behaviour is reproducible and provider-side changes are surfaced rather than silently absorbed.
CCC.MARefArc.CN17AI System ObservabilityInstrument every layer to emit logs, traces, metrics, and events to the Observability Layer so that behaviour, drift, availability, and data handling are continuously visible and auditable.
CCC.MARefArc.CN19Human Feedback Loop for AI SystemsCapture human feedback on agent outputs through the Feedback Engine and Human Supervision capabilities and feed it into evaluation and improvement of agents and models.
CCC.MARefArc.CN21Automated Evaluation Using LLM-as-a-JudgeUse automated model-based evaluation in the Evaluation Layer to assess output quality, grounding, bias, and policy compliance at scale.

External Mappings

FrameworkIDRemarks
air-vecAIR-OP-005-01
air-vecAIR-OP-005-02
air-vecAIR-OP-005-03
air-vecAIR-OP-005-04