Skip to main content

AI/ML / Multi Agent Refarch / Threats / DEV

Foundation-model training and fine-tuning data poisoning

CCC.MARefArc.TH06

Adversaries tamper with training, fine-tuning, or third-party data feeds behind the approved models, mislabeling data or embedding backdoor triggers and biases that corrupt downstream decisions without visible symptoms until a major failure.

Related Capabilities

IDTitleDescription
CCC.MARefArc.CP14Approved-model registry and lifecycleCatalog of approved models with metadata, version information, configuration parameters, and usage constraints, ensuring agents access only models meeting organizational, regulatory, and security standards.

Related Controls

IDTitleDescription
CCC.MARefArc.CN01Data Filtering From External Knowledge BasesSanitize, filter, and classify data ingested by the Knowledge Layer from internal and external source bases before it is embedded into the vector store or used for retrieval-augmented generation, preventing inadvertent exposure or manipulation of sensitive organizational knowledge.
CCC.MARefArc.CN04Data Quality and ClassificationAssess the quality of, and assign classification and sensitivity labels to, all data used for grounding, training, and fine-tuning, and enforce handling rules derived from those labels throughout the Knowledge and LLM layers.
CCC.MARefArc.CN08Role-Based Access Control for AI DataEnforce least-privilege, role-based access control over all AI data stores, including source bases, the vector store, and model artifacts.

External Mappings

FrameworkIDRemarks
air-vecAIR-SEC-009-01
air-vecAIR-SEC-009-03
air-vecAIR-SEC-009-04