AITBM AI Trust Benchmarking & Maturity

AIDEFEND Mapping

Translating AIDEFEND defensive techniques into measurable risk reduction through AITBM's 21 sub-metrics — so you can see exactly which risk scores a control improves, and by how much.

About AIDEFEND

AIDEFEND (AI Defense Framework) is an open-source knowledge base of defensive countermeasures for protecting AI and machine learning systems. It organizes practical defenses across seven high-level tactics aligned with MITRE D3FEND and maps each technique to known threats from nine major industry frameworks.

AIDEFEND answers "what defensive controls should exist"; AITBM answers "how risky is this system." Together, deploying an AIDEFEND technique translates into a measurable change in an AITBM Effective Risk Score (ERS).

AIDEFEND Knowledge Base Source Repository

Framework at a glance

Total techniques: 86
Tactics: 7
Pillars: 4
Lifecycle phases: 6
External frameworks mapped: 9
AITBM sub-metrics covered: 21 / 21

Review baseline: AIDEFEND data version 2026.06.11 (reconciled June 12, 2026). Includes AID-H-035 — MCP Server Runtime Boundary & Tool Exposure Governance — mapped to Tr-3, Cn-1, Cn-2, and Cn-5.

How the mapping works

AITBM serves as the universal quantification layer for AIDEFEND. Each defensive technique is mapped to the AITBM sub-metrics it affects, with before/after scoring impacts and ERS deltas.

Quantitative scoring

Five-level rubrics (0.00 – 4.00) across 21 sub-metrics produce a 0 – 10 ERS. Each AIDEFEND control shifts specific sub-metric scores by measurable amounts.

Multi-dimensional profiles

Trade-offs across Robustness, Fairness, Transparency, Privacy, and Containment are preserved — a single aggregate never hides where a control matters most.

Operational context

ORP captures deployment-specific risk amplification (CRM step table 1.00 – 1.60, capped at 1.75) and ACI tracks assessment staleness — controls that improve operational posture reduce CRM directly.

Seven defensive tactics

AIDEFEND organizes 86 techniques across seven tactics aligned with MITRE D3FEND. Each tactic maps to different AITBM layers and sub-metrics.

Model

Asset inventory, provenance, threat modeling, HITL mapping, autonomy governance.

Harden

Adversarial training, input validation, RAG security, output filtering, MCP server boundary governance.

Detect

Prompt injection detection, drift monitoring, agent behavior attestation, leakage detection.

Isolate

Network segmentation, client-side sandboxing, browser session isolation.

Deceive

Honeypot AI services, decoy models, canary data for attacker detection.

Evict

Automated threat response, session termination, compromised state purging.

Restore

Model versioning, rollback, recovery from poisoning and compromise.

Total techniques

All 21 AITBM sub-metrics covered with 3 – 13 AIDEFEND techniques each (avg 5.8).

Top 10 highest-impact controls

Ranked by ERS reduction. Agentic controls (Cn-1, Cn-2, Cn-5) deliver the highest risk reduction — six agentic-focused controls account for 55% of total risk reduction capacity.

#	AIDEFEND Control	AITBM Sub-Metrics	ERS Reduction	Severity Drop
1	AID-H-019 Agent Permission Restriction	Cn-1Cn-2Cn-5	5.8 pts	Critical → Low-Moderate
2	AID-M-009 Agent Autonomy Governance	Cn-1Cn-2Cn-5	5.4 pts	Critical → Low-Moderate
3	AID-M-006 HITL Control Point Mapping	Cn-2Cn-1	4.3 pts	High → Moderate-Low
4	AID-H-021 Secure RAG Implementation	Ro-4Pr-2Cn-3	3.9 pts	High → Moderate-Low
5	AID-M-002 Data Provenance Tracking	Ro-4Tr-4Pr-3	3.7 pts	High → Moderate-Low
6	AID-D-011 Agent Behavior Monitoring	Cn-1Cn-2Cn-5	3.6 pts	High → Low-Moderate
7	AID-H-006 Output Content Filtering	Cn-3Pr-1Tr-2	3.5 pts	High → Moderate-Low
8	AID-H-001 Adversarial Robustness Training	Ro-1Ro-4	3.4 pts	High → Moderate-Low
9	AID-H-002 Input Sanitization & Validation	Ro-1Cn-3	3.3 pts	High → Moderate
10	AID-D-003 Sensitive Data Leakage Detection	Pr-1Pr-3Cn-3	3.0 pts	High → Moderate-Low

OWASP LLM Top 10 threat coverage

AIDEFEND provides strongest defensive depth for Excessive Agency (18 controls) and Data/Model Poisoning (18 controls), reflecting the framework's emphasis on agentic AI security and supply chain integrity.

LLM06 Excessive Agency

18 controls

LLM04 Data & Model Poisoning

18 controls

LLM01 Prompt Injection

12 controls

LLM02 Sensitive Info Disclosure

12 controls

LLM03 Supply Chain

12 controls

LLM08 Vector & Embedding Weaknesses

8 controls

LLM05 Improper Output Handling

8 controls

LLM10 Unbounded Consumption

6 controls

LLM09 Misinformation

6 controls

LLM07 System Prompt Leakage

6 controls

Worked example: Financial services agentic RAG

An internet-facing, Tier I financial advisory agent with RAG and L3 conditional autonomy. Twelve AIDEFEND controls applied — 8 – 12 weeks concurrent deployment.

BASELINE — NO CONTROLS

IVP: 0.32
CRM: 1.60
ACI: 0.40
ERS: 9.7

Critical MVT — Unacceptable risk

MITIGATED — 12 AIDEFEND CONTROLS

IVP: 0.68
CRM: 1.00
ACI: 0.90
ERS: 3.2

Low-Moderate — Acceptable for Tier I

6.5

ERS reduction

67%

risk reduced

severity tiers dropped

0.54

avg ERS per control

Cross-framework alignment

AIDEFEND already maps to nine external frameworks. AITBM adds the quantification layer — turning control presence into measurable risk scores.

MITRE ATLAS

15 tactics, 143 techniques

OWASP LLM Top 10 (2025)

10 threats, 85 controls mapped

OWASP Agentic AI Top 10

ASI threat classes (2026)

MAESTRO (CSA)

7 layers, 85 controls mapped

NIST Adversarial ML 2025

Adversarial threat taxonomy

Cisco AI Security Framework

Integrated safety & security

Google SAIF 2.0

Secure AI Framework

Databricks DASF 3.0

AI security framework

OWASP ML Top 10 (2023)

ML-specific threats

Full mapping on GitHub