Read the full pillar: EU AI Act Article 15 explainer →
What the regulation requires
1. High-risk AI systems shall be designed and developed in such a way that they achieve an appropriate level of accuracy, robustness, and cybersecurity, and that they perform consistently in those respects throughout their lifecycle. 3. The levels of accuracy and the relevant accuracy metrics of high-risk AI systems shall be declared in the accompanying instructions of use. 4. High-risk AI systems shall be as resilient as possible regarding errors, faults or inconsistencies that may occur within the system or the environment in which the system operates, in particular due to their interaction with natural persons or other systems. Technical and organisational measures shall be taken in this regard. The robustness of high-risk AI systems may be achieved through technical redundancy solutions, which may include backup or fail-safe plans.
What you face if you don't comply
Article 15 becomes enforceable on 2 August 2026 for high-risk AI systems under Annex III. Providers must declare accuracy metrics in the instructions for use and demonstrate consistent performance across the lifecycle; non-compliance via the Article 16 provider-obligation chain is sanctionable up to €15M or 3% of global annual turnover under Article 99(4). For RAG-based high-risk systems, "appropriate accuracy" is not a self-asserted figure — it is a metric declared on the label and defensible against post-market evidence.
How RAG Benchmarking addresses this
- ¶ 15(1)Reproducible accuracy benchmarks for RAG pipelines (retrieval recall, answer faithfulness, citation precision) with versioned eval sets
- ¶ 15(3)Generates the accuracy-metrics block for the Article 13 instructions for use, with confidence intervals and eval-set provenance
- ¶ 15(4)Robustness suite: input perturbations, noisy-context, adversarial-passage, and OOD query stress tests with pass/fail thresholds
- ¶ 15(4)Lifecycle drift monitoring — replays the declared eval set against the live system on a schedule and alerts on metric regression
Source: eur-lex.europa.eu/…/CELEX:32024R1689 · Retrieved