Back

May 7, 2026

Acceptance of a robotic cell with AI: the 5 documents your client will ask you for

Your client is accepting delivery of a robotic cell with an integrated AI component. Before signing the acceptance report, their quality team and HSE manager will go through the cell with a fine-tooth comb. Here are the five documents they will require and how to prepare them without improvising at the end of the project.

Documentation pressure on AI robotic cell deliveries has tightened since 2025. Two concrete reasons: the Machinery Regulation 2023/1230 strengthens the obligation of result on the integrator, and the EU AI Act imposes traceability requirements on any AI system classified as high-risk (Annex III). The deadline for high-risk systems embedded in machinery: August 2026. Your automotive, food processing, and logistics clients are already incorporating these requirements into their acceptance specifications, sometimes before their own procurement teams have updated their templates.

This guide covers the five concrete documents you will need to produce. Not general legal principles. The actual deliverables the client project manager will put on the table at the FAT or SAT.

1. The AI Model Qualification Report

The client wants to know whether your model has been tested under conditions close to their real operating environment. This is not a question of global accuracy on the training dataset. It is a question of behavior at the edges: greasy parts, variable lighting, product references never seen during training.

An AI model qualification report covers at minimum:

Performance on a dataset representative of the target production environment, not just training data
The explicit validity domain: which references, which lighting conditions, which throughput range
Out-of-distribution (OOD) test results: what does the model do when it encounters a situation it has never seen?
Per-prediction reliability metrics, not just aggregate metrics

The limit of the standard report

A report based on a global accuracy of 97% says nothing about the model's behavior on the remaining 3%. It also doesn't say whether the model "knows when it doesn't know." A client with high quality standards, automotive or pharmaceutical will raise this point. Every time.

Per-prediction reliability changes that. For each inference, an individual confidence score is produced, making it possible to state in the report: "Across 12,000 test cycles, 98.7% of predictions had a confidence score above the operational threshold. The 1.3% below the threshold triggered escalation to the operator." That level of granularity is what demanding clients expect in 2026.

2. AI Risk Management Documentation (Article 9, EU AI Act)

Article 9 of Regulation EU 2024/1689 requires the provider of a high-risk AI system to establish and maintain a risk management system covering the entire lifecycle. For an integrator delivering a robotic cell, you are in the position of a deployer — and sometimes a provider if you trained the model.

This document must cover:

Identification of potential failure scenarios
Associated mitigation measures: emergency stop, safety zone, human oversight, minimum confidence threshold
Evidence that these measures have been tested
The real-time monitoring protocol for post-deployment

The difference between Machinery Directive risk analysis and AI risk management

Your client knows EN ISO 10218 or ISO 13849 risk analysis. What they additionally expect is that you document the risks specific to the AI component. The two frameworks do not cover the same ground.

Machinery Directive analysis covers mechanical, electrical, and ergonomic risks. AI risk management covers silent model failures: erroneous detection without any alarm signal, model drift in production, unexpected out-of-distribution behavior. These are risks your client cannot see on the cell floor. They are asking you to have mapped them.

Dimension	Machinery Directive Risk Analysis	AI Risk Management (Article 9)
Scope	Mechanical, electrical, ergonomic risks	AI model failures, out-of-distribution behavior
Method	FMEA, ISO 13849	Failure scenario mapping, OOD testing
Compliance evidence	CE marking, technical file	EU AI Act technical documentation, monitoring logs
Responsibility	Integrator (Machinery Regulation 2023/1230)	Integrator as deployer (EU AI Act Article 26)

3. Per-Inference Traceability Logs (Article 12, EU AI Act)

Article 12 of Regulation EU 2024/1689 requires that every high-risk AI system generate event logs enabling the reconstruction of operational sequences. For a robotic cell: for every prediction driving a physical action in a safety-relevant context, a timestamped log must be retained.

An Article 12-compliant log contains at minimum:

The inference timestamp (millisecond precision)
The prediction identifier
The prediction confidence score
The triggered action or refusal to act
Operational context: product reference, cell identifier, operating mode

Why existing logs don't cover this

SCADA histories, PLC logs, machine supervision journals: none of these meet this requirement. They record machine states and sensor readings. They do not record the AI component's reliability metrics at the moment of the decision.

The difference matters when an incident occurs. If a robotic arm causes damage and your client needs to reconstruct the sequence in front of a supervisory authority or their insurer, the question will be: "At what confidence level did the AI model decide to act?" Without a per-inference reliability log, the answer is: "We don't know." With one, the answer is traceable.

Black-box-compatible systems like TrustalAI generate these logs at the reliability layer level, without modifying the underlying model or the industrial process. Each inference produces a structured record including the confidence score, ready for audit and Article 12 compliance. Latency: under 100ms in normal production, 20ms on edge deployment.

4. The CE Technical File and Machinery Regulation (EU 2023/1230)

Your client requires this document for every machine. But integrating an AI component changes what the file needs to cover. The Machinery Regulation 2023/1230 explicitly brings AI systems into the scope of conformity assessment.

A CE technical file for a robotic cell with AI includes:

Risk assessment integrating the AI component's behavior — not just the mechanics
Justification of the operational confidence threshold chosen: why this threshold, on what testing basis
Test evidence per EN ISO 10218-1/2 and ISO 13849
Evidence that the human oversight system is functional (Article 14, EU AI Act)
The CE declaration of conformity

What changes with AI in the file

Until recently, a CE file for a picking cell documented deterministic behavior: the arm does X when the sensor detects Y. With an AI component, that is no longer entirely the case. The documentation must justify how model uncertainty is controlled and how human oversight kicks in when confidence is insufficient.

This point is regularly underestimated at the design stage. Automotive and food processing clients with formal supplier qualification processes, PPAP, IATF 16949 will raise it. Better to document it before the acceptance review than to negotiate it after.

5. The Continuous Monitoring and Model Maintenance Plan

An AI model deployed in production is not static. Conditions change: new product references, new raw material suppliers, shifts in lighting conditions or throughput. Model drift always comes eventually. What your client wants to know: how you detect it before it causes an incident, and how you update the model without stopping production.

The continuous monitoring plan documents:

The drift detection protocol: which indicators, what frequency, who gets alerted
Alert thresholds and stop thresholds
The model update procedure: regression testing, validation before deployment, rollback capability
Post-acceptance responsibilities: you, the client, or a third party

The contractual question that will come up

"Who is responsible if the model drifts six months after delivery and causes a line stoppage?"

This is as much a contractual scope question as a technical one. Integrators who haven't documented this upfront end up in an ambiguous position. Those who provided a monitoring plan with explicit thresholds have an answer and a form of protection.

Post-deployment monitoring doesn't require continuous retraining. A plug-and-play reliability layer can continuously detect drift in input distributions without accessing training data or modifying the model. The alert signal arrives before performance visibly degrades on the production line.

Per-Prediction Reliability vs Post-Mortem Monitoring: The Distinction Your Client Will Make

A common source of confusion at acceptance reviews: aggregate monitoring tools, weekly performance dashboards, drift reports do not replace per-inference reliability logs. Your client knows this, or will find out quickly.

	Post-mortem / Aggregate Monitoring	Per-Prediction Reliability
When?	After execution, across time windows	Before the action, for each individual prediction
Granularity	Metrics across N inferences	Score per inference
Use at acceptance	Global qualification report	Article 12 logs, proof of real-time oversight
Use post-deployment	Drift detection over time	Immediate escalation on unreliable prediction
EU AI Act compliance	Insufficient alone for Article 12	Article 12 compliant

Both are necessary. What your client requires at acceptance is proof that both layers are in place not one or the other.

TrustalAI is here to support you

FAQ

What documents must an integrator provide at the acceptance of a robotic cell with AI?

Five documents are required: the AI model qualification report (performance on representative data, OOD testing, per-prediction reliability metrics), the AI risk management documentation (Article 9, EU AI Act), per-inference traceability logs (Article 12, EU AI Act), the CE technical file incorporating AI risk assessment (Machinery Regulation 2023/1230, EN ISO 10218, ISO 13849), and the continuous monitoring plan with drift detection protocol.

Does the Machinery Regulation 2023/1230 already apply to my current projects?

The Machinery Regulation 2023/1230 has been in force since 2023 with a transition period: new equipment placed on the market after July 2027 must comply. But projects integrating high-risk AI also fall under EU AI Act obligations, with the Annex III deadline set at August 2026. For cells currently in design, it is better to anticipate both frameworks now.

What does an Article 12 EU AI Act-compliant traceability log look like for a robotic cell?

An Article 12-compliant log records, for each inference of the AI component driving a physical action in a safety-relevant context: the timestamp (millisecond precision), the prediction confidence score, the triggered action or refusal to act, and the operational context. Standard SCADA or PLC logs do not meet this requirement, they record machine states, not the model's reliability metrics at the moment of the decision.

How do you document human oversight in the CE file for an AI-integrated cell?

Human oversight (Article 14, EU AI Act) must be documented technically: when the system stops to request operator input, at what confidence threshold, which interface alerts the operator, how they take over or validate the prediction. These mechanisms must be tested, and the test evidence must appear in the CE technical file. Writing "an operator can intervene" is not sufficient.

Can a black-box AI model generate the traceability logs required by the EU AI Act?

Yes, provided a black-box-compatible reliability layer is connected in parallel. This layer intercepts the model's predictions, computes per-inference reliability metrics, and generates structured logs without accessing the model weights or modifying it. This plug-and-play architecture enables commercial vision models, Cognex, Keyence, Basler or custom models to achieve Article 12 compliance without retraining.

Fadili, M., et al. Evaluation of an Uncertainty-Aware Late Fusion Algorithm for Multi-Source Bird's Eye View Detections Under Controlled Noise. IRCE, 2025. (hal-04910986)

Jul 8, 2026

Multi-target tracking in GPS-denied environments: reliability as an operational defense requirement

Jul 8, 2026

Multi-target tracking in GPS-denied environments: reliability as an operational defense requirement

Jul 8, 2026

Multi-target tracking in GPS-denied environments: reliability as an operational defense requirement

Jul 6, 2026

Trading AI: Why every prediction deserves its confidence interval

Jul 6, 2026

Trading AI: Why every prediction deserves its confidence interval

Jul 6, 2026

Trading AI: Why every prediction deserves its confidence interval

Jun 25, 2026

The confidence interval as a new decision metric in predictive AI

Jun 25, 2026

The confidence interval as a new decision metric in predictive AI

Jun 25, 2026

The confidence interval as a new decision metric in predictive AI

Make your AI reliable now

Request a demo

Acceptance of a robotic cell with AI: the 5 documents your client will ask you for

1. The AI Model Qualification Report

The limit of the standard report

2. AI Risk Management Documentation (Article 9, EU AI Act)

The difference between Machinery Directive risk analysis and AI risk management

3. Per-Inference Traceability Logs (Article 12, EU AI Act)

Why existing logs don't cover this

4. The CE Technical File and Machinery Regulation (EU 2023/1230)

What changes with AI in the file

5. The Continuous Monitoring and Model Maintenance Plan

The contractual question that will come up

Per-Prediction Reliability vs Post-Mortem Monitoring: The Distinction Your Client Will Make

FAQ

What documents must an integrator provide at the acceptance of a robotic cell with AI?

Does the Machinery Regulation 2023/1230 already apply to my current projects?

What does an Article 12 EU AI Act-compliant traceability log look like for a robotic cell?

How do you document human oversight in the CE file for an AI-integrated cell?

Can a black-box AI model generate the traceability logs required by the EU AI Act?

Related articles

Multi-target tracking in GPS-denied environments: reliability as an operational defense requirement

Multi-target tracking in GPS-denied environments: reliability as an operational defense requirement

Multi-target tracking in GPS-denied environments: reliability as an operational defense requirement

Trading AI: Why every prediction deserves its confidence interval

Trading AI: Why every prediction deserves its confidence interval

Trading AI: Why every prediction deserves its confidence interval

The confidence interval as a new decision metric in predictive AI

The confidence interval as a new decision metric in predictive AI

The confidence interval as a new decision metric in predictive AI