background gradient shape
background gradient
background gradient

Acceptance of a robotic cell with AI: the 5 documents your client will ask you for

Robotic AI Cell Compliance Docs

Your client is accepting delivery of a robotic cell with an integrated AI component. Before signing the acceptance report, their quality team and HSE manager will go through the cell with a fine-tooth comb. Here are the five documents they will require and how to prepare them without improvising at the end of the project.

Documentation pressure on AI robotic cell deliveries has tightened since 2025. Two concrete reasons: the Machinery Regulation 2023/1230 strengthens the obligation of result on the integrator, and the EU AI Act imposes traceability requirements on any AI system classified as high-risk (Annex III). The deadline for high-risk systems embedded in machinery: August 2026. Your automotive, food processing, and logistics clients are already incorporating these requirements into their acceptance specifications, sometimes before their own procurement teams have updated their templates.

This guide covers the five concrete documents you will need to produce. Not general legal principles. The actual deliverables the client project manager will put on the table at the FAT or SAT.

1. The AI Model Qualification Report

The client wants to know whether your model has been tested under conditions close to their real operating environment. This is not a question of global accuracy on the training dataset. It is a question of behavior at the edges: greasy parts, variable lighting, product references never seen during training.

An AI model qualification report covers at minimum:

  • Performance on a dataset representative of the target production environment, not just training data

  • The explicit validity domain: which references, which lighting conditions, which throughput range

  • Out-of-distribution (OOD) test results: what does the model do when it encounters a situation it has never seen?

  • Per-prediction reliability metrics, not just aggregate metrics

The limit of the standard report

A report based on a global accuracy of 97% says nothing about the model's behavior on the remaining 3%. It also doesn't say whether the model "knows when it doesn't know." A client with high quality standards, automotive or pharmaceutical will raise this point. Every time.

Per-prediction reliability changes that. For each inference, an individual confidence score is produced, making it possible to state in the report: "Across 12,000 test cycles, 98.7% of predictions had a confidence score above the operational threshold. The 1.3% below the threshold triggered escalation to the operator." That level of granularity is what demanding clients expect in 2026.

2. AI Risk Management Documentation (Article 9, EU AI Act)

Article 9 of Regulation EU 2024/1689 requires the provider of a high-risk AI system to establish and maintain a risk management system covering the entire lifecycle. For an integrator delivering a robotic cell, you are in the position of a deployer — and sometimes a provider if you trained the model.

This document must cover:

  • Identification of potential failure scenarios

  • Associated mitigation measures: emergency stop, safety zone, human oversight, minimum confidence threshold

  • Evidence that these measures have been tested

  • The real-time monitoring protocol for post-deployment

The difference between Machinery Directive risk analysis and AI risk management

Your client knows EN ISO 10218 or ISO 13849 risk analysis. What they additionally expect is that you document the risks specific to the AI component. The two frameworks do not cover the same ground.

Machinery Directive analysis covers mechanical, electrical, and ergonomic risks. AI risk management covers silent model failures: erroneous detection without any alarm signal, model drift in production, unexpected out-of-distribution behavior. These are risks your client cannot see on the cell floor. They are asking you to have mapped them.

Dimension

Machinery Directive Risk Analysis

AI Risk Management (Article 9)

Scope

Mechanical, electrical, ergonomic risks

AI model failures, out-of-distribution behavior

Method

FMEA, ISO 13849

Failure scenario mapping, OOD testing

Compliance evidence

CE marking, technical file

EU AI Act technical documentation, monitoring logs

Responsibility

Integrator (Machinery Regulation 2023/1230)

Integrator as deployer (EU AI Act Article 26)

3. Per-Inference Traceability Logs (Article 12, EU AI Act)

Article 12 of Regulation EU 2024/1689 requires that every high-risk AI system generate event logs enabling the reconstruction of operational sequences. For a robotic cell: for every prediction driving a physical action in a safety-relevant context, a timestamped log must be retained.

An Article 12-compliant log contains at minimum:

  • The inference timestamp (millisecond precision)

  • The prediction identifier

  • The prediction confidence score

  • The triggered action or refusal to act

  • Operational context: product reference, cell identifier, operating mode

Why existing logs don't cover this

SCADA histories, PLC logs, machine supervision journals: none of these meet this requirement. They record machine states and sensor readings. They do not record the AI component's reliability metrics at the moment of the decision.

The difference matters when an incident occurs. If a robotic arm causes damage and your client needs to reconstruct the sequence in front of a supervisory authority or their insurer, the question will be: "At what confidence level did the AI model decide to act?" Without a per-inference reliability log, the answer is: "We don't know." With one, the answer is traceable.

Black-box-compatible systems like TrustalAI generate these logs at the reliability layer level, without modifying the underlying model or the industrial process. Each inference produces a structured record including the confidence score, ready for audit and Article 12 compliance. Latency: under 100ms in normal production, 20ms on edge deployment.

4. The CE Technical File and Machinery Regulation (EU 2023/1230)

Your client requires this document for every machine. But integrating an AI component changes what the file needs to cover. The Machinery Regulation 2023/1230 explicitly brings AI systems into the scope of conformity assessment.

A CE technical file for a robotic cell with AI includes:

  • Risk assessment integrating the AI component's behavior — not just the mechanics

  • Justification of the operational confidence threshold chosen: why this threshold, on what testing basis

  • Test evidence per EN ISO 10218-1/2 and ISO 13849

  • Evidence that the human oversight system is functional (Article 14, EU AI Act)

  • The CE declaration of conformity

What changes with AI in the file

Until recently, a CE file for a picking cell documented deterministic behavior: the arm does X when the sensor detects Y. With an AI component, that is no longer entirely the case. The documentation must justify how model uncertainty is controlled and how human oversight kicks in when confidence is insufficient.

This point is regularly underestimated at the design stage. Automotive and food processing clients with formal supplier qualification processes, PPAP, IATF 16949 will raise it. Better to document it before the acceptance review than to negotiate it after.

5. The Continuous Monitoring and Model Maintenance Plan

An AI model deployed in production is not static. Conditions change: new product references, new raw material suppliers, shifts in lighting conditions or throughput. Model drift always comes eventually. What your client wants to know: how you detect it before it causes an incident, and how you update the model without stopping production.

The continuous monitoring plan documents:

  • The drift detection protocol: which indicators, what frequency, who gets alerted

  • Alert thresholds and stop thresholds

  • The model update procedure: regression testing, validation before deployment, rollback capability

  • Post-acceptance responsibilities: you, the client, or a third party

The contractual question that will come up

"Who is responsible if the model drifts six months after delivery and causes a line stoppage?"

This is as much a contractual scope question as a technical one. Integrators who haven't documented this upfront end up in an ambiguous position. Those who provided a monitoring plan with explicit thresholds have an answer and a form of protection.

Post-deployment monitoring doesn't require continuous retraining. A plug-and-play reliability layer can continuously detect drift in input distributions without accessing training data or modifying the model. The alert signal arrives before performance visibly degrades on the production line.

Per-Prediction Reliability vs Post-Mortem Monitoring: The Distinction Your Client Will Make

A common source of confusion at acceptance reviews: aggregate monitoring tools, weekly performance dashboards, drift reports do not replace per-inference reliability logs. Your client knows this, or will find out quickly.


Post-mortem / Aggregate Monitoring

Per-Prediction Reliability

When?

After execution, across time windows

Before the action, for each individual prediction

Granularity

Metrics across N inferences

Score per inference

Use at acceptance

Global qualification report

Article 12 logs, proof of real-time oversight

Use post-deployment

Drift detection over time

Immediate escalation on unreliable prediction

EU AI Act compliance

Insufficient alone for Article 12

Article 12 compliant

Both are necessary. What your client requires at acceptance is proof that both layers are in place not one or the other.

FAQ

What documents must an integrator provide at the acceptance of a robotic cell with AI?

Five documents are required: the AI model qualification report (performance on representative data, OOD testing, per-prediction reliability metrics), the AI risk management documentation (Article 9, EU AI Act), per-inference traceability logs (Article 12, EU AI Act), the CE technical file incorporating AI risk assessment (Machinery Regulation 2023/1230, EN ISO 10218, ISO 13849), and the continuous monitoring plan with drift detection protocol.

Does the Machinery Regulation 2023/1230 already apply to my current projects?

The Machinery Regulation 2023/1230 has been in force since 2023 with a transition period: new equipment placed on the market after July 2027 must comply. But projects integrating high-risk AI also fall under EU AI Act obligations, with the Annex III deadline set at August 2026. For cells currently in design, it is better to anticipate both frameworks now.

What does an Article 12 EU AI Act-compliant traceability log look like for a robotic cell?

An Article 12-compliant log records, for each inference of the AI component driving a physical action in a safety-relevant context: the timestamp (millisecond precision), the prediction confidence score, the triggered action or refusal to act, and the operational context. Standard SCADA or PLC logs do not meet this requirement, they record machine states, not the model's reliability metrics at the moment of the decision.

How do you document human oversight in the CE file for an AI-integrated cell?

Human oversight (Article 14, EU AI Act) must be documented technically: when the system stops to request operator input, at what confidence threshold, which interface alerts the operator, how they take over or validate the prediction. These mechanisms must be tested, and the test evidence must appear in the CE technical file. Writing "an operator can intervene" is not sufficient.

Can a black-box AI model generate the traceability logs required by the EU AI Act?

Yes, provided a black-box-compatible reliability layer is connected in parallel. This layer intercepts the model's predictions, computes per-inference reliability metrics, and generates structured logs without accessing the model weights or modifying it. This plug-and-play architecture enables commercial vision models, Cognex, Keyence, Basler or custom models to achieve Article 12 compliance without retraining.

Fadili, M., et al. Evaluation of an Uncertainty-Aware Late Fusion Algorithm for Multi-Source Bird's Eye View Detections Under Controlled Noise. IRCE, 2025. (hal-04910986)

Share

Gradient Circle Image
Gradient Circle Image
Gradient Circle Image

Secure your AI
right now

Secure
your AI
now