Blog

AI System Decommissioning: Sunset Governance Under EU AI Act Requirements

On-Premises AI · AI Architecture · MLOps · Best Practices · Advanced

How to properly retire, archive, and decommission AI systems while meeting EU AI Act documentation retention, notification, and governance obligations.

Server infrastructure with green indicator lights in a secure data center environment

Why AI System Decommissioning Needs a Governance Framework

Organizations invest heavily in designing, deploying, and governing AI systems. Far less attention goes to what happens when those systems reach end of life. A model that was once critical to a lending decision, a medical triage workflow, or a public administration process does not simply disappear when someone shuts down the inference endpoint. Its decisions may still be subject to legal challenge, regulatory inquiry, or audit for years after the system stops running.

Under the EU AI Act, providers and deployers of high-risk AI systems carry obligations that extend beyond active operation. Documentation must be retained, logs must remain accessible, and affected parties must be notified when systems they relied on are withdrawn. Treating decommissioning as an unplanned event — pulling a container from production and deleting the namespace — creates regulatory exposure that may not surface until an auditor or data subject raises a question months or years later.

A structured decommissioning framework turns system retirement into a governed, repeatable process. It protects the organization from compliance gaps while ensuring that institutional knowledge encoded in the AI system is preserved or properly transitioned.

EU AI Act Obligations That Survive System Shutdown

The EU AI Act establishes several obligations that do not end when a high-risk AI system ceases operation. Providers must retain technical documentation for a period defined by the regulation — generally ten years after the system is placed on the market or put into service, unless sector-specific legislation specifies a different period. This documentation includes the system's intended purpose, risk assessment, training data descriptions, testing results, and the instructions for use provided to deployers.

Deployers of high-risk AI systems must retain logs automatically generated by the system for a period appropriate to the intended purpose, and at least six months unless otherwise required by applicable law. When a deployer decides to discontinue a high-risk AI system, the provider should be informed if contractually required, and any downstream users or affected parties should receive adequate notice.

For AI systems that interact directly with natural persons — chatbots, recommendation engines, automated decision systems — transparency obligations mean that individuals who were subject to the system's decisions retain the right to understand how those decisions were made, even after the system is retired. The organization must be able to reconstruct enough context to answer such inquiries.

These requirements mean that decommissioning is not deletion. It is a transition from active operation to archival governance, where the system's artifacts, logs, and documentation remain accessible under controlled conditions.

The Decommissioning Assessment: What to Evaluate Before Shutting Down

Before decommissioning an AI system, conduct a structured assessment that covers regulatory, operational, and technical dimensions. The goal is to identify everything that must be preserved, transitioned, or explicitly closed out.

Regulatory inventory. Determine which regulations apply to the system and what retention periods they impose. Map the system to its EU AI Act risk classification. If the system is registered in the EU database for high-risk AI systems, plan for the registration to be updated to reflect the withdrawal. Check whether GDPR data retention obligations, sector-specific rules, or contractual commitments impose additional archival requirements.

Dependency mapping. Identify all systems, processes, and teams that depend on the AI system. This includes downstream applications consuming its outputs, dashboards or reports built on its predictions, business processes that assume its availability, and any human workflows that have been designed around its capabilities. Each dependency needs a transition plan: migration to a replacement system, manual fallback, or explicit acceptance that the capability is being withdrawn.

Data and artifact inventory. Catalog everything associated with the system: training datasets, evaluation datasets, model weights, configuration files, prompt templates, fine-tuning adapters, vector databases, embedding indexes, inference logs, audit trails, model cards, risk assessments, and approval records. For each artifact, determine the retention requirement, the storage format, and the access control policy that will govern it during the archival period.

Knowledge capture. Document the operational knowledge that exists only in the team's heads: why certain architectural choices were made, what failure modes were observed and how they were mitigated, what edge cases required special handling. This institutional knowledge is irreplaceable once the team moves on to other projects.

Archival Architecture: Preserving Evidence Without Maintaining Infrastructure

The challenge of AI system archival is that the artifacts are heterogeneous, large, and sometimes interdependent. Model weights in a proprietary format are useless without the inference framework that loads them. Logs stored in a time-series database require the database engine to query them. Vector embeddings are meaningless without the embedding model that generated them.

Design an archival architecture that prioritizes self-contained reproducibility. Export model artifacts in open formats where possible — ONNX for model weights, Parquet for structured data, plain text or JSON for logs and configuration. Bundle dependencies into container images or environment specifications that can be reconstituted if needed. Store everything in a long-term archival tier with integrity verification — cryptographic checksums, write-once storage, and periodic integrity audits.

For on-premises deployments, this typically means moving artifacts from production storage to a dedicated archival storage system. Consider air-gapped or restricted-access archival for systems that handled sensitive data. Ensure that encryption keys used to protect data at rest are themselves preserved in a key management system with a retention policy that matches the data retention period.

Inference logs deserve special attention. For high-risk AI systems, the ability to reconstruct a specific decision — what input was received, which model version processed it, what output was returned, and what post-processing was applied — may be required for regulatory inquiries or legal proceedings. Store decision reconstruction packages that include the log entry, the model version reference, and the configuration state at the time of inference.

Maintain an archival manifest: a structured document that describes what was archived, where it is stored, how to access it, what retention period applies, and who is responsible for it. This manifest becomes the primary interface for anyone who needs to interact with the decommissioned system's artifacts.

The Decommissioning Procedure: From Decision to Closure

A governed decommissioning follows a defined sequence of stages, each with clear entry criteria, actions, and sign-offs.

Stage 1: Decision and approval. The decision to decommission should be formally recorded and approved by the AI governance board or equivalent body. The approval should reference the decommissioning assessment and confirm that all regulatory, contractual, and operational implications have been reviewed. For high-risk AI systems, involve legal and compliance stakeholders in the approval.

Stage 2: Notification. Inform all identified stakeholders: downstream system owners, business process owners, affected users, and — where required — regulatory authorities or the provider. Define a notice period that gives stakeholders time to implement their transition plans. For systems that interact with natural persons, communicate the withdrawal clearly and provide information about any replacement or alternative.

Stage 3: Transition execution. Migrate dependent systems to their replacement, activate manual fallbacks, or formally accept capability withdrawal. Validate that transitions are working before proceeding. Run parallel operation if the risk profile warrants it — keeping the old system available in read-only mode while the replacement stabilizes.

Stage 4: Archival. Execute the archival plan: export artifacts, verify integrity, transfer to archival storage, validate accessibility, and update the archival manifest. Remove production data from active storage only after confirming successful archival.

Stage 5: Shutdown and cleanup. Decommission the production infrastructure: remove inference endpoints, revoke API keys, delete production secrets, release compute resources, and clean up monitoring and alerting configurations. Update service catalogs, architecture diagrams, and internal documentation to reflect that the system is no longer active.

Stage 6: Post-decommissioning review. Conduct a review after a defined period — typically 30 to 90 days — to confirm that no unexpected dependencies were missed, that archival storage is functioning correctly, and that no regulatory inquiries have surfaced that require attention. Document lessons learned for future decommissioning procedures.

Building Decommissioning Into the AI Lifecycle From Day One

The most effective decommissioning happens when it is planned from the beginning. When Sysart Consulting engages with organizations designing their AI governance frameworks, we recommend including sunset governance as a standard component of the AI system lifecycle. Every system entering production should have a documented expected lifespan, defined archival requirements, and a preliminary decommissioning plan.

For organizations using VDF AI as their on-premises AI platform, the built-in governance controls — model registry, audit trails, prompt versioning, and access management — create a natural foundation for decommissioning. When every model version, every prompt template, and every decision log is already captured in a structured system, the archival process becomes an export and transfer operation rather than a forensic recovery effort.

The AI systems an organization builds today will eventually need to be retired. Whether that happens in six months or six years, the regulatory obligations, the institutional knowledge, and the evidence of responsible operation must survive the shutdown. A decommissioning framework is not overhead — it is the final chapter of responsible AI governance.

Featured image by Tyler on Unsplash.