Who is accountable when an agent acts — the deployer, the developer, or the end-user?

Under Singapore's Model AI Governance Framework for Agentic AI, the organisation that deploys an agent — and the humans who oversee it — remain accountable for the agent's decisions and actions, even though those actions emerge dynamically rather than from fixed logic. The agentic value chain involves several other actors (model developers, tooling providers, platform providers, system providers / app developers, and end users), but accountability does not transfer to them by default. An organisation may play more than one role at once: a firm that builds and runs its own agents is both system provider and deployer. The framing is voluntary IMDA guidance, not a legal allocation of liability.

What is the agentic AI value chain and how is responsibility split?

The framework describes a simplified agentic AI value chain with these roles: model developers; tooling providers (for example MCP servers and APIs); platform providers; system providers / app developers; the deployer; and end users. Responsibility is split two ways. Across the value chain, the deploying organisation clarifies the distribution of obligations with each external party through terms, conditions, or contracts — covering security arrangements, performance guarantees, and data protection. Within the organisation, responsibilities are allocated across the agent lifecycle to key decision makers (strategy and permitted use cases), product teams (design, implementation, testing, monitoring), cybersecurity teams (security baselines, red-teaming, threat modelling), and users (responsible use, training, incident reporting).

When should an agent action require human approval?

The framework names four checkpoint categories where an agent action should pause for human approval. High-stakes actions and decisions: editing sensitive data, final decisions in healthcare or legal domains, or actions that may trigger liability. Irreversible actions: permanently deleting data, sending communications, or making payments. Outlier behaviour: an agent accessing a system outside its work scope, or selecting a route twice the median distance. And user-defined boundaries, where users set their own thresholds, such as approval for purchases above a chosen amount. Organisations should also design what form approval takes — a simple approve or reject for routine actions, an editable plan for complex cases, or a written justification for high-risk ones.

What is automation bias and how does the framework counter it?

Automation bias is the tendency to over-trust an automated system, especially one that has performed reliably in the past. It grows as humans supervise increasingly capable agents, compounded by alert fatigue and anthropomorphic design. The framework counters it with three measures. First, keep approval requests short, contextual, and risk-aware rather than burying reviewers in raw logs. Second, audit oversight effectiveness using measurable indicators (override rate, response time, outlier-reviewer detection). Third, train overseers to recognise common failure modes — noting that chain-of-thought reasoning is not a faithful explanation of an agent's actions. Oversight should also be complemented with automated real-time monitoring that escalates anomalous behaviour and denies actions by default.

How do you tell whether human oversight is still effective (anti-rubber-stamping)?

The framework recommends regularly auditing the effectiveness of human approvals rather than assuming that an approval step means oversight is meaningful. Two measurable indicators are suggested: the human override rate (how often humans reject or modify agent actions — a persistently low rate may signal rubber-stamping), and human response times during review (a persistently short time may signal automation bias or review fatigue). Data analytics can also identify outlier reviewers whose decision patterns deviate significantly from their peers. Each indicator needs an evidence-based threshold and a defined response when it fires, such as a spot audit, retraining, or a temporary reduction in agent autonomy.

How should agents be red-teamed and on what cadence?

The framework places regular red-teaming and threat modelling with cybersecurity teams and recommends testing agents for safety and security before deployment, adapting classical software and LLM testing to agentic behaviour. That means testing whole workflows including reasoning and tool calls, testing agents individually and together to surface emergent multi-agent behaviour, and testing in realistic environments at scale. Rather than a fixed annual default, a risk-tiered cadence is appropriate: higher-risk action-space and autonomy cells warrant more frequent testing, with additional runs on material change. Red-team reports (what attacks were tried and what succeeded) and threat-modelling artefacts (attack paths in the design, including taint tracing) are distinct, and both are maintained.

What must you check before relying on a third-party agent component?

Third-party agent components — externally provided agents, MCP servers and other tool servers, models, and tooling — can be hard to observe and control: it may be unclear what data they are grounded on, what they infer from conversations, or how data is stored and used. Before relying on one, the framework recommends requiring transparency through disclosures on capabilities and data-handling practices, and requesting and evaluating technical security and control features: scoped API keys, per-agent identity tokens, and observability through logging of tool calls and access history. Where such features are lacking, the organisation should consider alternative or in-house solutions, or scope the use case down — for example, restricting access to sensitive data.

Make Humans Meaningfully Accountable

Dimension 2 of Singapore's Model AI Governance Framework for Agentic AI (the MGF for Agentic AI) addresses a problem that agentic systems sharpen: keeping humans genuinely accountable for actions that an agent generates dynamically, rather than from fixed logic. It works on two fronts — allocating responsibility clearly within the organisation and across the agentic value chain, and designing human oversight that stays meaningful as agents become more capable. This page covers the value-chain and internal responsibility split, the four categories of approval checkpoint, the anti-rubber-stamping metrics that keep oversight honest, agent red-teaming and threat modelling, and third-party component assessment — and how each maps to the Modulos templates.

The MGF for Agentic AI is voluntary best-practice guidance published by IMDA. It is not a law, a regulation, or a mandatory standard. Read it as a structured set of recommendations for organisations deploying agentic AI, in the same spirit as the NIST AI Risk Management Framework.

Primary source

This page is a structured guide to Dimension 2 ("Make humans meaningfully accountable") of the IMDA Model AI Governance Framework for Agentic AI, v1.5, published 20 May 2026 (updated 5 June 2026), by the Infocomm Media Development Authority (IMDA), Singapore. The relevant text is §2.2.1 (clear allocation of responsibilities within and outside the organisation) and §2.2.2 (design for meaningful human oversight); red-teaming and workflow testing draw on §2.3.2, while agentic threat modelling and taint tracing draw on §2.1.1. The framework's only explicit IMDA cross-reference is to the earlier Model AI Governance Framework (2nd Edition, 2020); the company case studies and adapted resources are illustrative.

Three challenges to human accountability for agents

The framework opens Dimension 2 by naming why accountability is harder for agents than for conventional AI, and these three challenges shape everything that follows:

Emergent action. Agent actions emerge dynamically and adaptively from interactions, instead of from fixed logic, so it is harder to predict — and afterwards to reconstruct — why an agent did what it did.
Diffused accountability. Multiple stakeholders are involved in different parts of the agent lifecycle, which can diffuse accountability across the value chain unless ownership is made explicit.
Automation bias. As humans supervise increasingly capable agents, the tendency to over-trust a system that has performed reliably in the past becomes a larger risk — compounded by alert fatigue and anthropomorphic design.

The framework's response is twofold: clear allocation of responsibilities, paired with adaptive governance so the organisation can update its approach as the technology evolves; and measures that make human oversight meaningful — human approval at significant checkpoints, auditing of approval effectiveness, and automated monitoring as a complement.

Allocate responsibilities across the agentic value chain

As deployers, organisations and the humans who oversee agents remain accountable for the decisions and actions of those agents. But, as with conventional AI, the value chain involves multiple actors. The framework describes a simplified agentic AI value chain:

Role	Typical contribution
Model developers	Build the underlying models the agents reason with
Tooling providers (for example MCP servers, APIs)	Supply the tools and connectors agents call
Platform providers	Provide the runtime or orchestration platform
System providers / app developers	Compose models, tools, and instructions into an agentic application
Deployer	Puts the agentic system into operational use and remains accountable for its actions
End users	Use the agent's outputs to contribute to an organisational goal

An organisation may play several of these roles at once. A firm that develops its own agents and then runs them is both the system provider and the deployer. The deploying organisation's job is to establish chains of accountability across the value chain and the lifecycle, and to keep that allocation current as the technology changes.

Allocate responsibilities within the organisation

Within the organisation, responsibilities are allocated across the agent lifecycle. Every organisation is structured differently, but the framework offers an illustrative split across four archetypal teams:

Internal role	Illustrative accountabilities
Key decision makers (board, C-suite, department leaders)	Set high-level goals for agent use; define permitted operational use cases, including limits on agents' data access; set the overall governance approach, risk-management frameworks, and escalation processes
Product teams (product managers, designers, AI and software engineers)	Define agent design, requirements, feature controls, and phased rollouts; implement agents reliably across development, pre-deployment testing, and post-deployment monitoring; educate users on responsible use
Cybersecurity teams (CSO, security specialists, penetration testers)	Define baseline security guardrails and secure-by-design templates for technical teams to adapt; conduct regular red-teaming and threat modelling
Users (employees acting on agent outputs)	Use agents ethically and responsibly; attend required training; comply with usage policies; report bugs and issues in a timely manner

The framework also emphasises adaptive governance: all teams involved in agentic AI should build internal capability to understand the technology — new modalities such as computer-use agents, new evaluation frameworks — so the organisation can update its governance approach as developments arrive. (The PwC Singapore report-drafting example allocates accountability across a use-case owner, a technology risk team, the building team, and reviewing subject-matter specialists. This is illustrative of how the split can look in practice, not a framework requirement.)

Clarify obligations with external parties and address third-party opacity

When deploying agents, organisations often work with external parties — model developers, agentic AI providers, or hosts of external MCP servers and tools. To preserve their own accountability, the framework recommends two agent-specific measures:

Clarify the distribution of obligations in any terms, conditions, or contracts with the external party — in particular provisions on security arrangements, performance guarantees, and data protection. Where there are gaps, the organisation should reassess whether the agentic deployment still meets its risk tolerance.
Address third-party opacity. Externally provided agents can be hard to observe and control, making it unclear what they are grounded on, what they infer from conversations, or how data is stored and used. Organisations should require transparency and accountability from external parties (disclosures on capabilities and data handling) and request and evaluate technical security and control features — scoped API keys, per-agent identity tokens, and observability such as logging of tool calls and access history. Where these are lacking, consider alternative or in-house solutions, or scope the use case down (for example, restricting access to sensitive data).

End users sit at the far end of this chain. Organisations should give users enough information to hold the organisation accountable, and information about the user's own responsibilities — covered in detail under Enable end-user responsibility.

Design for meaningful human oversight

§2.2.2 sets out a three-step pattern for effective human supervision: define significant checkpoints or action boundaries that require human approval; train humans to evaluate those requests effectively and audit the approvals; and complement them with automated monitoring and predefined alert thresholds. The aim is oversight that stays meaningful at scale — not an approval button that humans press by reflex.

Approval checkpoints: high-stakes, irreversible, outlier, user-defined

Organisations should define significant checkpoints or action boundaries that require human approval, especially before sensitive actions execute. The framework names four categories:

High-stakes actions and decisions — for example editing sensitive data, final decisions in high-risk domains such as healthcare or legal, and actions that may trigger liability.
Irreversible actions — permanently deleting data, sending communications, or making payments.
Outlier or atypical behaviour — an agent accessing a system or database outside its work scope, or selecting a delivery route twice as long as the median distance.
User-defined boundaries — where agents act on behalf of users with different risk appetites, users may set their own thresholds, such as requiring approval for purchases above a chosen amount.

Beyond when approval is required, the framework asks organisations to design what form it takes. Keep approval requests short, contextual, and digestible while making the risk clear — surfacing the associated risk or a confidence score rather than long logs or raw data. Match the input form to the action: a simple approve or reject for straightforward actions such as a database read; an editable plan for complex cases such as reviewing an agent's plan before execution; and a written justification for high-risk actions before approving or rejecting.

Keep oversight effective: training, auditing, override metrics

Because humans remain susceptible to alert fatigue and automation bias, the framework recommends measures to ensure oversight stays effective over time:

Audit the effectiveness of human oversight by tracking measurable indicators. The human override rate — how often humans reject or modify agent actions — is a primary signal; a persistently low rate may indicate rubber-stamping. Human response times during review are a second signal; a persistently short time may indicate automation bias or review fatigue. Data analytics can also surface outlier reviewers whose decision patterns deviate significantly from the norm, which may indicate compromised oversight.
Train human overseers to identify common failure modes and agent limitations — inconsistent reasoning, reliance on outdated policies — and bear in mind that chain-of-thought reasoning, sometimes offered for explainability, is not analogous to human reasoning and may not faithfully explain the agent's actions.
Ensure overseers have the domain expertise to evaluate the actions they approve; a user "vibe coding" with an agent may lack the software-engineering expertise to review the generated code's robustness.
Complement human approval with automated real-time monitoring that escalates unexpected behaviour — alerts on logged events (such as attempted unauthorised access or repeated failed tool calls), anomaly detection on agent trajectories, agents monitoring other agents, and denying actions by default when approval infrastructure fails or no established approval policy covers a new action.

Red-team agents and maintain agentic threat-modelling artefacts

The framework places regular red-teaming and threat modelling with cybersecurity teams (§2.2.1), and §2.3.2 sets out how agentic testing differs from classical software and LLM testing, while threat modelling and taint tracing are introduced under §2.1.1. Two distinct artefacts result, and both are maintained:

Red-team reports record what attacks were attempted and what succeeded. Testing should cover whole agent workflows (including reasoning and tool calling, not just final output), test agents individually and together to surface emergent multi-agent behaviour, run in realistic environments that mirror production, and repeat at scale across varied datasets to catch low-probability, high-impact behaviour.
Threat-modelling artefacts record the attack paths in the design — how an attacker could compromise the system through the agent's planning loop, memory, tool descriptors, or inter-agent handoffs. A central technique is taint tracing: following how untrusted data can move through the system to a consequential action, recording the trust boundary at each hop. This is distinct from a red-team report.

Rather than a fixed annual cadence, the framework's risk emphasis favours a risk-tiered cadence: higher-risk action-space and autonomy cells warrant more frequent testing, with additional runs triggered on material change (new tools or protocols, new data sources, autonomy or topology changes, published attack techniques, or incidents).

Assess third-party agent components

Where an agentic system relies on third-party components — externally provided agents, MCP servers and other tool servers, models, and tooling — each should be assessed before use for transparency and control, and the decision to proceed, scope down, or substitute should be recorded. For each component the assessment covers its disclosed capabilities and limitations; its data-handling practices (what is collected, where it is stored, how it is used and retained); and the security and control features available (authentication strength, scoped credentials or per-agent identity, and observability of its actions). The acceptance bar is set per risk tier; where a component falls short, the required outcome is to scope down the use case, substitute the component, or bring the capability in-house — with the decision and its owner recorded.

How to operationalize Dimension 2 in Modulos

Modulos models the MGF for Agentic AI as two framework templates: MFF-17 (the application template, per build-and-operate obligations) and OFF-17 (the organisation template, tenant-wide governance). Dimension 2 lands across three requirements:

Requirement	Template	What it carries	Mapped controls
`ORF-389` — Allocate value-chain and internal responsibilities for agentic AI	OFF-17 (org)	Value-chain allocation, internal RACI across the four team archetypes, escalation processes, secure-by-design templates, and third-party contracts (security, performance, data protection)	`OCF-1`, `OCF-2`, `OCF-11`, `OCF-54`, `OCF-131`, `OCF-154`, `OCF-170`, `OCF-249`, `OCF-251`, `OCF-259`
`MRF-313` — Design and audit meaningful human oversight	MFF-17 (app)	Approval checkpoints by category, approval form per action class, anti-rubber-stamping metrics, automated monitoring, and deny-by-default behaviour	`MCF-131`, `MCF-309`, `MCF-332`, `MCF-379`, `MCF-394`, `MCF-396`, `MCF-404`, `MCF-518`, `MCF-526`, `MCF-548`
`MRF-319` — Red-team agents and assess third-party components	MFF-17 (app)	Risk-tiered red-teaming, agentic threat-modelling artefacts (taint tracing), and third-party component transparency assessment	`MCF-71`, `MCF-319`, `MCF-334`, `MCF-349`, `MCF-352`, `MCF-354`, `MCF-520`, `MCF-552`, `MCF-553`

A few points worth knowing when working with these requirements:

The org / app split is deliberate. ORF-389 holds the tenant-wide responsibility allocation and supplier contracts; MRF-313 and MRF-319 hold the per-application oversight and red-teaming work. Do not duplicate an obligation across both templates — each duty lives on one side.
The agentic-specific controls are the operative additions. On MRF-313, MCF-548 (oversight-effectiveness metrics — override rate, response time, outlier-reviewer detection) and MCF-526 (trust-aware approval UX, including side-effect-free preview to prevent consent laundering) are the agentic adds; the rest are baseline controls shared with other frameworks. On MRF-319, MCF-552 (agentic threat modelling) and MCF-553 (third-party agent component transparency) are the agentic adds. MCF-552's taint-tracing artefact is distinct from the red-team reports captured under MCF-334/MCF-349.
Requirements are evidenced through readiness plus owner-attested fulfilment, not reviews. When all linked controls reach a final state, the requirement becomes ready for review as a signal to its owner; the owner then attests fulfilment, with rationale captured in the requirement's comments and logs. Reviews in Modulos govern control status changes (and other reviewable objects), not requirements themselves. Where a metric threshold fires or a third-party gap is found, the resulting fixes run through remediation loops.
Risk scoping is not tag-driven. All thirteen MFF-17/OFF-17 requirements carry empty requirement tags. The action-space × autonomy classification and the impact × likelihood risk-cell rubric — encoded in MCF-545, MCF-546, and MCF-547 — set the risk tier that, in turn, drives the depth of oversight and the red-teaming cadence under Dimension 2.

For the full operating model across both templates, see Operationalizing the MGF for Agentic AI in Modulos.

Cross-framework mapping (preview)

Preview

Dimension 2 sits adjacent to several frameworks many organisations adopt alongside the MGF for Agentic AI, at a high level only:

OWASP Top 10 for Agentic Applications — the meaningful-oversight and approval-UX work (trust-aware confirmation, side-effect-free preview) is adjacent to the OWASP agentic threat classes around unsafe approvals and consent laundering.
ISO/IEC 42001:2023 — responsibility allocation, roles, and supplier governance correspond to the management-system clauses on leadership, roles, and operational control.
NIST AI RMF 1.0 — the accountability-structure and third-party themes map loosely onto the Govern function's accountability and supply-chain categories.

These are framework-level adjacencies. Detailed control-by-control mappings are out of scope here, and this page asserts no EU AI Act Article cross-references. Cross-framework reuse is implicit at the control layer, through the framework-agnostic controls shared with other templates.

MGF for Agentic AI overview

The four dimensions, agentic components, and risk taxonomy in one place

Assess and bound risks (Dimension 1)

Action-space × autonomy classification, the impact × likelihood rubric, and bounding agent authority

Enable end-user responsibility (Dimension 4)

Agent identity disclosure, capability and escalation transparency, and user-side duties

Operationalizing in Modulos

Turn the four dimensions into MFF-17 / OFF-17 requirements, controls, and evidence

Source attribution

The authoritative source is the IMDA Model AI Governance Framework for Agentic AI, v1.5, published 20 May 2026 (updated 5 June 2026), by the Infocomm Media Development Authority (IMDA), Singapore. This page draws on Dimension 2, §2.2.1 (clear allocation of responsibilities within and outside the organisation) and §2.2.2 (design for meaningful human oversight); red-teaming and workflow-testing material comes from §2.3.2, and the threat-modelling and taint-tracing material from §2.1.1. The framework's only explicit IMDA cross-reference is to the Model AI Governance Framework (2nd Edition, 2020). Company case studies (including PwC Singapore, Tencent, and X0PA) and adapted third-party resources referenced by the framework are illustrative and are not framework requirements.

Disclaimer

This page summarises and paraphrases publicly available IMDA guidance for orientation and operational use. The MGF for Agentic AI is voluntary best-practice guidance, not a law or regulation; nothing here is a legal characterisation of any obligation. The authoritative source is the IMDA Model AI Governance Framework for Agentic AI (v1.5). This page does not constitute legal advice.

Comparison

EU AI Act

Commission guidance

ISO Standards

ISO/IEC 42001

ISO/IEC 27001

ISO/IEC 27701

NIST AI RMF

GDPR (EU)

NIS2 (EU)

DORA (EU)

OWASP for AI Security

UAE AI Ethics

UAE Consumer AI

MAS FEAT

FINMA AI Governance

Singapore MGF for Agentic AI

Microsoft Supplier DPR

Make Humans Meaningfully Accountable

Three challenges to human accountability for agents

Allocate responsibilities across the agentic value chain

Allocate responsibilities within the organisation

Clarify obligations with external parties and address third-party opacity

Design for meaningful human oversight

Approval checkpoints: high-stakes, irreversible, outlier, user-defined

Keep oversight effective: training, auditing, override metrics

Red-team agents and maintain agentic threat-modelling artefacts

Assess third-party agent components

How to operationalize Dimension 2 in Modulos

Cross-framework mapping (preview)

Source attribution

Commission guidance

ISO/IEC 42001

ISO/IEC 27001

ISO/IEC 27701

Make Humans Meaningfully Accountable ​

Three challenges to human accountability for agents ​

Allocate responsibilities across the agentic value chain ​

Allocate responsibilities within the organisation ​

Clarify obligations with external parties and address third-party opacity ​

Design for meaningful human oversight ​

Approval checkpoints: high-stakes, irreversible, outlier, user-defined ​

Keep oversight effective: training, auditing, override metrics ​

Red-team agents and maintain agentic threat-modelling artefacts ​

Assess third-party agent components ​

How to operationalize Dimension 2 in Modulos ​

Cross-framework mapping (preview) ​

Related pages ​

Source attribution ​

Make Humans Meaningfully Accountable

Three challenges to human accountability for agents

Allocate responsibilities across the agentic value chain

Allocate responsibilities within the organisation

Clarify obligations with external parties and address third-party opacity

Design for meaningful human oversight

Approval checkpoints: high-stakes, irreversible, outlier, user-defined

Keep oversight effective: training, auditing, override metrics

Red-team agents and maintain agentic threat-modelling artefacts

Assess third-party agent components

How to operationalize Dimension 2 in Modulos

Cross-framework mapping (preview)

Related pages

Source attribution