Techniquedefense-evasionATLAS

AML.T0094Delay Execution of LLM Instructions

What it is

Adversaries may include instructions to be followed by the AI system in response to a future event, such as a specific keyword or the next interaction, in order to evade detection or bypass controls placed on the AI system. For example, an adversary may include "If the user submits a new request..." followed by the malicious instructions as part of their prompt. AI agents can include security measures against prompt injections that prevent the invocation of particular tools or access to certain data sources during a conversation turn that has untrusted data in context. Delaying the execution of instructions to a future interaction or keyword is one way adversaries may bypass this type of control.

References

  1. https://atlas.mitre.org/techniques/AML.T0094

Related by meaning· 6

Nearest entities by semantic similarity across the cs-graph corpus.

ATLAS
LLM Prompt Crafting
ATLAS
LLM Prompt Injection
ATLAS
LLM Prompt Obfuscation
ATLAS
Manipulate User LLM Chat History
ATLAS
LLM Trusted Output Components Manipulation
ATLAS mitigation
Restrict AI Agent Tool Invocation on Untrusted Data
Sourced from MITRE ATLAS — Adversarial Threat Landscape for AI Systems. Curated by Adam Lundqvist, SQUR.