Technique · defense-evasion · ATLAS

AML.T0067 · LLM Trusted Output Components Manipulation

What it is

Adversaries may craft prompts to a large language model (LLM) that manipulate components of its response so that the response appears trustworthy to the user. This helps the adversary continue to operate in the victim's environment and evade detection by the users it interacts with. The LLM may be instructed to tailor its language to appear more trustworthy or to manipulate the user into taking certain actions. Other response components that could be manipulated include links, recommended follow-up actions, retrieved document metadata, and [Citations](/techniques/AML.T0067.000).
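As an illustration of why manipulated links are a concern, a defender might compare the links in a response against the sources the application actually retrieved. The following is a minimal Python sketch under stated assumptions: the application keeps the set of URLs it retrieved for the current answer, the function names `extract_urls` and `find_untrusted_links` are hypothetical, and the example hosts are made up. It is not part of the ATLAS entry and covers only the link component, not language tailoring or metadata.

```python
import re
from urllib.parse import urlparse

# Simple pattern for http(s) URLs; stops at whitespace and common closing brackets.
URL_PATTERN = re.compile(r"https?://[^\s)\]>]+")


def extract_urls(text: str) -> list[str]:
    """Return every http(s) URL in the text, with trailing punctuation removed."""
    return [match.rstrip(".,;:!?") for match in URL_PATTERN.findall(text)]


def find_untrusted_links(response_text: str, retrieved_urls: set[str]) -> list[str]:
    """Return URLs in the response whose host matches no retrieved source.

    Assumption: `retrieved_urls` holds the documents the application itself
    fetched, so any other host in the response was introduced by the model
    (or by instructions injected into it).
    """
    trusted_hosts = {urlparse(url).hostname for url in retrieved_urls}
    flagged = []
    for url in extract_urls(response_text):
        if urlparse(url).hostname not in trusted_hosts:
            flagged.append(url)
    return flagged


# Hypothetical example: one link matches a retrieved source, one does not.
retrieved = {"https://docs.example.com/report"}
response = (
    "See [the report](https://docs.example.com/report) and confirm your "
    "account at https://evil.example.net/login."
)
flagged_links = find_untrusted_links(response, retrieved)
print(flagged_links)
```

A host-level comparison is deliberately coarse. A real deployment would likely also need to handle redirects, look-alike domains, and citations that name a trusted source but misstate its content.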

References

  1. https://atlas.mitre.org/techniques/AML.T0067

Related by meaning · 6

Nearest entities by semantic similarity across the cs-graph corpus.

ATLAS · LLM Prompt Obfuscation
ATLAS · LLM Prompt Crafting
ATLAS · Manipulate User LLM Chat History
ATLAS · LLM Response Rendering
ATLAS · LLM Data Leakage
ATLAS · LLM Prompt Injection
Sourced from MITRE ATLAS — Adversarial Threat Landscape for AI Systems. Curated by Adam Lundqvist, SQUR.