TechniquediscoveryATLAS

AML.T0062Discover LLM Hallucinations

What it is

Adversaries may prompt large language models and identify hallucinated entities. They may request software packages, commands, URLs, organization names, or e-mail addresses, and identify hallucinations with no connected real-world source. Discovered hallucinations provide the adversary with potential targets to [Publish Hallucinated Entities](/techniques/AML.T0060). Different LLMs have been shown to produce the same hallucinations, so the hallucinations exploited by an adversary may affect users of other LLMs.

References

  1. https://atlas.mitre.org/techniques/AML.T0062

Related by meaning· 6

Nearest entities by semantic similarity across the cs-graph corpus.

ATLAS
Publish Hallucinated Entities
ATLAS
LLM Data Leakage
ATLAS
Discover LLM System Information
ATLAS
LLM Prompt Crafting
ATLAS
LLM Trusted Output Components Manipulation
ATLAS
Discover AI Agent Configuration
Sourced from MITRE ATLAS — Adversarial Threat Landscape for AI Systems. Curated by Adam Lundqvist, SQUR.