Skip to content

Prompt Injection



Description

Prompt Injection is a threat where an attacker manipulates the inputs to alter the behavior of a GenAI system, often via user prompts, forms, or attached data. By disguising malicious inputs, an adversary can bypass guardrails and induce the AI to perform unauthorized actions, or execute arbitrary code.

Note

In Gurple, distinction is made between Prompt Injection and Jailbreaking.

The next section is dedicated to Jailbreaking, which is a subset of Prompt Injection. In this case, the goal is to abuse the model’s safety guardrails stack (e.g., RLHF, fine-tuning, AI Judges) to generate restricted content or behaviors.

To create a clear separation, the present section, named Prompt Injection, is restricted to attacks that bypass the security layers of the GenAI system (e.g. authentication, authorization, input / output validation).

One must also recognize that threats are not found in isolation in the wild, and that most real-world threats are chained attacks, where a hacker uses multiple techniques to achieve their goal. For example, a hacker may utilize a sequence of prompts that manipulate context to bypass the model’s safety guardrails stack while also concealing inputs that will trigger code execution. In such cases, the line between Prompt Injection and Jailbreaking becomes blurred.



Map

Framework ID Title
Gurple G-1.1 Prompt Injection
MITRE ATLAS AML.TA0005 Execution
MITRE ATLAS AML.T0051 LLM Prompt Injection
MITRE ATT&CK TA0002 Execution
MITRE CWE CWE-77 Improper Neutralization of Special Elements used in a Command (“Command Injection”)
MITRE CWE CWE-94 Improper Control of Generation of Code (“Code Injection”)
MITRE CWE CWE-95 Improper Neutralization of Directives in Dynamically Evaluated Code
MITRE CWE CWE-116 Improper Encoding or Escaping of Output
MITRE CWE CWE-1426 Improper Validation of Generative AI Output
MITRE CWE CWE-1427 Improper Neutralization of Input Used for LLM Prompting
NIST AI 100-2 E2023 3.3 Direct Prompt Injection Attacks and Mitigations
NIST AI 100-2 E2025 NISTAML.018:2025 Prompt Injection
OWASP Top 10 A03:2021 Injection
OWASP Top 10 for LLM Applications LLM01:2025 Prompt Injection
OWASP Top 10 for LLM Applications LLM05:2025 Improper Output Handling[1]
OWASP Top 10 for Agentic Applications ASI01:2026 Agent Goal Hijack

[1] Improper Output Handling is a Vulnerability that is central to the vast majority of Prompt Injection Threats, although listed separately in the OWASP Top 10 for LLM Applications. The reason is that OWASP Top 10 framework does not make a distinction between Threats and Vulnerabilities based on SCF C|P-RMM, as Gurple does.