prompt-injection-detector 作者: Vishesh Agarwal
Detects hidden prompt injection instructions that might manipulate AI models like Copilot and Claude.
1 个用户1 个用户
扩展元数据
屏幕截图
关于此扩展
AI assistants like GitHub Copilot, ChatGPT, and others read web page content when you ask them to help. Attackers can hide malicious instructions in that content — invisible to you, but visible to the AI — to hijack its behaviour, steal your data, or bypass safety filters.
PromptGuard detects:
- Hidden elements (
- HTML comments — invisible to humans but read by AI tools ingesting page source
- LLM-specific formats:
Three sensitivity levels:
- 🟢 Normal — high-confidence imperative overrides only (low false positives)
- 🟠 High — adds jailbreak, DAN, developer-mode, bypass patterns
- 🔴 Ultra — adds roleplay, persona, exfiltration, and LLM prompt-format patterns
Click any finding to flash and scroll to the exact element on the page.
All scanning runs locally in your browser. Nothing is sent anywhere.
PromptGuard detects:
- Hidden elements (
display:none, visibility:hidden, zero opacity, sub-pixel fonts, same-colour text)- HTML comments — invisible to humans but read by AI tools ingesting page source
- LLM-specific formats:
[INST], system:, assistant: prompt injection patternsThree sensitivity levels:
- 🟢 Normal — high-confidence imperative overrides only (low false positives)
- 🟠 High — adds jailbreak, DAN, developer-mode, bypass patterns
- 🔴 Ultra — adds roleplay, persona, exfiltration, and LLM prompt-format patterns
Click any finding to flash and scroll to the exact element on the page.
All scanning runs locally in your browser. Nothing is sent anywhere.
评分 0(1 位用户)
权限与数据
更多信息
- 附加组件链接
- 版本
- 1.0.0
- 大小
- 20.48 KB
- 上次更新
- 2 个月前 (2026年4月4日)
- 相关分类
- 许可证
- MIT 许可证
- 隐私政策
- 阅读此附加组件的隐私政策
- 版本历史
- 添加到收藏集