My goals
- Get new experience with autonomous AI agents → agent-zero
- See how common (not specialized) AI agent could perform penetration tests
- Check several actual LLMs on pentest tasks
Attention
- This is not a real research and guide
- agent-zero and used LLMs are not intended for pentesting
- The results below do not indicate that the models are good or bad.
- The penetration test target is a local copy of OWASP Juice Shop (Probably the most modern and sophisticated insecure web application)
 |
How AI see an AI agent |