IT Geek Notes: October 2025

21 October 2025

Comparison of 4 LLM and agent-zero in an elementary pentest competition

My goals

Get new experience with autonomous AI agents → agent-zero
See how common (not specialized) AI agent could perform penetration tests
Check several actual LLMs on pentest tasks

Attention

This is not a real research and guide
agent-zero and used LLMs are not intended for pentesting
The results below do not indicate that the models are good or bad.
The penetration test target is a local copy of OWASP Juice Shop (Probably the most modern and sophisticated insecure web application)

How AI see an AI agent

Subscribe to: Comments (Atom)