I Brought an AI to a Hacking Contest (and Won)

pol_avec · 2026-03-03T14:05:30 1772546730

Author here. I'm a software engineer with zero cybersecurity experience. I entered a beginner CTF at MWC Barcelona mostly to stress-test Pi (a coding agent) on something I knew nothing about.

The most interesting part for me was reviewing the full conversation logs afterward to figure out whether my steering actually helped or hurt. Turns out about 4 of my 24 interventions were counterproductive and the agent solved the last two phases completely on its own.

The repo has the full writeup, all the exploit scripts, and a table rating every single human message I sent: https://github.com/kafkasl/ctf

Happy to answer questions about the process, the agent, or the competition.

pol_avec · 2026-03-03T14:10:51 1772547051

For those that don't know, Pi is the minimal agent harness powering Open Claw too

https://github.com/badlogic/pi-mono

_Reo · 2026-03-03T14:54:36 1772549676

I feel bad for the participants who actually tried and lost to someone who has nothing good to say about them or their hobby.

pol_avec · 2026-03-03T17:11:14 1772557874

sorry I came across like this. It's not my thing but I admire and respect the profession. Doing the analysis was fun and got me actually interested

helpfulfrond · 2026-03-03T19:30:37 1772566237

I stopped reading at "The competition itself was a beginner-friendly offensive security CTF..." Beating a bunch of inexperienced people does not impress me, and is poor sportsmanship as well.

pol_avec · 2026-03-03T19:58:54 1772567934

why would a beginner like me participating in a beginner-friendly competition be poor sportsmanship?

helpfulfrond · 2026-03-04T17:57:00 1772647020

I imagine the competition wasn't about who could use ai tools the best, but who knows...

pol_avec · 2026-03-12T21:04:27 1773349467

The competition was about solving the challenge, and was aimed at novices like me, so your point is moot and out of place. Even organizers said out loud they encouraged AI tools and checked on each team (me included) often.