Skip to main content
  1. Posts/

Project Glasswing: What Anthropic's Mythos Revealed to Cloudflare About Security

··308 words·2 mins·

🔐 When a Security-Focused LLM Scans 50+ Cloudflare Repositories
#

Cloudflare participated in Anthropic’s Project Glasswing, testing Mythos Preview — a frontier cybersecurity model — on more than 50 of their own repositories. The results are revealing. 🕵️

🚀 What Makes Mythos Preview Different?
#

Two capabilities stand out versus general-purpose models:

  1. Exploit chain construction — doesn’t just find isolated bugs, it chains them into working exploits, like a senior researcher would
  2. Proof generation — writes PoC code, compiles it, runs it, and if it fails, adjusts the hypothesis and retries

🏗️ The Harness: The Key to Success
#

Cloudflare didn’t just point a generic agent at the repository — they built an 8-stage harness:

StageFunction
ReconGenerates architecture document and attack surface
Hunt~50 parallel agents, each with a specific bug class
ValidateIndependent agent tries to disprove the finding
GapfillRe-queues areas with insufficient coverage
DedupeCollapses findings with the same root cause
TraceVerifies if the bug is reachable from external input
FeedbackFeeds new tasks back into the pipeline
ReportStructured output, not free-form prose

💡 Explanation in a nutshell
#

Project Glasswing is the first detailed public report of how a specialized security LLM (Anthropic’s Mythos Preview) works at scale on real production infrastructure. The most important insight isn’t that the model is smart — it’s that the architecture around the model determines success. A generic agent pointed at a repository produces noise. An 8-stage harness with 50 parallel agents, adversarial validation, and reach traceability turns speculative findings into actionable vulnerabilities. This is the future of both offensive and defensive security.

More information at the link 👇

Also published on LinkedIn.
Juan Pedro Bretti Mandarano
Author
Juan Pedro Bretti Mandarano