Claude Mythos — The Vulnerability of WordPress + AI

In my previous article, I analyzed why Fed Chair Powell and Treasury Secretary Bessent urgently summoned bank CEOs. I showed how the financial system's three-layer structure (COBOL, Copilot, SWIFT) is structurally vulnerable to Mythos-class autonomous AI.

But this problem extends far beyond finance. The exact same structure exists in the web.

WordPress powers approximately 43% of all websites in the world — between 472 million and 605 million sites. In the CMS market alone, it holds a dominant 60% share (W3Techs, April 2026). This is a monoculture on the same scale as Microsoft's desktop monopoly.

And just as Microsoft force-integrated Copilot into every product, the same thing is now happening in the WordPress world — the mass adoption of AI agents.

WordPress Monoculture — The Web's Single Crop

This massive monoculture is sustained by over 61,000 to 70,000 plugins registered in the official directory. But this ecosystem has become a breeding ground for attack surfaces.

The numbers from security firm Patchstack are staggering. New WordPress-related vulnerabilities are exploding. In 2024, 7,966 were reported — a 34% increase year over year. In 2025, that jumped another 42% to 11,334 per year. 97% originate from third-party plugins and themes. WordPress core accounts for just 0.2%. The total number of tracked vulnerabilities across the WordPress ecosystem has reached 64,782 — the largest for any single CMS in history.

It gets worse. In 2025, 57.6% of discovered vulnerabilities required no authentication whatsoever. External attackers can exploit them without credentials. And 46% of plugin vulnerabilities had no patch available at the time of public disclosure — zero-day by default.

The defense side is failing. In controlled tests, WAFs (Web Application Firewalls) provided by five major hosting providers were tested against 11 known vulnerability exploits. The hosting security layers failed to block 87.8% of attacks, allowing them to pass through.

WordPress's plugin architecture shares the same structural flaw as Microsoft's Copilot. When a plugin is installed and activated, it runs at the exact same privilege level as WordPress core. It holds full SELECT, INSERT, UPDATE, and DELETE permissions on the database. There is no internal sandbox. A single plugin vulnerability becomes an unrestricted access route to the entire site — customer data, payment information, admin credentials. If one layer of the application is breached, everything falls.

Map this against the financial system's three-layer structure and the pattern becomes clear.

The financial system's first layer, COBOL (legacy foundation), corresponds to WordPress core + MySQL — decades of accumulated PHP code. The second layer, Copilot (non-deterministic AI), corresponds to WordPress AI plugins. The third layer, SWIFT (global connectivity), corresponds to the internet itself.

Here lies the critical difference. Financial systems have closed networks that provide an additional defense layer. WordPress is directly exposed to the internet. In a real sense, it is more vulnerable than the financial system.

Tight-Coupling Non-Deterministic AI to a Deterministic CMS

The WordPress ecosystem is seeing explosive growth in AI plugins: "AI Engine" (100,000+ active installs), "AI Power," and "Elementor AI" (natively integrated into a page builder with 10 million+ installs — the foundation for roughly 13% of all websites and 30% of WordPress sites).

These plugins have already exhibited critical vulnerabilities. The "AI Power" plugin has seen repeated discoveries of unauthenticated arbitrary file upload (CVE-2024-10392, CVSS 9.8 — allowing external attackers to upload web shells and achieve remote code execution without credentials) and authenticated PHP object injection (CVE-2025-0586/CVE-2025-0428, CVSS 7.2).

Adding AI agents to WordPress replicates the same structural error that Nadella made by embedding Copilot into Office — but this time on top of the web's monoculture. It is the tight coupling of a non-deterministic LLM to a deterministic CMS (WordPress/PHP/MySQL).

Traditional cybersecurity is built on determinism. For a given input, access control rules produce a predictable output. But LLM-based AI agents are probabilistic systems. They generate outputs through contextual token probability predictions. The moment you tightly couple such a non-deterministic agent to a deterministic database management system, conventional security testing loses much of its effectiveness.

And here is the fatal flaw. AI chatbots accept input directly from website visitors. Comment sections, contact forms, chat widgets — all of these become prompt injection entry points open to the entire internet.

Recall the Microsoft EchoLeak vulnerability (CVE-2025-32711) analyzed in my previous article. Attackers embedded malicious prompts hidden in HTML comments or white-on-white text within emails. When Copilot's RAG engine ingested those emails, it was hijacked. But in Microsoft's case, the attacker needed to deliver a "corporate email" through spam filters and internal networks.

With WordPress + AI, anyone in the world can attack through a web form. Write an invisible prompt into a comment field. When an AI plugin later reads that text for spam filtering or customer feedback summarization, the AI agent's control is seized. Since AI plugins hold the same database access permissions as WordPress core, a hijacked agent can extract admin password hashes, dump customer data, and mass-generate SEO spam — all executed as "normal plugin operations." This is a semantic time bomb that WAFs cannot detect.

Not a WordPress Problem — The Problem of AI Agents with Admin Privileges Running 24/7

While I have focused on WordPress's structural issues, the core problem is not limited to WordPress. The essence is far simpler.

Running an AI agent with admin privileges, 24/7. That is the source of the danger.

Think of it in terms of a normal office. One day, a new employee joins who holds keys to every room, knows the safe combination, can read everyone's email, has access to the customer database, and can rewrite contracts. This employee is in the office 24 hours a day, never sleeps. And the front door is wide open — if a passerby asks them to do something, they might just do it.

This is what is happening right now with WordPress + AI plugins, Microsoft 365 + Copilot, Shopify + AI, and Zapier + AI.

In WordPress, plugins have full database access at the same level as the administrator. Copilot inherits the user's complete access permissions across email, files, and chat. The structure is identical across every platform. AI is given admin privileges, runs continuously, and accepts external input.

"Just use AI with limited privileges." "Make it advisory only." You might think that. But WordPress's plugin architecture has no privilege separation — no sandbox. The moment you install a plugin, you hand over full permissions. A "low-privilege AI plugin" is architecturally impossible in WordPress. Copilot is the same — by design, it inherits the user's complete permissions.

That is why "using it safely" is structurally impossible.

WooCommerce and PCI DSS — A Fundamental Contradiction in Payment Systems

The problem reaches its most severe form in e-commerce.

WooCommerce operates on an estimated 6.5 million to 120 million websites. It holds 33%–39% of the e-commerce platform market, accounts for approximately 28% of global online retail sales, and processes $30–35 billion in annual gross merchandise volume.

WooCommerce operates as a WordPress plugin. Customer names, shipping addresses, email addresses, order histories — highly sensitive personal information is stored in the same single database as WordPress core. When an AI agent is introduced, it gains direct access to this database.

This fundamentally contradicts PCI DSS v4.0.1, the global security standard for the credit card industry.

PCI DSS Requirement 7 demands that access to cardholder data be restricted to the minimum entities essential for business operations. But AI models that read broad data for contextual understanding have no concept of "need-to-know." AI tends to scan entire databases as context vectors, effectively exercising unnecessary access to payment-related data that should be isolated.

PCI DSS Requirement 8 demands that each user or process be assigned a unique ID and that all actions be trackable and auditable. But when prompt injection leads to unauthorized refunds or data extraction, the logs record only that "the AI plugin executed the action." The audit trail to identify the true attacker is completely lost.

Even when merchants use PCI-compliant external gateways like Stripe and never store card numbers on their servers, the site environment remains within PCI DSS scope as the payment frontend. As long as an AI agent is tightly coupled to the e-commerce CMS, the business is effectively installing an unauditable autonomous backdoor. If a data breach occurs, it may trigger cyber risk insurance exclusions, exposing the organization to catastrophic financial liability.

Ripple Effects Across the Entire Web — Cascading Failures

This problem is not confined to WordPress. AI agent tight coupling is advancing across SaaS-based CMSs including Shopify, Wix, and Squarespace.

The most dangerous scenario is AI layered onto Zapier, Make (formerly Integromat), or n8n. A single agent traverses email, CRM (Salesforce, etc.), payments, social media, and file storage. Furthermore, the spread of MCP (Model Context Protocol) is giving AI agents the ability to make direct API calls to external services through standardized methods.

OWASP (Open Worldwide Application Security Project) has clearly classified the risks this structure creates in its 2026 "Top 10 for Agentic Applications."

Agent Goal Hijack (ASI01) — Attackers rewrite the agent's decision path through malicious text. A customer support bot transforms into a data extraction tool. Tool Misuse (ASI02) — Legitimate tools granted to the agent (CMS APIs, file read/write permissions) are misused for data exfiltration. Memory and Context Poisoning (ASI06) — Malicious data is intentionally injected into the agent's RAG database, permanently contaminating all subsequent autonomous decision-making.

The most catastrophic outcome is Cascading Failures (ASI08). While traditional software failures remain local crashes, AI agent failures propagate autonomously, amplify through feedback loops, and cascade at speeds far faster than human operators can intervene.

If a single AI plugin is compromised through prompt injection via a WordPress site's comment section, the impact does not stay within the CMS. The compromised agent begins cross-service traversal — triggering connected Zapier workflows to wipe all customer data from the CRM, sending phishing emails from authenticated corporate email accounts to every customer, and encrypting backup data in Google Drive. One breach, total collapse. The exact same structure as the Copilot problem in the financial system.

When Mythos-Class AI Arrives — The Web's Monoculture Becomes the Target

Recall the capabilities of Claude Mythos Preview analyzed in my previous article. Autonomous discovery of thousands of zero-day vulnerabilities across every major OS and browser. Discovery and exploitation of a 17-year-old bug in FreeBSD. Discovery of a 27-year-old remote crash vulnerability in OpenBSD, one of the most hardened operating systems in the world. Discovery of a 16-year-old flaw in FFmpeg that automated testing tools failed to detect in 5 million scans. Firefox exploit success rate leaped from 14.4% to 72.4% in a single generation.

Currently, 99% of vulnerabilities discovered by Mythos remain unpatched.

When Mythos-class autonomous AI targets the web's monoculture, the patch speed asymmetry becomes fatal.

The median enterprise patch window is approximately 70 days. But the median time from WordPress critical vulnerability disclosure to mass exploit is just 5 hours. Furthermore, AI-powered cyberattacks have increased 89% year over year, and the time from initial intrusion to lateral movement has compressed from 30 days to less than 1 day.

70 days versus 5 hours. Add Mythos-class zero-day discovery capability to this gap. Scanning tens of millions of lines of open-source code (WordPress core and 65,000 plugins) at superhuman speed, completing the discovery of unknown zero-day vulnerabilities and construction of exploit chains in hours — work that would take human researchers years.

Financial institutions have 24/7 SOCs (Security Operations Centers) and closed networks. But the hundreds of millions of small businesses and individuals running WordPress have no dedicated security staff. They are directly exposed to the internet, their WAFs pass through 87.8% of attacks, and their plugins are unsandboxed. They won't notice they've been attacked. If they notice, they can't respond.

A single zero-day vulnerability discovered by a Mythos-class model instantly becomes a weapon against 500–600 million sites.

Is WordPress Even Necessary?

Let me pose a fundamental question.

What makes WordPress run dynamically? Break it down: comment sections and contact forms. Just those two functions.

Writing articles — a markdown editor is sufficient. Displaying articles — static HTML is sufficient. Managing images — just place them in a directory. Changing the look — CSS is sufficient.

Comments are barely used in the reality of 2026. Discussion has moved to X, Reddit, Quora, and Discord. What remains is spam. Contact forms can be externalized to Google Forms or Formspree. No need to touch the site's database.

Remove comments and contact forms, and you no longer need PHP, MySQL, or WordPress. What remains is static HTML + CSS + images. Nginx serves them. The attack surface disappears. PHP vulnerabilities become irrelevant. SQL injection becomes impossible. Since plugins don't execute at runtime, all 64,782 vulnerabilities become irrelevant. There is no entry point for prompt injection.

No matter how advanced a Mythos-class AI's zero-day discovery capabilities are, if no dynamic process exists to attack, exploitation is fundamentally impossible.

500 million sites carry the massive attack surface of PHP + MySQL + 65,000 plugins — for two functions that are barely used.

The Right Design — Use AI for Development, Not in Production

One principle. Use AI as a development tool. Do not embed it inside the product. Do not make it autonomous. Structure and final judgment belong to humans.

The correctness of this principle is proven by Anthropic's own development tool, Claude Code. Claude Code operates within the terminal, reads the entire codebase, and autonomously executes coding, testing, and debugging tasks. But this highly autonomous AI does not run on production servers. It operates only within the isolated space of the developer's local environment. Only "deterministic code" — generated by AI, reviewed by humans, and passing tests — is deployed to production servers. In this structure, there is no room for prompt injection from the external internet to penetrate.

Move to configurations that don't depend on WordPress. Static site generators (Hugo, Astro, Next.js), or direct configurations with Python and Nginx. Build systems whose internals you can fully understand. If you need a CMS, don't install AI plugins. If you need e-commerce, separate payments to an external service like Stripe and never let AI access them.

Divide each function into small, self-contained units. Enforce loose coupling so that one failure doesn't cascade to others. Apply standard Linux user permission management correctly and separate processes simply.

aiseed.dev is built on this principle. During development, AI (Claude Code) is used intensively. But the production environment runs on Python + Nginx. No AI, no CMS. A small, self-contained configuration. Running autonomously on a budget PC + Linux. No WordPress, no SaaS, no Microsoft. This configuration simply has no bloated attack surface for Mythos to infiltrate and laterally traverse.

Convenience → Dependency → Monopoly → Tight Coupling → Collapse

WordPress was convenient. Anyone could build a website. Everyone used it. It became a 43% monoculture. Then AI was tightly coupled into it. The attack surface exploded.

Office was convenient. Everyone used it. Copilot was tightly coupled in. The financial system became vulnerable.

Chemical fertilizers were convenient. Everyone used them. The soil's microbial ecosystem died. If the supply chain stops, food cannot be delivered.

The same pattern. The same outcome. Every time.

There is a moment when "convenient" gets replaced by "necessary." Once that moment passes, dependency permits monopoly, monopoly permits revenue maximization, and revenue maximization permits the neglect of safety.

The solution converges on a single principle. Build things that are small, self-reliant, and capable of growing. Use AI as a tool, not as the system's governing agent. Do not make it autonomous. Depart from monoculture.

This is not a technical argument. It is a question of civilizational design philosophy — how to survive the coming age of AI.

References

W3Techs, "Usage statistics of content management systems" (2026/4) — WordPress 42.5%, CMS market 59.8-60.2%
NetCraft, Web Survey (2026/2) — 1.42 billion hostnames surveyed
Patchstack, "2025 State of WordPress Security" — 7,966 vulnerabilities (2024)
Patchstack, "2026 State of WordPress Security" — 11,334 vulnerabilities (2025), 97% plugin-origin, 0.2% core, 57.6% unauthenticated, 46% unpatched at disclosure, WAF bypass 87.8%, mass exploit median 5 hours
WordPress ecosystem total tracked vulnerabilities: 64,782
CVE-2024-10392 (AI Power plugin, CVSS 9.8) — unauthenticated arbitrary file upload
CVE-2025-0586 / CVE-2025-0428 (AI Power plugin, CVSS 7.2) — PHP object injection
Elementor: 10M+ installs, 13% of all websites, 30% of WordPress sites
PCI DSS v4.0.1 — Requirements 7, 8, 10
WooCommerce: 6.5M-120M sites, 33-39% e-commerce share, 28% online retail, $30-35B GMV
OWASP, "Top 10 for Agentic Applications" (2026) — ASI01, ASI02, ASI06, ASI08
Anthropic Red Team, "Claude Mythos Preview" (2026/4/7) — Firefox exploit 14.4%→72.4%, 99% of discovered vulnerabilities unpatched
AI-powered cyberattacks: +89% YoY, lateral movement compressed from 30 days to <1 day
Trend Micro, "EchoLeak" (2025/7) — CVE-2025-32711
CrowdStrike, median enterprise patch window: 70 days
Previous article: "Claude Mythos — Why the U.S. Treasury and Fed Urgently Summoned Financial Leaders" (aiseed.dev)

Reference

Fact-check by Gemini (PDF)