Large language models can automatically generate crashing inputs and surface logic errors across large codebases, finding bugs that decades of fuzzing and static analysis missed. In short trials, an LLM produced hundreds of unique crashing inputs and identified distinct classes of logic bugs beyond the reach of conventional fuzzers.
If LLMs routinely uncover longstanding, high-severity bugs in widely used software, that will change how vendors, open-source projects, regulators, and attackers approach software security, liability, and disclosure practices.
Tyler Cowen
2026.04.18
60% relevant
Tyler Cowen links an analysis labeled 'Here is analysis from Claude 4.7', used to recheck the economic effect of extreme heat. This fits the broader pattern of modern large language models (here, Claude) being deployed as analytic or auditing tools that can surface empirical results, the same trend evidenced by Claude finding software bugs and other analytic uses of AI.
BeauHD
2026.03.10
80% relevant
Both items show the same emergent capability: Anthropic's Claude can read or reverse-engineer software artifacts and surface security bugs that traditional tooling missed. In this article, Microsoft Azure CTO Mark Russinovich used Claude Opus 4.6 to decompile 6502 Apple II machine code and find a pointer/error-handling bug, the same pattern as Claude finding deep bugs in modern browser code.
EditorDavid
2026.03.07
100% relevant
Anthropic says Claude Opus 4.6 found more than 100 Firefox bugs (14 high severity) in two weeks and supplied reproducible test cases that let Mozilla patch issues within hours.