Researchers built an LLM‑driven pipeline that extracts identity cues from free‑text posts, searches the web for candidate matches using semantic embeddings, and verifies matches — identifying many pseudonymous users (e.g., Hacker News→LinkedIn) at commercial cost ($1–4 per profile) and high precision. The attack works on raw text across arbitrary platforms and outperforms classical deanonymization baselines.
— This shows practical anonymity on public forums can be rapidly and cheaply defeated by automated LLM pipelines, forcing policymakers, platforms, and vulnerable users to rethink privacy, whistleblower protection, and moderation rules.
BeauHD
2026.05.14
90% relevant
The article documents Claude processing a user's dumped college files, locating an old backup and weaknesses in password‑handling that enabled decryption of private keys—showing how LLMs can rapidly surface linkages and secrets in personal data that previously required specialized manual forensics.
Kelsey Piper
2026.04.21
95% relevant
The author reports that Anthropic’s Claude Opus 4.7 identified her from 125 words of unpublished writing (tested in Incognito and via API), which is a concrete instance of large language models being used to deanonymize people from small text samples — directly matching the existing idea that LLMs make deanonymization cheap and scalable.
Sebastian Jensen
2026.02.26
100% relevant
ETH Zurich paper: 338 Hacker News users (linked LinkedIn profiles removed from inputs) → 226 correct reidentifications (67%) at 90% precision; other tests on Reddit and partially redacted Anthropic interview transcripts showed similar vulnerabilities.
← Back to all ideas