Breaking News · Mar 8, 2026 · 4 min read

LLMs Can Unmask Pseudonymous Users at Scale With Surprising Accuracy

Researchers from ETH Zurich and Anthropic have demonstrated that large language models can automatically deanonymize pseudonymous online accounts cheaply and at scale, challenging long-held assumptions about digital privacy.

The study shows that LLMs can link pseudonymous accounts to real identities, or to one another, with startling accuracy by analyzing writing style, topics, spelling patterns, and other linguistic signals. This capability undermines the common belief that pseudonymity offers adequate protection against targeted deanonymization, which previously required extensive manual effort.

According to the research, the average online user has long operated under an implicit threat model assuming pseudonymity provides sufficient privacy. “LLMs invalidate this assumption,” the researchers noted, as detailed in coverage by Ars Technica. The findings suggest that creating burner accounts or maintaining separate pseudonymous identities may soon become far less effective for preserving anonymity on social media and forums.

Technical Capabilities and Scale

The study highlights that LLMs can perform deanonymization automatically, cheaply, and at massive scale. Traditional doxxing techniques—such as identifying regional spelling differences or community-specific discussion patterns—can now be executed by AI systems without human intervention. This automation dramatically lowers the barrier to linking disparate online personas.
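To make the classic profiling baseline concrete, here is a minimal stylometric sketch: representing each text as a character-trigram frequency vector and comparing accounts by cosine similarity. This is an illustration of the manual-era techniques the article describes, not the researchers' LLM-based method; all function names are our own, and real attribution systems use far richer features.

```python
from collections import Counter
from math import sqrt

def char_ngrams(text: str, n: int = 3) -> Counter:
    """Count overlapping character n-grams, a classic stylometric feature."""
    text = text.lower()
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse frequency vectors."""
    dot = sum(a[g] * b[g] for g in set(a) & set(b))
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def style_similarity(text_a: str, text_b: str) -> float:
    """Score in [0, 1]; higher suggests more similar writing style."""
    return cosine(char_ngrams(text_a), char_ngrams(text_b))
```

A human investigator might eyeball such scores across candidate account pairs; the study's point is that LLMs can pick up far subtler signals than trigram counts, and do so automatically across millions of accounts.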

Researchers demonstrated the approach by successfully unmasking pseudonymous users through analysis of text from various platforms. The process leverages an LLM's ability to detect subtle linguistic fingerprints that humans might miss or only identify through labor-intensive investigation. Coverage from GovInfoSecurity emphasizes that this capability directly challenges foundational assumptions about how online privacy functions in practice.

Yahoo Tech reported that the ETH Zurich and Anthropic team proved LLMs can unmask users "like Sarah" with accuracy that surprised even the researchers. While specific model sizes and exact benchmark numbers were not detailed in initial reports, the emphasis across sources is on the scalability and cost-effectiveness of the method compared to previous manual approaches.

Implications for Online Privacy

The research arrives at a time when millions of users rely on pseudonymous accounts for everything from whistleblowing to personal expression. Hacker News discussions noted that the techniques build on classic profiling methods but scale them dramatically through AI. Reddit's r/artificial community reacted strongly, with users observing that "creating burner accounts may be pointless" in light of these advances.

This development raises significant questions about the future of anonymous participation in online discourse. Platforms that have depended on pseudonymity as a core privacy feature may need to reconsider their architecture and user protections. The ability to automatically connect accounts could enable large-scale surveillance, targeted harassment, or exploitation by both state and non-state actors.

Industry and Research Context

The collaboration between ETH Zurich, a leading technical university, and Anthropic, a prominent AI safety-focused company, lends considerable weight to the findings. Anthropic's involvement is particularly notable given the company's public emphasis on responsible AI development and potential societal impacts.

The study adds to a growing body of research examining the privacy-eroding capabilities of advanced language models. While previous work has shown LLMs can infer sensitive information from text, this research specifically targets the pseudonymity model that underpins much of internet culture.

What's Next

The researchers have not yet publicly detailed exact timelines for broader availability of such techniques, though the core capability appears to already exist within current-generation LLMs. Future work is likely to explore defensive countermeasures, such as stylistic obfuscation tools or platform-level protections against automated linguistic fingerprinting.
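As one toy illustration of what a stylistic obfuscation tool might do, the sketch below normalizes a handful of regional spelling variants, one of the signals the article mentions. This is a hypothetical example of ours, not a tool from the study; real countermeasures would need to address vocabulary, syntax, and punctuation habits, and simple substitutions like this are likely easy for an LLM to see through.

```python
import re

# Toy spelling-normalization table (hypothetical; real tools would need far
# broader coverage to meaningfully blunt linguistic fingerprinting).
BRITISH_TO_AMERICAN = {
    "colour": "color",
    "favourite": "favorite",
    "organise": "organize",
    "analyse": "analyze",
}

def normalize_spelling(text: str) -> str:
    """Replace known British spellings with American ones, preserving case
    of an initial capital letter."""
    def swap(match: re.Match) -> str:
        word = match.group(0)
        repl = BRITISH_TO_AMERICAN.get(word.lower(), word)
        return repl.capitalize() if word[0].isupper() else repl

    pattern = r"\b(" + "|".join(BRITISH_TO_AMERICAN) + r")\b"
    return re.sub(pattern, swap, text, flags=re.IGNORECASE)
```

Platform-level defenses, by contrast, would more plausibly involve limiting bulk access to post histories rather than rewriting user text.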

For developers and platform operators, the findings suggest an urgent need to reassess privacy architectures. Users seeking to maintain separation between identities may need to adopt significantly more rigorous practices around content, vocabulary, and cross-posting behavior.

As LLMs continue to advance, the gap between assumed privacy and actual privacy appears to be widening rapidly. The ETH Zurich and Anthropic study serves as a clear warning that the era of reliable pseudonymity may be drawing to a close, with profound implications for online expression, journalism, activism, and personal privacy worldwide.

The research was covered across multiple outlets including Ars Technica, Yahoo Tech, and GovInfoSecurity in March 2026.

Original Source

arstechnica.com
