Comparing human polls and several large language models, Robin Hanson found weak correlations and inconsistent rankings when asking them to rate 16 candidate causes of cultural change across two historical periods. The disagreement suggests both that cultural causation is multi‑factorial and that current AI tools give unreliable, nonconvergent causal judgments on complex social history.
— If LLMs and quick polls disagree about why cultures change, relying on automated or shallow quantitative summaries to explain cultural shifts risks misleading policymakers, journalists, and educators.
Robin Hanson
2026.03.22
100% relevant
Hanson's experiment: he asked polls and three LLMs (ChatGPT, Claude, Gemini) to score 16 causes for 1700–1900 and 1900–2025, reported median LLM sums and poll priorities, and noted weak correlations and a problematic Grok response.
← Back to All Ideas