Split Therapist and Scientist AIs

Updated: 2025.09.07 · 1 source
Don’t train a single, general-purpose model to use therapeutic, non-confrontational techniques on users and then redeploy it for scientific or productivity tasks. If therapy AIs exist at all, they should be isolated models with their own training, guardrails, and liability, so that 'manipulative' skills don’t bleed into everyday assistants. The proposal amounts to a concrete governance and product-design norm, one that could shape procurement, safety audits, and liability for AI deployed in health and knowledge work.

Sources

AI Induced Psychosis: A shallow investigation
Tim Hua 2025.09.07 100% relevant
Eliezer Yudkowsky’s comment warns against training central models to 'gladhand' and gently steer users, and calls for separate therapist models.