[2605.30381] When LLMs Learn to Be Consistently Wrong: A Multi-Model Study of Linear Representations of Synthetic Deception

Read full story on arxiv.org
Share
[2605.30381] When LLMs Learn to Be Consistently Wrong: A Multi-Model Study of Linear Representations of Synthetic Deception
AI disclosure

Summary

Abstract page for arXiv paper 2605.30381: When LLMs Learn to Be Consistently Wrong: A Multi-Model Study of Linear Representations of Synthetic Deception

Original reporting

Open original source

Related coverage

Read full article on arxiv.org