Scaling Monosemanticity in Claude 3 Sonnet Features
Researchers scale up methods for extracting monosemantic features from Claude 3 Sonnet. The work aims to improve understanding of the model's internal representations. Analysis covers feature sparsity and semanti…