The Changing North Star of AI Control — LessWrong
Summary
On December 1st, 2025, the Google DeepMind (GDM) mechanistic interpretability team published a LessWrong article declaring a pivot to a pragmatic approach to interpretability. Much tim…