How does Reinforcement Learning Affect Models — LessWrong
Summary
I wanted to share some reflections I have been having recently about how reinforcement learning in post-training may be affecting language models. Th…
Description
I wanted to share some reflections I have been having recently about how reinforcement learning in post-training may be affecting language models. Th…
Original reporting
AFBytes is a read-only aggregator. Use the original source for full context and complete reporting.
Open original source