[2605.31159] Trust-Region Behavior Blending for On-Policy Distillation