[2606.02684] Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation
AI disclosure
Summary
Abstract page for arXiv paper 2606.02684: Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation