[2605.30719] When are LLMs Sufficient Policy Optimizers for Sequential RL Tasks?
Summary
Abstract page for arXiv paper 2605.30719: When are LLMs Sufficient Policy Optimizers for Sequential RL Tasks?