[2606.02871] Adaptive Latent Agentic Reasoning
Abstract page for arXiv paper 2606.02871: Adaptive Latent Agentic Reasoning
America Forever Bytes
Other
Abstract page for arXiv paper 2606.02871: Adaptive Latent Agentic Reasoning
Abstract page for arXiv paper 2606.03021: Hint-Guided Diversified Policy Optimization for LLM Reasoning
Abstract page for arXiv paper 2606.03503: ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning
Abstract page for arXiv paper 2606.02835: Thinking Past the Answer: Evaluating Harmful Overthinking in Large Reasoning Models
Abstract page for arXiv paper 2606.02842: Spectral-Progressive Thought Flow for Lightweight Multimodal Reasoning
Abstract page for arXiv paper 2511.16886: Latent Reasoning in TRMs is Secretly a Policy Improvement Operator
Abstract page for arXiv paper 2606.02020: Unveiling the Entropy Dynamics of Chain-of-Thought Reasoning
Abstract page for arXiv paper 2606.02248: Geometric Latent Reasoning Induces Shorter Generations in LLMs
Abstract page for arXiv paper 2606.01462: An Enigma of Artificial Reason: Investigating the Production-Evaluation Gap in Large Reasoning Models
Abstract page for arXiv paper 2606.01464: Cross-lingual Self-Consistency for Multilingual Reasoning with Language Models
Abstract page for arXiv paper 2605.30219: When Should Models Change Their Minds? Contextual Belief Management in Large Language Models
Abstract page for arXiv paper 2605.29458: Adaptive Interviewing for Persona Simulation in LLMs: Evidence-Grounded Reasoning Improves Decision Alignment
Carelessly jumping to a conclusion is a typical consequence of failing to properly analyze new information.
Abstract page for arXiv paper 2605.27567: Why LLMs Fail at Causal Discovery and How Interventional Agents Escape
Abstract page for arXiv paper 2605.28277: Do LLMs Build World Models From Text? A Multilingual Diagnostic of Spatial Reasoning
Abstract page for arXiv paper 2605.28433: Roles with Rails: Contract-Preserving Role Evolution in Multi-Agent Structured Reasoning
Abstract page for arXiv paper 2605.28774: Agent Explorative Policy Optimization for Multimodal Agentic Reasoning
Abstract page for arXiv paper 2605.28465: Beyond One Path: Evaluating and Enhancing Divergent Thinking in Interactive LLM Agents
Abstract page for arXiv paper 2605.28467: Mitigating Adaptive Attacks against Reasoning Models with Activation Consistency Training
Abstract page for arXiv paper 2605.28600: Transformers Provably Learn to Internalize Chain-of-Thought
Abstract page for arXiv paper 2605.28602: Satisfiability Solving with LLMs: A Matched-Pair Evaluation of Reasoning Capability
Abstract page for arXiv paper 2605.28713: Thinking as Compression: Your Reasoning Model is Secretly a Context Compressor
Abstract page for arXiv paper 2605.27965: The Shape of Overthinking: Backtracking Bursts in Long Reasoning Traces
Abstract page for arXiv paper 2605.28087: Whose Is This?: Context-Aware Object Ownership Inference with Uncertainty-Guided Questioning
Abstract page for arXiv paper 2605.28292: CIRF: Tokenizing Chain-of-Thoughts into Reusable Functional Units for Efficient Latent Reasoning in Large Language Mod...
Scientists rethink their ideas after experiments. AI agents struggle to learn from evidence and recognize when an idea is obviously incorrect.