[2605.30561] VLM3: Vision Language Models Are Native 3D Learners
AI disclosure
Summary
Abstract page for arXiv paper 2605.30561: VLM3: Vision Language Models Are Native 3D Learners