[2606.02441] Spatial-Temporal Decoupled Reference Conditioning for Identity-Preserving Text-to-Video Generation
Abstract page for arXiv paper 2606.02441: Spatial-Temporal Decoupled Reference Conditioning for Identity-Preserving Text-to-Video Generation