STA-V2A: Video-to-Audio Generation with Semantic and Temporal Alignment • Paper 2409.08601 • Published Sep 13, 2024