TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published Dec 1, 2025 • 70 • 5
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published Dec 1, 2025 • 70 • 5
FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing Paper • 2310.05922 • Published Oct 9, 2023 • 4 • 1