Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models Paper • 2606.25473 • Published 9 days ago • 24
Stable Audio 3 🎵: Text-to-audio with SA3 Medium / Small Music / Small SFX. Running on Zero • Agents • Featured • 117
google/diffusiongemma-26B-A4B-it Image-Text-to-Text • 26B • Updated about 3 hours ago • 1.42M • 1.09k
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published May 14 • 91
CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives Paper • 2605.12496 • Published May 12 • 30
TextLDM: Language Modeling with Continuous Latent Diffusion Paper • 2605.07748 • Published May 8 • 26
DialogueSidon Demo 🔥: Separate two speakers from an audio or video recording. Running on Zero • Agents • 12