OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper β’ 2512.07802 β’ Published 5 days ago β’ 42
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation Paper β’ 2511.12207 β’ Published 28 days ago β’ 8
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper β’ 2512.02014 β’ Published 12 days ago β’ 64
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation Paper β’ 2511.12207 β’ Published 28 days ago β’ 8
Scaling Zero-Shot Reference-to-Video Generation Paper β’ 2512.06905 β’ Published 6 days ago β’ 28 β’ 4
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper β’ 2512.02014 β’ Published 12 days ago β’ 64
Running on Zero MCP Featured 1.59k Qwen Image Edit Camera Control π¬ 1.59k Fast 4 step inference with Qwen Image Edit 2509
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models Paper β’ 2511.10629 β’ Published 30 days ago β’ 122
Running on Zero MCP Featured 2.41k Wan2.2 14B Fast π₯ 2.41k generate a video from an image with a text prompt