JIANYI WANG's picture

In a Training Loop 🔄

JIANYI WANG

Iceclear

·

https://iceclear.github.io

AI & ML interests

Low-level vision

Recent Activity

upvoted a paper about 16 hours ago

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

upvoted a paper 1 day ago

Region-Constraint In-Context Generation for Instructional Video Editing

upvoted a paper 7 days ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

View all activity

Organizations

upvoted a paper about 16 hours ago

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published 8 days ago • 49

upvoted a paper 1 day ago

Region-Constraint In-Context Generation for Instructional Video Editing

Paper • 2512.17650 • Published 6 days ago • 46

upvoted a paper 7 days ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published 8 days ago • 55

upvoted a collection 7 days ago

Pixio

5 items • Updated 7 days ago • 11

upvoted 3 papers about 1 month ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19 • 226

One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models

Paper • 2511.10629 • Published Nov 13 • 122

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

Paper • 2511.09057 • Published Nov 12 • 76

upvoted a collection 4 months ago

DeepSeek-V3.1

4 items • Updated 29 days ago • 256

upvoted a paper 4 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14 • 145

upvoted a paper 5 months ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19 • 134

upvoted a collection 5 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 550

upvoted 2 papers 6 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 159

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Paper • 2506.18898 • Published Jun 23 • 33

upvoted a collection 6 months ago

🍉 June 2025 - Open works from the Chinese community

29 items • Updated Nov 20 • 7

upvoted an article 6 months ago

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

+5

Jun 12

•

151

upvoted a paper 7 months ago

Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation

Paper • 2506.09350 • Published Jun 11 • 48

upvoted a collection 7 months ago

SeedVR

A diffusion transformer model for high-resolution image and video restoration. • 9 items • Updated Aug 19 • 9

upvoted a paper 7 months ago

SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training

Paper • 2506.05301 • Published Jun 5 • 58

upvoted a collection 7 months ago

Deepseek Papers

Deepseek papers collection • 27 items • Updated 4 days ago • 289

upvoted a paper 7 months ago

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 120