Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration Paper • 2605.17423 • Published 11 days ago • 26
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 2 days ago • 104
RLDX-1 Collection RLDX-1: General-purpose robotics foundation model for dexterous manipulation. • 11 items • Updated 21 days ago • 26
Rethinking State Tracking in Recurrent Models Through Error Control Dynamics Paper • 2605.07755 • Published 20 days ago • 23
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated Apr 22 • 43
Attentive Illumination Decomposition Model for Multi-Illuminant White Balancing Paper • 2402.18277 • Published Feb 28, 2024 • 1
ATTIQA: Generalizable Image Quality Feature Extractor using Attribute-aware Pretraining Paper • 2406.01020 • Published Jun 3, 2024 • 1
HVI: A New color space for Low-light Image Enhancement Paper • 2502.20272 • Published Feb 27, 2025 • 9
TempFlow-GRPO: When Timing Matters for GRPO in Flow Models Paper • 2508.04324 • Published Aug 6, 2025 • 11
StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation Paper • 2508.08248 • Published Aug 11, 2025 • 27
Article Extending *Transformer layers as Painters* to DiT's NagaSaiAbhinay • Aug 31, 2024 • 16