Taming Actor-Observer Asymmetry in Agents via Dialectical Alignment Paper • 2604.19548 • Published 10 days ago • 14
atefarabi/meme-namer-floodgate-Qwen36-27B-lora Image-Text-to-Text • 28B • Updated 6 days ago • 199 • 1
Learning Adaptive Reasoning Paths for Efficient Visual Reasoning Paper • 2604.14568 • Published 15 days ago • 8
WildDet3D: Scaling Promptable 3D Detection in the Wild Paper • 2604.08626 • Published 22 days ago • 243
An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU Paper • 2603.16428 • Published Mar 17 • 51
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published 28 days ago • 375
CLEAR: Unlocking Generative Potential for Degraded Image Understanding in Unified Multimodal Models Paper • 2604.04780 • Published 25 days ago • 10
PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding Paper • 2604.00886 • Published 29 days ago • 6
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 340
AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding Paper • 2603.28696 • Published about 1 month ago • 6
GEditBench v2: A Human-Aligned Benchmark for General Image Editing Paper • 2603.28547 • Published about 1 month ago • 32