-
Robix: A Unified Model for Robot Interaction, Reasoning and Planning
Paper • 2509.01106 • Published • 53 -
Open Data Synthesis For Deep Research
Paper • 2509.00375 • Published • 72 -
Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation
Paper • 2509.00428 • Published • 19 -
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
Paper • 2509.03405 • Published • 24
Shiqiang Wu
ShiqiangWoo
AI & ML interests
None yet
Organizations
None yet
20250902
-
PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning
Paper • 2508.21104 • Published • 37 -
T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables
Paper • 2508.19813 • Published • 28 -
No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes
Paper • 2508.19060 • Published • 12 -
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents
Paper • 2508.17198 • Published • 10
AI-generaed code
20250903
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 238 -
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Paper • 2509.02479 • Published • 84 -
POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion
Paper • 2509.01215 • Published • 51 -
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
Paper • 2509.00676 • Published • 85
20250901
-
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
Paper • 2508.21365 • Published • 29 -
TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training
Paper • 2508.17677 • Published • 14 -
UItron: Foundational GUI Agent with Advanced Perception and Planning
Paper • 2508.21767 • Published • 12 -
AHELM: A Holistic Evaluation of Audio-Language Models
Paper • 2508.21376 • Published • 9
EO
20250904
-
Robix: A Unified Model for Robot Interaction, Reasoning and Planning
Paper • 2509.01106 • Published • 53 -
Open Data Synthesis For Deep Research
Paper • 2509.00375 • Published • 72 -
Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation
Paper • 2509.00428 • Published • 19 -
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
Paper • 2509.03405 • Published • 24
20250903
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 238 -
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Paper • 2509.02479 • Published • 84 -
POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion
Paper • 2509.01215 • Published • 51 -
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
Paper • 2509.00676 • Published • 85
20250902
-
PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning
Paper • 2508.21104 • Published • 37 -
T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables
Paper • 2508.19813 • Published • 28 -
No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes
Paper • 2508.19060 • Published • 12 -
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents
Paper • 2508.17198 • Published • 10
20250901
-
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
Paper • 2508.21365 • Published • 29 -
TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training
Paper • 2508.17677 • Published • 14 -
UItron: Foundational GUI Agent with Advanced Perception and Planning
Paper • 2508.21767 • Published • 12 -
AHELM: A Holistic Evaluation of Audio-Language Models
Paper • 2508.21376 • Published • 9
AI-generaed code
EO