20 198 9

Chengsong Huang

ChengsongHuang

https://chengsong-huang.github.io/

hcscctv

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

upvoted a paper 10 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

published a dataset 11 days ago

HINT-lab/qwen38b_solver_v1

View all activity

Organizations

upvoted a paper 3 days ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published 11 days ago • 26

upvoted a paper 10 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published 11 days ago • 152

published a dataset 11 days ago

HINT-lab/qwen38b_solver_v1

Viewer • Updated Aug 18, 2025 • 3.46k • 4

upvoted a paper 16 days ago

Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills

Paper • 2604.05333 • Published 19 days ago • 22

upvoted a paper 17 days ago

MARS: Enabling Autoregressive Models Multi-Token Generation

Paper • 2604.07023 • Published 18 days ago • 38

upvoted 2 papers about 1 month ago

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Paper • 2603.17187 • Published Mar 17 • 139

Video-Based Reward Modeling for Computer-Use Agents

Paper • 2603.10178 • Published Mar 10 • 43

authored a paper about 2 months ago

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Paper • 2603.09206 • Published Mar 10 • 53

upvoted 2 papers about 2 months ago

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Paper • 2603.09206 • Published Mar 10 • 53

Surgical Post-Training: Cutting Errors, Keeping Knowledge

Paper • 2603.01683 • Published Mar 2 • 12

upvoted 4 papers 2 months ago

upvoted 3 papers 3 months ago

EgoAVU: Egocentric Audio-Visual Understanding

Paper • 2602.06139 • Published Feb 5 • 12

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published Feb 5 • 28

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Paper • 2602.06028 • Published Feb 5 • 36

authored a paper 3 months ago

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published Feb 4 • 79

upvoted 2 papers 3 months ago

Reinforced Attention Learning

Paper • 2602.04884 • Published Feb 4 • 30

Steering LLMs via Scalable Interactive Oversight

Paper • 2602.04210 • Published Feb 4 • 18

Chengsong Huang

AI & ML interests

Recent Activity

Organizations

ChengsongHuang's activity