Cong Wei PRO

CongWei1230

https://congwei1230.github.io/

AI & ML interests

Generative Models; Reasoning

Recent Activity

updated a collection 1 day ago

MoCha

liked a model 1 day ago

CongWei1230/MoCha-Demo

updated a model 1 day ago

CongWei1230/MoCha-Demo

Organizations

upvoted a paper 4 days ago

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published 5 days ago • 87

upvoted a paper 14 days ago

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published 16 days ago • 36

upvoted a paper 25 days ago

MultiShotMaster: A Controllable Multi-Shot Video Generation Framework

Paper • 2512.03041 • Published 26 days ago • 62

upvoted 2 papers 26 days ago

Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Paper • 2511.20649 • Published Nov 25 • 45

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published 27 days ago • 69

upvoted 3 papers 2 months ago

upvoted 2 papers 3 months ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9 • 71

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 269

upvoted 2 papers 4 months ago

OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning

Paper • 2509.01644 • Published Sep 1 • 33

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1 • 76

upvoted a paper 6 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9 • 23

upvoted 5 papers 7 months ago

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Paper • 2502.01718 • Published Feb 3 • 28

QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design

Paper • 2505.16175 • Published May 22 • 41

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21 • 53

VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation

Paper • 2505.14640 • Published May 20 • 16

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published May 20 • 24

upvoted 2 collections 8 months ago

MoCha

Collection

Dialogue-driven Movie Shot Generation • 4 items • Updated 1 day ago • 1

MoCha

Collection

The pioneering work in Dialogue-driven Movie Shot Generation • 4 items • Updated 1 day ago • 2
