1 13 2

Frank Chen

quantumfr

AI & ML interests

alignment and Interpretability

Recent Activity

upvoted a paper 15 days ago

Towards Scalable Pre-training of Visual Tokenizers for Generation

upvoted a paper about 1 month ago

Geometrically-Constrained Agent for Spatial Reasoning

upvoted a collection about 1 month ago

SafeWork-R1

View all activity

Organizations

None yet

upvoted a paper 15 days ago

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published 18 days ago • 98

upvoted a paper about 1 month ago

Geometrically-Constrained Agent for Spatial Reasoning

Paper • 2511.22659 • Published Nov 27, 2025 • 40

upvoted a collection about 1 month ago

SafeWork-R1

Collection

7 items • Updated Nov 20, 2025 • 2

liked a model about 2 months ago

AI45Research/SafeWork-R1

Image-Text-to-Text • 73B • Updated Nov 6, 2025 • 35 • 4

upvoted a collection 2 months ago

Ouro

Collection

a family of pre-trained Looped Language Models. • 4 items • Updated Oct 29, 2025 • 21

liked a model 2 months ago

ByteDance/Ouro-1.4B

Text Generation • Updated Nov 16, 2025 • 14k • 57

upvoted a paper 2 months ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20, 2025 • 67

authored 3 papers 3 months ago

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Paper • 2507.18576 • Published Jul 24, 2025 • 8

Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step

Paper • 2509.23924 • Published Sep 28, 2025 • 8

Rethinking Entropy Regularization in Large Reasoning Models

Paper • 2509.25133 • Published Sep 29, 2025 • 4

upvoted a paper 3 months ago

Rethinking Entropy Regularization in Large Reasoning Models

Paper • 2509.25133 • Published Sep 29, 2025 • 4

commented a paper 3 months ago

Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models

Paper • 2509.23962 • Published Sep 28, 2025 • 5 •

upvoted 4 papers 3 months ago

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published Oct 9, 2025 • 22

CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards

Paper • 2510.08529 • Published Oct 9, 2025 • 18

Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models

Paper • 2509.23962 • Published Sep 28, 2025 • 5

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published Oct 2, 2025 • 80

authored 2 papers 3 months ago

Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models

Paper • 2509.23962 • Published Sep 28, 2025 • 5

Beyond External Monitors: Enhancing Transparency of Large Language Models for Easier Monitoring

Paper • 2502.05242 • Published Feb 7, 2025

upvoted a paper 3 months ago

Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step

Paper • 2509.23924 • Published Sep 28, 2025 • 8

upvoted a paper 4 months ago

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration

Paper • 2509.14760 • Published Sep 18, 2025 • 53

Frank Chen

AI & ML interests

Recent Activity

Organizations

quantumfr's activity