Mensa Coate's picture

22

Mensa Coate

mensacoate

·

AI & ML interests

None yet

Organizations

None yet

upvoted 9 papers 5 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 277

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 187

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27, 2025 • 140

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning

Paper • 2507.21049 • Published Jul 28, 2025 • 40

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2, 2025 • 130

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 159

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 250

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 259

upvoted 5 papers 7 months ago

Taming LLMs by Scaling Learning Rates with Gradient Grouping

Paper • 2506.01049 • Published Jun 1, 2025 • 38

Frame In-N-Out: Unbounded Controllable Image-to-Video Generation

Paper • 2505.21491 • Published May 27, 2025 • 16

ImgEdit: A Unified Image Editing Dataset and Benchmark

Paper • 2505.20275 • Published May 26, 2025 • 18

Large Language Models are Locally Linear Mappings

Paper • 2505.24293 • Published May 30, 2025 • 14

ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents

Paper • 2505.23923 • Published May 29, 2025 • 8

upvoted 6 papers 9 months ago

Multi-Token Attention

Paper • 2504.00927 • Published Apr 1, 2025 • 56

DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance

Paper • 2504.01724 • Published Apr 2, 2025 • 68

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Paper • 2503.22655 • Published Mar 28, 2025 • 39

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Paper • 2504.01014 • Published Apr 1, 2025 • 70

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Paper • 2503.23461 • Published Mar 30, 2025 • 94

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30, 2025 • 138