Nagori

MohammedNaeem

2 252 138

Naeem_1144

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent

upvoted a paper 2 days ago

MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training

upvoted a paper 2 days ago

Reinforcement Learning with Metacognitive Feedback Elicits Faithful Uncertainty Expression in LLMs

Organizations

upvoted a paper 1 day ago

Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent

Paper • 2606.30616 • Published 6 days ago • 86

upvoted 2 papers 2 days ago

MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training

Paper • 2606.30406 • Published 6 days ago • 13

Reinforcement Learning with Metacognitive Feedback Elicits Faithful Uncertainty Expression in LLMs

Paper • 2606.32032 • Published 5 days ago • 22

upvoted a paper 3 days ago

PerceptionRubrics: Calibrating Multimodal Evaluation to Human Perception

Paper • 2606.28322 • Published 9 days ago • 38

upvoted 2 papers 4 days ago

DOPD: Dual On-policy Distillation

Paper • 2606.30626 • Published 6 days ago • 93

Agentic Abstention: Do Agents Know When to Stop Instead of Act?

Paper • 2606.28733 • Published 8 days ago • 142

upvoted a paper 5 days ago

AsyncOPD: How Stale Can On-Policy Distillation Be?

Paper • 2606.24143 • Published 12 days ago • 29

liked a model 5 days ago

InternScience/Agents-A1

Text Generation • 35B • Updated 2 days ago • 7.01k • 255

upvoted a paper 6 days ago

Translation as a Bridging Action: Transferring Manipulation Skills from Humans to Robots

Paper • 2606.28133 • Published 9 days ago • 39

upvoted 3 papers 8 days ago

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

Paper • 2606.24530 • Published 12 days ago • 62

World Value Models for Robotic Manipulation

Paper • 2606.24742 • Published 12 days ago • 7

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

Paper • 2606.26300 • Published 11 days ago • 47

upvoted 2 papers 9 days ago

OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning

Paper • 2606.26790 • Published 10 days ago • 54

Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation

Paper • 2606.26907 • Published 10 days ago • 49

upvoted a paper 10 days ago

Qwen-AgentWorld: Language World Models for General Agents

Paper • 2606.24597 • Published 12 days ago • 144

upvoted a paper 11 days ago

OpenThoughts-Agent: Data Recipes for Agentic Models

Paper • 2606.24855 • Published 12 days ago • 46

liked a model 14 days ago

SupraLabs/Supra-A2A-Nano-Exp

Any-to-Any • 29.7M • Updated 14 days ago • 33

liked 3 models 17 days ago
