
Massimo Caccia

optimass

https://optimass.github.io/

AI & ML interests

None yet

Recent Activity

new activity 10 days ago

ServiceNow/WorkArena-Instances: Update instances.json

upvoted an article 18 days ago

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

new activity 29 days ago

ServiceNow/WorkArena-Instances: Update instances.json


Organizations

upvoted an article 18 days ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

18 days ago


upvoted an article about 1 month ago

Article

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Nov 19


upvoted a paper about 1 month ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published Nov 10 • 105

upvoted 2 papers 4 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 190

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 227

upvoted an article 5 months ago

Article

How to Train Your LLM Web Agent: A Statistical Diagnosis

Jul 8


upvoted a paper 6 months ago

How to Train Your LLM Web Agent: A Statistical Diagnosis

Paper • 2507.04103 • Published Jul 5 • 50

upvoted an article 7 months ago

Article

GRPO for GUI Grounding Done Right

Jun 11


upvoted an article 8 months ago

Article

PipelineRL

Apr 25


upvoted 2 papers about 1 year ago

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published Dec 6, 2024 • 23

GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models

Paper • 2411.05830 • Published Nov 5, 2024 • 21

upvoted a paper over 1 year ago

RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content

Paper • 2406.11811 • Published Jun 17, 2024 • 16

upvoted a paper almost 2 years ago

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13, 2024 • 51
