Tommaso Bonomo

tommasobonomo

AI & ML interests

None yet

Recent Activity

updated a dataset about 2 months ago

sapienzanlp/LiteraryQA

upvoted a paper 26 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 228

upvoted a paper about 2 months ago

Tower+: Bridging Generality and Translation Specialization in Multilingual LLMs

Paper • 2506.17080 • Published Jun 20, 2025 • 7

upvoted a collection 3 months ago

ITA-Bench: Italian Benchmarks for LLMs

A collection of Italian benchmarks for Large Language Models. See also our Github repo: https://github.com/SapienzaNLP/ita-bench • 22 items • Updated 2 days ago • 8

upvoted a paper 4 months ago

LiteraryQA: Towards Effective Evaluation of Long-document Narrative QA

Paper • 2510.13494 • Published Oct 15, 2025 • 2

upvoted an article 6 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • Published Jul 8, 2025 • 759

upvoted a paper 7 months ago

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14, 2025 • 60

upvoted 3 papers 8 months ago

Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering

Paper • 2503.14996 • Published Mar 19, 2025 • 3

ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering

Paper • 2410.05077 • Published Oct 7, 2024 • 5

BOOKCOREF: Coreference Resolution at Book Scale

Paper • 2507.12075 • Published Jul 16, 2025 • 5

upvoted a collection 8 months ago

Reward Bench 2

Datasets, spaces, and models for Reward Bench 2 benchmark and paper! • 11 items • Updated Dec 23, 2025 • 16

upvoted a paper 8 months ago

RewardBench 2: Advancing Reward Model Evaluation

Paper • 2506.01937 • Published Jun 2, 2025 • 7

upvoted a collection 8 months ago

CLIPPER

Models and datasets for CLIPPER: Compression enables long-context synthetic data generation • 7 items • Updated Oct 3, 2025 • 5

upvoted a paper 12 months ago

The Road Less Scheduled

Paper • 2405.15682 • Published May 24, 2024 • 27

upvoted a paper about 1 year ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19, 2025 • 45

upvoted a paper over 1 year ago

Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OASIS

Paper • 2411.19655 • Published Nov 29, 2024 • 20

upvoted 2 collections over 1 year ago

Models for dataset curation

9 items • Updated Dec 5, 2024 • 17

Minerva LLMs

The first family of LLMs pretrained from scratch on Italian. • 6 items • Updated Dec 7, 2024 • 40

upvoted a paper over 1 year ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 80