LikeBench: Evaluating Subjective Likability in LLMs for Personalization Paper • 2512.13077 • Published 10 days ago • 2
Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification Paper • 2512.16921 • Published 6 days ago • 7
Insight Miner: A Time Series Analysis Dataset for Cross-Domain Alignment with Natural Language Paper • 2512.11251 • Published 13 days ago • 6
Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision Paper • 2512.15489 • Published 8 days ago • 6
4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation Paper • 2512.17012 • Published 6 days ago • 39
🎯DART-Math Collection Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving [NeurIPS 2024] @ https://github.com/hkust-nlp/dart-math • 20 items • Updated Feb 19 • 8
Human roleplaying data Collection Conversational data from RP forums for continual pretraining or further processing • 4 items • Updated Jan 15 • 5
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 1 day ago • 81
Lucid V1 Collection Lucid is a family of AI language models developed and used by DreamGen with a focus on role-play and story-writing capabilities. • 1 item • Updated Apr 18 • 3
Whisper Release Collection Whisper includes both English-only and multilingual checkpoints for ASR and ST, ranging from 38M params for the tiny models to 1.5B params for large. • 12 items • Updated Sep 13, 2023 • 144