Center for Language and Speech Processing @ JHU

university

https://www.clsp.jhu.edu/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Chuanyang-Jin authored a paper 6 minutes ago

Self-Compacting Language Model Agents

mmarone updated a collection 9 days ago

mmBERT: a modern multilingual encoder

TaiMingLu authored a paper 13 days ago

Strong Teacher Not Needed? On Distillation in LLM Pretraining

View all activity

Papers

DAR: Deontic Reasoning with Agentic Harnesses

Trust Functions: Near-Lossless Weak-to-Strong Generalization by Learning When to Trust the Weak Teacher

View all Papers

Collections 3

View 3 collections

spaces 1

Science Hierarchography

Explore academic paper hierarchies and details

models 53

jhu-clsp/mmBERT-small

Fill-Mask • Updated Oct 17, 2025 • 21.5k • • 76

jhu-clsp/mmBERT-base

Fill-Mask • Updated Oct 7, 2025 • 339k • • 217

jhu-clsp/mmBERT-checkpoints

Updated Sep 9, 2025 • 4

jhu-clsp/ettin-decoder-1b

Fill-Mask • Updated Jul 21, 2025 • 20 • 5

jhu-clsp/ettin-decoder-32m

Text Generation • Updated Jul 18, 2025 • 328

jhu-clsp/ettin-encoder-1b

Feature Extraction • Updated Jul 18, 2025 • 1.88k • 22

jhu-clsp/ettin-encoder-68m

Fill-Mask • Updated Jul 18, 2025 • 66.7k • • 5

jhu-clsp/ettin-dec-from-enc-32m

Text Generation • Updated Jul 18, 2025 • 4

jhu-clsp/ettin-encoder-150m

Fill-Mask • Updated Jul 18, 2025 • 5.86k • • 13

jhu-clsp/ettin-decoder-400m

Text Generation • Updated Jul 18, 2025 • 6.97k • 4

datasets 40

jhu-clsp/ManyIH-Bench

Preview • Updated Apr 13 • 47 • 3

jhu-clsp/robust04-instructions

Viewer • Updated Mar 12 • 136k • 993 • 2

jhu-clsp/core17-instructions

Viewer • Updated Mar 12 • 49.4k • 994 • 2

jhu-clsp/news21-instructions

Viewer • Updated Mar 12 • 71.5k • 785 • 1

jhu-clsp/SciTaRC

Viewer • Updated Mar 6 • 371 • 52 • 1

jhu-clsp/megawika-2

Updated Mar 3 • 97 • 4

jhu-clsp/mmBERT-decay-data

Updated Dec 11, 2025 • 33.2k • 6

jhu-clsp/mmBERT-midtraining-data

Updated Oct 13, 2025 • 2.3k • 1

jhu-clsp/ettin-pretraining-data

Updated Jul 18, 2025 • 118k • 9

jhu-clsp/ettin-decay-data

Updated Jul 18, 2025 • 1.09k • 1

View 40 datasets