arxiv:2505.14045
Yingli Shen
ylshen
AI & ML interests
Postdoctoral Researcher @ THUNLP, Tsinghua University.
Researching Multilingual Large Language Models.
Recent Activity
updated
a dataset
8 days ago
openbmb/DCAD-2000
updated
a dataset
8 days ago
openbmb/DCAD-2000
authored
a paper
2 months ago
DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data
Cleaning as Anomaly Detection