CKeibel's Collections
Data
Updated Feb 13
HuggingFaceFW/fineweb-2 • Updated Oct 27, 2025 • 4.48B • 37.8k • 776
allenai/c4 • Updated Jan 9, 2024 • 10.4B • 621k • 539
ServiceNow-AI/R1-Distill-SFT • Updated Feb 8, 2025 • 1.85M • 2.12k • 315
PrimeIntellect/INTELLECT-2-RL-Dataset • Updated May 13, 2025 • 285k • 148 • 66
togethercomputer/RedPajama-Data-V2 • Updated Nov 21, 2024 • 6.18k • 399
wikimedia/wikipedia • Updated Jan 9, 2024 • 61.6M • 95.3k • 1.17k
avemio/German-RAG-EMBEDDING-TRIPLES-HESSIAN-AI • Updated Oct 16, 2024 • 294k • 8 • 1
urchade/synthetic-pii-ner-mistral-v1 • Updated Apr 20, 2024 • 298 • 15
yahma/alpaca-cleaned • Updated Apr 10, 2023 • 51.8k • 29.2k • 803