Dataset: Salesforce/wikitext
This is a DistilGPT2 model (82M parameters) fine-tuned on WikiText-103 for text generation and next-word prediction. It is part of the Pendo Text Editor's predictive text system.
Key Features:
Project: Pendo Text Editor - A modern text editor with AI-powered predictive text
Architecture: DistilGPT2
Training Infrastructure:
Training Configuration:
├─ Epochs: 3
├─ Batch size: 32 per device (effective: 128 with gradient accumulation)
├─ Learning rate: 5e-5 (cosine with 500 warmup steps)
├─ Block size: 512 tokens
├─ Weight decay: 0.01
├─ Gradient clipping: 1.0
└─ Optimizer: AdamW
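
The settings above correspond closely to parameter names of `transformers.TrainingArguments`. The sketch below is a hedged reconstruction, not the author's training script: the device count, the gradient accumulation steps (chosen so that 32 x 4 = 128), `total_steps`, and `output_dir` are all assumptions. It also spells out the learning-rate shape that "cosine with 500 warmup steps" usually implies, namely a linear warmup followed by a half-cosine decay to zero.

```python
import math

# Keys mirror transformers.TrainingArguments parameter names.
# ASSUMPTION: one device with 4 accumulation steps; the card only states
# the per-device batch size (32) and the effective batch size (128).
NUM_DEVICES = 1  # assumption
training_kwargs = {
    "output_dir": "pendo-distilgpt2-wikitext",  # hypothetical path
    "num_train_epochs": 3,
    "per_device_train_batch_size": 32,
    "gradient_accumulation_steps": 4,  # assumption
    "learning_rate": 5e-5,
    "lr_scheduler_type": "cosine",
    "warmup_steps": 500,
    "weight_decay": 0.01,
    "max_grad_norm": 1.0,  # gradient clipping
    "optim": "adamw_torch",
}

effective_batch_size = (
    training_kwargs["per_device_train_batch_size"]
    * training_kwargs["gradient_accumulation_steps"]
    * NUM_DEVICES
)


def cosine_lr(step, total_steps, base_lr=5e-5, warmup_steps=500):
    """Linear warmup to base_lr, then half-cosine decay to zero."""
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return base_lr * max(0.0, 0.5 * (1.0 + math.cos(math.pi * progress)))


if __name__ == "__main__":
    total_steps = 10_000  # hypothetical; depends on dataset size and batch size
    print("effective batch size:", effective_batch_size)
    for step in (0, 250, 500, 5_250, 10_000):
        print(step, f"{cosine_lr(step, total_steps):.2e}")
```

Passing `training_kwargs` as `TrainingArguments(**training_kwargs)` would be one way to use these values with the `Trainer` API, assuming a matching library version.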
Metrics (WikiText-103 Test Set)
| Metric | Value |
|---|---|
| Validation Loss | 3.206 |
| Training Loss | 3.379 |
| Perplexity | ~25 |
Validation loss (3.206) is lower than training loss (3.379), so there is no sign of overfitting.
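
Perplexity is the exponential of the mean per-token cross-entropy loss (in nats), so the reported figure can be checked against the table. The short sketch below reproduces the ~25 value from the validation loss. That the card's perplexity was derived from the validation loss rather than the training loss is an assumption, based only on the numbers matching.

```python
import math


def perplexity(mean_cross_entropy):
    """Perplexity for a mean per-token cross-entropy loss given in nats."""
    return math.exp(mean_cross_entropy)


validation_loss = 3.206
training_loss = 3.379

print(f"validation perplexity: {perplexity(validation_loss):.1f}")  # 24.7
print(f"training perplexity:   {perplexity(training_loss):.1f}")
```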
```python
from transformers import AutoTokenizer, AutoModelForCausalLM

# Load model
tokenizer = AutoTokenizer.from_pretrained("bekalebendong/pendo-distilgpt2-wikitext")
model = AutoModelForCausalLM.from_pretrained("bekalebendong/pendo-distilgpt2-wikitext")

# Generate text
prompt = "The history of"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=50,
    do_sample=True,   # sample instead of greedy decoding
    top_k=50,         # consider only the 50 most likely tokens
    top_p=0.95,       # nucleus sampling threshold
    temperature=0.8,  # values below 1.0 make output more conservative
    pad_token_id=tokenizer.eos_token_id,  # GPT-2 has no pad token; avoids a warning
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
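
To make the `temperature`, `top_k`, and `top_p` settings more concrete, here is a pure-Python sketch of how such filters can shape a next-token distribution before sampling. It is an illustration of the general technique, not the `transformers` implementation, and the toy logits and function names are hypothetical.

```python
import math
import random


def filter_distribution(logits, temperature=0.8, top_k=50, top_p=0.95):
    """Return {token_id: probability} after temperature, top-k, and top-p filtering."""
    # Temperature below 1.0 sharpens the distribution; above 1.0 flattens it.
    scaled = [x / temperature for x in logits]
    # Softmax with max-subtraction for numerical stability.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Top-k: keep only the k most probable tokens.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]

    # Top-p (nucleus): keep the smallest prefix whose cumulative mass reaches top_p.
    top_k_mass = sum(probs[i] for i in ranked)
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i] / top_k_mass
        if cumulative >= top_p:
            break

    # Renormalize over the surviving tokens.
    kept_mass = sum(probs[i] for i in kept)
    return {i: probs[i] / kept_mass for i in kept}


def sample_token(logits, rng=random, **kwargs):
    """Draw one token id from the filtered distribution."""
    dist = filter_distribution(logits, **kwargs)
    ids = list(dist)
    return rng.choices(ids, weights=[dist[i] for i in ids], k=1)[0]


if __name__ == "__main__":
    toy_logits = [4.0, 3.0, 2.0, 0.0, -2.0]  # hypothetical 5-token vocabulary
    print(filter_distribution(toy_logits, top_k=3, top_p=0.9))
    print(sample_token(toy_logits))
```

Lowering `top_p` or `top_k` shrinks the candidate set, which tends to suit predictive text, where a few safe suggestions matter more than variety.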
```python
from transformers import pipeline

# Create prediction pipeline
predictor = pipeline('text-generation', model="bekalebendong/pendo-distilgpt2-wikitext")

# Get next-token predictions (a single token may be only part of a word)
text = "Machine learning is"
predictions = predictor(
    text,
    max_new_tokens=1,
    num_return_sequences=5,
    do_sample=True,  # sampling is needed to return several different sequences
    return_full_text=False,
)
for pred in predictions:
    print(pred['generated_text'])
```
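
Because the five continuations are sampled independently, the same token can come back more than once. For a predictive text UI, one option is to deduplicate the results and order them by how often they were sampled. The helper below is a hypothetical sketch of that post-processing step; it only assumes the list-of-dicts shape with a `generated_text` key that the loop above already relies on.

```python
from collections import Counter


def rank_suggestions(generations, limit=3):
    """Deduplicate sampled continuations and order them by frequency.

    `generations` is a list of dicts with a 'generated_text' key,
    as produced by a text-generation pipeline.
    """
    words = [g["generated_text"].strip() for g in generations]
    counts = Counter(w for w in words if w)  # drop empty or whitespace-only tokens
    return [word for word, _ in counts.most_common(limit)]


if __name__ == "__main__":
    # Hypothetical pipeline output for the prompt "Machine learning is".
    sample_output = [
        {"generated_text": " a"},
        {"generated_text": " the"},
        {"generated_text": " a"},
        {"generated_text": " an"},
        {"generated_text": " a"},
    ]
    print(rank_suggestions(sample_output))
```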
If you use this model in your research, please cite:
```bibtex
@misc{pendo-distilgpt2-wikitext-103,
  author = {Dimitri Bekale},
  title = {Pendo DistilGPT2 - Fine-tuned on WikiText-103},
  year = {2025},
  publisher = {HuggingFace},
  howpublished = {\url{https://huggingface.co/bekalebendong/pendo-distilgpt2-wikitext}}
}
```
Dimitri Bekale
Model Status: ✅ Production Ready
Generation Quality: ✅ Verified
Last Updated: 2025
Generated with Claude Code