Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
In a Training Loop 🔄
26760000000177.6
TFLOPS
1594
1780
4700
Omar Sanseviero
osanseviero
Follow
inoculatemedia's profile picture
MattBoraske's profile picture
roborovski's profile picture
3,443 followers
·
610 following
https://osanseviero.github.io/hackerllama/
osanseviero
osanseviero
omarsanseviero
osanseviero.bsky.social
AI & ML interests
Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.🦙
Recent Activity
new
activity
about 5 hours ago
google/gemma-4-E4B-it:
[Bug] `enable_thinking` doesn't trigger thinking mode for E4B/E2B
new
activity
about 5 hours ago
google/gemma-4-26B-A4B-it:
Add MMMU-Pro evaluation result
new
activity
about 5 hours ago
google/gemma-4-31B-it:
Add MMMU-Pro evaluation result
View all activity
Organizations
osanseviero
's models
301
Sort: Recently updated
osanseviero/distilbert-base-nli-wkpooling
Feature Extraction
•
Updated
May 4, 2021
•
8
Previous
1
...
9
10
11
Next