ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding Paper • 2603.27064 • Published 11 days ago • 23
Article Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents 8 days ago • 33
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs Paper • 2603.16932 • Published 25 days ago • 86
CARES: Context-Aware Resolution Selector for VLMs Paper • 2510.19496 • Published Oct 22, 2025 • 9
NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper • 2603.08397 • Published 30 days ago • 21
Article Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge 30 days ago • 16
Granite 4.0 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 11 items • Updated 6 days ago • 216
Granite 3.1 Language Models Collection Long-context language models for enterprise-grade text generation. • 9 items • Updated 7 days ago • 69
Granite 4.0 Nano Language Models Collection Ultra-compact language models designed for the edge and on-device deployment. • 9 items • Updated 7 days ago • 100
Charting and Navigating Hugging Face's Model Atlas Paper • 2503.10633 • Published Mar 13, 2025 • 93
Advancing Speech Understanding in Speech-Aware Language Models with GRPO Paper • 2509.16990 • Published Sep 21, 2025 • 22
Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers Sep 11, 2025 • 186
Granite Speech Models Collection Multilingual ASR and speech-to-text (STT) models for enterprise transcription and translation. • 6 items • Updated 7 days ago • 24
Continuous Speech Synthesis using per-token Latent Diffusion Paper • 2410.16048 • Published Oct 21, 2024 • 30
Article Saving Memory Using Padding-Free Transformer Layers during Finetuning Jun 11, 2024 • 21