gliner2 family Collection GLiNER2 extends the original GLiNER architecture to support multi-task information extraction with a schema-driven interface. This base model provid • 4 items • Updated Feb 10 • 46
Granite 4.1 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 6 items • Updated 16 days ago • 50
Earth-2 Collection Open, state-of-the-art models for Climate and Weather forecasting. Nowcasting, Medium range, S2S range, Downscaling. • 9 items • Updated 6 days ago • 23
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free Paper • 2505.06708 • Published May 10, 2025 • 11
OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data Paper • 2510.02410 • Published Oct 2, 2025 • 21
TimesFM Release Collection TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting. • 7 items • Updated Mar 12 • 51
Article A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons NormalUhr • Feb 4, 2025 • 35
Article Blazingly fast whisper transcriptions with Inference Endpoints mfuntowicz, freddyaboulton, Steveeeeeeen, reach-vb, erikkaum, michellehbn • May 13, 2025 • 82
Article Assisted Generation: a new direction toward low-latency text generation joaogante • May 11, 2023 • 78
Article Benchmarking Assisted Generation with Gemma 3 and Qwen 2.5: A Code-First Guide ariG23498 • Mar 12, 2025 • 6