MediaTek-Research/Breeze-ASR-25 Automatic Speech Recognition β’ 2B β’ Updated Jul 8 β’ 6.84k β’ 89
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models Paper β’ 2511.11007 β’ Published Nov 14 β’ 15
view article Article Weβre open-sourcing our text-to-image model and the process behind it Nov 12 β’ 76
nvidia/diar_streaming_sortformer_4spk-v2 Automatic Speech Recognition β’ Updated 15 days ago β’ 9.09k β’ 86
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper β’ 2508.16153 β’ Published Aug 22 β’ 160
facebook/dinov3-vit7b16-pretrain-lvd1689m Image Feature Extraction β’ 7B β’ Updated Aug 19 β’ 12.9k β’ 198
SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment Paper β’ 2507.20984 β’ Published Jul 28 β’ 57
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders Jul 9 β’ 745