In a Training Loop 🔄

154 779

Michael Feldman

mfeldman143

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

neuphonic/neutts-nano-q8-gguf

Organizations

upvoted a paper about 6 hours ago

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published Dec 23, 2025 • 29

upvoted a paper 1 day ago

REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents

Paper • 2602.14234 • Published 6 days ago • 19

upvoted a paper 3 days ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published 4 days ago • 71

upvoted a collection 3 days ago

NVIDIA Cosmos 2

The latest open, multimodal generation models for world generation and reasoning for Physical AI. • 3 items • Updated 17 days ago • 14

upvoted a paper 8 days ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published 10 days ago • 56

upvoted an article 9 days ago

Forge: Scalable Agent RL Framework and Algorithm

Article • 9 days ago • 126

upvoted a paper 9 days ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published 10 days ago • 51

upvoted a collection 9 days ago

RADIO

A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 19 items • Updated 11 days ago • 32

upvoted 2 papers 16 days ago

MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning

Paper • 2601.21468 • Published 24 days ago • 25

Generative Visual Code Mobile World Models

Paper • 2602.01576 • Published 20 days ago • 41

upvoted an article 16 days ago

Introducing NVIDIA Cosmos Policy for Advanced Robot Control

Article • 23 days ago • 41

upvoted a paper 18 days ago

DocReward: A Document Reward Model for Structuring and Stylizing

Paper • 2510.11391 • Published Oct 13, 2025 • 27

upvoted a paper 21 days ago

TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers

Paper • 2601.14133 • Published Jan 20 • 61

upvoted a paper 25 days ago

Endless Terminals: Scaling RL Environments for Terminal Agents

Paper • 2601.16443 • Published 30 days ago • 16

upvoted a paper 28 days ago

Behavior Knowledge Merge in Reinforced Agentic Models

Paper • 2601.13572 • Published Jan 20 • 24

upvoted a collection 29 days ago

Qwen3-TTS

7 items • Updated about 1 month ago • 301

upvoted a paper 29 days ago

Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning

Paper • 2601.16163 • Published about 1 month ago • 14

upvoted a collection 30 days ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 9 items • Updated Jan 21 • 209

upvoted a paper 30 days ago

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 143

upvoted a paper about 1 month ago

LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR

Paper • 2601.14251 • Published Jan 20 • 24