Huu-Thien Tran

mathesics

AI & ML interests

CV, multimodal, Gen AI

Recent Activity

upvoted a paper about 1 month ago

SAM 3D: 3Dfy Anything in Images

liked a Space 2 months ago

HuggingFaceFW/blogpost-fineweb-v1

liked a Space 2 months ago

HuggingFaceTB/smol-training-playbook

Organizations

None yet

upvoted a paper about 1 month ago

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published Nov 20, 2025 • 111

liked 2 Spaces 2 months ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.26k

Generate high-quality text data for LLMs using FineWeb

The Smol Training Playbook

📚

2.84k

The secrets to building world-class LLMs

liked 3 datasets 5 months ago

upvoted a paper 6 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 321

liked a Space 9 months ago

The Ultra-Scale Playbook

🌌

3.64k

The ultimate guide to training LLM on large GPU Clusters

upvoted 2 papers 11 months ago

Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10, 2025 • 32

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Paper • 2502.05173 • Published Feb 7, 2025 • 65

upvoted 6 papers 12 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28, 2025 • 123

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21, 2025 • 84

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 434

Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise

Paper • 2501.08331 • Published Jan 14, 2025 • 20

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11, 2025 • 90

Do generative video models learn physical principles from watching videos?

Paper • 2501.09038 • Published Jan 14, 2025 • 34