Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search • Paper 2506.11155 • Published Jun 11, 2025