view article Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment Feb 11, 2025 • 97
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper • 2511.09057 • Published Nov 12, 2025 • 76
Cross-lingual Transfer Learning for Javanese Dependency Parsing Paper • 2401.12072 • Published Jan 22, 2024
Sleeping Ai Mindfulness Apps 👁 Generate recommendations and risk assessments aligned with Kalbe Group values
fadliaulawi/distilbert-base-uncased-finetuned-squad-d5716d28 Question Answering • Updated Jul 27, 2023 • 5