osunlp/bioscan-traits
Viewer • Updated • 80.8k • 51 • 1
Natural language processing, language models, language agents
When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents