SEGAgentRL

non-profit

AI & ML interests

We target improved agent reinforcement learning in terms of stability (S), efficiency (E), and generalization (G).

Recent Activity

dwenlong  updated a collection about 19 hours ago
LLDS-Search
dwenlong  updated a model about 21 hours ago
SEGAgentRL/LLDS-A-GRPO-Llama3.2-3B-Base-MA
dwenlong  published a model about 21 hours ago
SEGAgentRL/LLDS-A-GRPO-Llama3.2-3B-Base-MA
View all activity

datasets 0

None public yet