yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B-Base Text Generation • 4B • Updated 9 days ago • 36
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base Text Generation • 4B • Updated 9 days ago • 41
yujunzhou/SFT_Advanced_Risk_Situation_Aware_Qwen3-4B-Base Text Generation • 4B • Updated 10 days ago • 130
yujunzhou/SFT_Advanced_Risk_Summarization_Qwen3-4B-Base Text Generation • 4B • Updated 12 days ago • 54
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B-Base_situation_aware 4B • Updated Oct 29 • 8
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_llama_situation_aware 8B • Updated Oct 29 • 10
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B_situation_aware 4B • Updated Oct 29 • 8
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base_situation_aware 4B • Updated Oct 28 • 7
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B-Base_situation_aware 4B • Updated Oct 28 • 4