frankenstallm / data /preference /maywell_ko_Ultrafeedback_binarized.jsonl

Commit History

feat: Add SFT val + preference data (ORPO training, 630K pairs)
e9af455
verified

pathcosmos commited on