Open-RS-DR_GRPO-8B / generation_config.json

Commit History

Add merged model for training step 50 (adapter revision: 07bccdb7)
bc35aa6
verified

kangdawei committed on

End of training
a75b226
verified

kangdawei committed on