agurung/flawed-fictions-qwen3-4b-lengthpenalty-litereason Reinforcement Learning • 4B • Updated 16 days ago • 84
agurung/flawed-fictions-gemma-3-4b-lengthpenalty Reinforcement Learning • 4B • Updated 29 days ago • 63
agurung/flawed-fictions-qwen3-4b-lengthpenalty Reinforcement Learning • 4B • Updated about 1 month ago • 12