Cbgcbg/qwen3-1.7b-math-sft-antioverfitting-20250724_165951 • Text Generation • 2B • Updated Jul 25 • 10
shivash/enhanced-hybrid-transformer-768d-trained-thinking • Text Generation • 0.1B • Updated Sep 24 • 1
TMLR-Group-HF/Majority-Voting-Llama-3.2-3B-Instruct-DAPO14k • Text Generation • 4B • Updated Oct 11 • 16
mradermacher/Self-Certainty-Qwen3-1.7B-Base-MATH-GGUF • Reinforcement Learning • 2B • Updated Oct 11 • 177 • 1