-
Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning
Paper • 2602.11149 • Published • 14 -
dakopi/distilled_numina__qwen3-0.6b
Viewer • Updated • 39.7k • 8 -
dakopi/distilled_numina__qwen3-8b
Viewer • Updated • 39.7k • 15 -
dakopi/distilled_numina__qwen3-0.6b__train_12800
Viewer • Updated • 12.8k • 7
Dawid
dakopi
AI & ML interests
None yet
Recent Activity
liked
a dataset 6 days ago
allenai/IF_multi_constraints_upto5 liked
a dataset 6 days ago
allenai/RLVR-IFeval liked
a dataset 8 days ago
peteromallet/dataclaw-peteromallet