Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2 Image-Text-to-Text • 28B • Updated 23 days ago • 610k • 116
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6, 2025 • 191