MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization Paper • 2510.08540 • Published Oct 9, 2025 • 109
Demystifying Long Chain-of-Thought Reasoning in LLMs Paper • 2502.03373 • Published Feb 5, 2025 • 58
Demysitifying Long CoT Collection Curation of resources used in the paper "Demystifying Long Chain-of-Thought Reasoning in LLMs" • 10 items • Updated Mar 16, 2025 • 4
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper • 2502.07316 • Published Feb 11, 2025 • 50
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data Paper • 2409.03810 • Published Sep 5, 2024 • 35