LLMs and reasoning
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with
Large Language Models
Paper
• 2501.09686
• Published
• 41
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
Paper
• 2501.12948
• Published
• 441
Chain-of-Retrieval Augmented Generation
Paper
• 2501.14342
• Published
• 58
RL + Transformer = A General-Purpose Problem Solver
Paper
• 2501.14176
• Published
• 28
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
Paper
• 2502.07316
• Published
• 50
Logical Reasoning in Large Language Models: A Survey
Paper
• 2502.09100
• Published
• 24
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
Paper
• 2502.09601
• Published
• 14
SQuARE: Sequential Question Answering Reasoning Engine for Enhanced
Chain-of-Thought in Large Language Models
Paper
• 2502.09390
• Published
• 16
Small Models Struggle to Learn from Strong Reasoners
Paper
• 2502.12143
• Published
• 39
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement
Learning
Paper
• 2502.14768
• Published
• 47
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via
GRPO
Paper
• 2502.14669
• Published
• 15
Self-rewarding correction for mathematical reasoning
Paper
• 2502.19613
• Published
• 82
R1-Searcher: Incentivizing the Search Capability in LLMs via
Reinforcement Learning
Paper
• 2503.05592
• Published
• 27
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale
Reinforcement Learning
Paper
• 2503.07365
• Published
• 61
A Simple "Try Again" Can Elicit Multi-Turn LLM Reasoning
Paper
• 2507.14295
• Published
• 14