Llms and reasoning
updated
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with
Large Language Models
Paper
•
2501.09686
•
Published
•
41
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
Paper
•
2501.12948
•
Published
•
434
Chain-of-Retrieval Augmented Generation
Paper
•
2501.14342
•
Published
•
58
RL + Transformer = A General-Purpose Problem Solver
Paper
•
2501.14176
•
Published
•
28
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
Paper
•
2502.07316
•
Published
•
50
Logical Reasoning in Large Language Models: A Survey
Paper
•
2502.09100
•
Published
•
24
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
Paper
•
2502.09601
•
Published
•
14
SQuARE: Sequential Question Answering Reasoning Engine for Enhanced
Chain-of-Thought in Large Language Models
Paper
•
2502.09390
•
Published
•
16
Small Models Struggle to Learn from Strong Reasoners
Paper
•
2502.12143
•
Published
•
39
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement
Learning
Paper
•
2502.14768
•
Published
•
47
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via
GRPO
Paper
•
2502.14669
•
Published
•
15
Self-rewarding correction for mathematical reasoning
Paper
•
2502.19613
•
Published
•
82
R1-Searcher: Incentivizing the Search Capability in LLMs via
Reinforcement Learning
Paper
•
2503.05592
•
Published
•
27
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale
Reinforcement Learning
Paper
•
2503.07365
•
Published
•
61
A Simple "Try Again" Can Elicit Multi-Turn LLM Reasoning
Paper
•
2507.14295
•
Published
•
13