Oliver2021
's Collections
reasoning
updated
URSA: Understanding and Verifying Chain-of-thought Reasoning in
Multimodal Mathematics
Paper
•
2501.04686
•
Published
•
53
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with
Large Language Models
Paper
•
2501.09686
•
Published
•
41
LLaVA-o1: Let Vision Language Models Reason Step-by-Step
Paper
•
2411.10440
•
Published
•
129
TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem
Understanding
Paper
•
2502.19400
•
Published
•
47
Perception, Reason, Think, and Plan: A Survey on Large Multimodal
Reasoning Models
Paper
•
2505.04921
•
Published
•
185
Enigmata: Scaling Logical Reasoning in Large Language Models with
Synthetic Verifiable Puzzles
Paper
•
2505.19914
•
Published
•
45
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective
Reinforcement Learning for LLM Reasoning
Paper
•
2506.01939
•
Published
•
187
QwenLong-L1: Towards Long-Context Large Reasoning Models with
Reinforcement Learning
Paper
•
2505.17667
•
Published
•
88
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning
Logical Reasoning and Beyond
Paper
•
2505.19641
•
Published
•
68
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper
•
2505.24863
•
Published
•
97
REASONING GYM: Reasoning Environments for Reinforcement Learning with
Verifiable Rewards
Paper
•
2505.24760
•
Published
•
74
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware
Reinforcement Learning
Paper
•
2506.01713
•
Published
•
48
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper
•
2505.24726
•
Published
•
277
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and
Verifiable Mathematical Dataset for Advancing Reasoning
Paper
•
2504.11456
•
Published
•
12
VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models
with Reinforcement Learning
Paper
•
2504.08837
•
Published
•
43
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis
Paper
•
2506.02096
•
Published
•
52
VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code
Generation
Paper
•
2506.03930
•
Published
•
26