OpenLearnLM
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
-
OpenLearnLM/deepseek_qwen3_8b_pedagogical_think_reward_grpo_step_300
8B • Updated • 10 -
OpenLearnLM/deepseek_qwen3_8b_pedagogical_think_noreward_grpo_step_300
8B • Updated • 3 -
OpenLearnLM/deepseek_qwen3_8b_think_noreward_grpo_step_300
8B • Updated • 4 -
OpenLearnLM/deepseek_qwen3_8b_think_reward_grpo_step_300
8B • Updated • 5
-
OpenLearnLM/deepseek_qwen3_8b_pedagogical_think_reward_grpo_step_300
8B • Updated • 10 -
OpenLearnLM/deepseek_qwen3_8b_pedagogical_think_noreward_grpo_step_300
8B • Updated • 3 -
OpenLearnLM/deepseek_qwen3_8b_think_noreward_grpo_step_300
8B • Updated • 4 -
OpenLearnLM/deepseek_qwen3_8b_think_reward_grpo_step_300
8B • Updated • 5
models 9
OpenLearnLM/special-r1-deepseek-qwen3-8b-merged-dare-v2
Text Generation • 8B • Updated • 16
OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-reward
Text Generation • 8B • Updated • 19
OpenLearnLM/special-r1-deepseek-qwen3-8b-sped-adaptive-think-noreward
Text Generation • 8B • Updated • 19
OpenLearnLM/qwen2.5_7b_nothink_noreward_grpo_step_300
8B • Updated • 25
OpenLearnLM/deepseek_qwen3_8b_think_reward_grpo_step_300
8B • Updated • 5
OpenLearnLM/deepseek_qwen3_8b_think_noreward_grpo_step_300
8B • Updated • 4
OpenLearnLM/deepseek_qwen3_8b_nothink_grpo_step_300
8B • Updated
OpenLearnLM/deepseek_qwen3_8b_pedagogical_think_reward_grpo_step_300
8B • Updated • 10
OpenLearnLM/deepseek_qwen3_8b_pedagogical_think_noreward_grpo_step_300
8B • Updated • 3
datasets 0
None public yet