·
AI & ML interests
None yet
Organizations
None yet
Preview
• Updated • 41
• 2
Viewer
• Updated • 1.41M • 6
Viewer
• Updated • 1.41M • 19
Viewer
• Updated • 1.41M • 7
zd21/ReST-MCTS_SciGLM-6B_Self-Rewarding-DPO_2nd
Viewer
• Updated • 1 • 14
zd21/ReST-MCTS_SciGLM-6B_ReST-MCTS_Policy_2nd
Viewer
• Updated • 40.9k • 8
zd21/ReST-MCTS_SciGLM-6B_ReST-EM-CoT_2nd
Viewer
• Updated • 28.9k • 4
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_Self-Rewarding-DPO_2nd
Viewer
• Updated • 1 • 3
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-MCTS_2nd
Viewer
• Updated • 26k • 4
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-EM-CoT_2nd
Viewer
• Updated • 36.6k • 6
zd21/ReST-MCTS_Llama3-8b-Instruct_Self-Rewarding-DPO_2nd
Viewer
• Updated • 1 • 4
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-MCTS_Policy_2nd
Viewer
• Updated • 32.3k • 4
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-EM-CoT_2nd
Viewer
• Updated • 33.2k • 6
zd21/ReST-MCTS_SciGLM-6B_Self-Rewarding-DPO_1st
Viewer
• Updated • 33.5k • 6
zd21/ReST-MCTS_SciGLM-6B_ReST-MCTS_Policy_1st
Viewer
• Updated • 30.1k • 4
zd21/ReST-MCTS_SciGLM-6B_ReST-EM-CoT_1st
Viewer
• Updated • 55.8k • 5
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_Self-Rewarding-DPO_1st
Viewer
• Updated • 1 • 4
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-MCTS_1st
Viewer
• Updated • 38.7k • 3
zd21/ReST-MCTS_Mistral-MetaMATH-7b-Instruct_ReST-EM-CoT_1st
Viewer
• Updated • 74k • 3
zd21/ReST-MCTS_Llama3-8b-Instruct_Self-Rewarding-DPO_1st
Viewer
• Updated • 1 • 4
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-MCTS_Policy_1st
Viewer
• Updated • 33.7k • 6
zd21/ReST-MCTS_Llama3-8b-Instruct_ReST-EM-CoT_1st
Viewer
• Updated • 73.1k • 7
Viewer
• Updated • 474k • 8
• 2
Viewer
• Updated • 91.8k • 192
• 7
Viewer
• Updated • 91.8k • 19
• 3
zd21/ReST-MCTS-Llama3-8b-Instruct-Policy-1st
Viewer
• Updated • 33.7k • 10
• 7
zd21/ReST-MCTS-Llama3-8b-Instruct-PRM-1st
Viewer
• Updated • 673k • 29
• 9