Ray2333/GRM-Llama3.2-3B-rewardmodel-ft Text Classification • 3B • Updated Apr 30, 2025 • 1.04k • 13
Ray2333/Gemma-2B-rewardmodel-baseline Text Classification • 3B • Updated Feb 5, 2025 • 35 • 2
Ray2333/reward-model-Mistral-7B-instruct-Unified-Feedback Text Classification • 7B • Updated Feb 5, 2025 • 117 • 11
Ray2333/GRM-gemma2-2B-rewardmodel-ft Text Classification • 3B • Updated Feb 5, 2025 • 2.83k • 7
Ray2333/GRM_Llama3.1_8B_rewardmodel-ft Text Classification • 8B • Updated Feb 5, 2025 • 2 • 5
Ray2333/gpt2-large-helpful-reward_model Text Classification • 0.8B • Updated Jun 2, 2024 • 160k • • 13
Ray2333/gpt2-large-harmless-reward_model Text Classification • 0.8B • Updated Jun 2, 2024 • 160k • 4