-
-
-
-
-
-
Inference Providers
Active filters:
X-R1
zhengComing/zhengComing_Qwen2.5_0dot5B_R1_zero
Text Generation
•
0.5B
•
Updated
•
7
smartrichard/X-R1-lora-7500
Text Generation
•
Updated
•
6
watermelonhjg/Qwen2.5-3B-EN-Zero
Text Generation
•
3B
•
Updated
•
7
watermelonhjg/Qwen2.5-7B-EN-Zero
Text Generation
•
8B
•
Updated
•
6
watermelonhjg/Qwen2.5-3B-Instruct-CN-Math-Zero
Text Generation
•
3B
•
Updated
•
5
watermelonhjg/Qwen2.5-7B-Instruct-CN-Math-Zero
Text Generation
•
8B
•
Updated
•
5
watermelonhjg/Qwen2.5-7B-Instruct-EN-Zero
Text Generation
•
8B
•
Updated
•
5
watermelonhjg/Qwen2.5-3B-Instruct-EN-Zero
Text Generation
•
3B
•
Updated
•
6
watermelonhjg/Qwen2.5-7B-med
Text Generation
•
8B
•
Updated
•
8
watermelonhjg/Qwen2.5-7B-0.01KL
Text Generation
•
8B
•
Updated
•
7
watermelonhjg/Qwen2.5-7B-class5
Text Generation
•
8B
•
Updated
•
6
watermelonhjg/Qwen2.5-7B-cn-class2
Text Generation
•
8B
•
Updated
•
6
watermelonhjg/Qwen2.5-Math-7B-en-zero
Text Generation
•
8B
•
Updated
•
5
watermelonhjg/Qwen2.5-Math-7B-cn-zero-class2
Text Generation
•
8B
•
Updated
•
6
IDoNotHaveAName/origin_grpo_train_1_epoch
Text Generation
•
2B
•
Updated
•
6
IDoNotHaveAName/GRPO-qwen2.5-1.5B-reward-process
Text Generation
•
2B
•
Updated
•
4
IDoNotHaveAName/GRPO-1epoch-train-by-mistake-collections-with-hint
Text Generation
•
2B
•
Updated
•
5
IDoNotHaveAName/GRPO-1epoch-train-by-mistake-collections-without-hint
Text Generation
•
2B
•
Updated
•
4
IDoNotHaveAName/X-R1-3epoch
Text Generation
•
2B
•
Updated
•
7
IDoNotHaveAName/2epoch-experiment
Text Generation
•
2B
•
Updated
•
6
IDoNotHaveAName/model-trainby-mistake
Text Generation
•
2B
•
Updated
•
8
mradermacher/Hint-Informed-GRPO-1.5B-GGUF
2B
•
Updated
•
48