-
CL-From-Nothing/student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_nemotron-cascade-8b_epoch_3_mask
8B • Updated • 10 -
CL-From-Nothing/student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_qwen3-1.7b_epoch_3_mask
2B • Updated • 5 -
CL-From-Nothing/student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_nemtron_cascade-8b
8B • Updated • 6 -
CL-From-Nothing/student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_qwen3-1.7b
2B • Updated • 4
AI & ML interests
None defined yet.
Recent Activity
View all activity
Ablation datasets for cutoff-based completion experiments.
-
CL-From-Nothing/kukurasu-qwen1.7b-cutoff512-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 12 -
CL-From-Nothing/kukurasu-qwen1.7b-cutoff1024-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 16 -
CL-From-Nothing/kukurasu-qwen1.7b-cutoff2048-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 14 -
CL-From-Nothing/kukurasu-nemotron8b-cutoff512-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 13
-
CL-From-Nothing/student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_nemotron-cascade-8b_epoch_3_mask
8B • Updated • 10 -
CL-From-Nothing/student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_qwen3-1.7b_epoch_3_mask
2B • Updated • 5 -
CL-From-Nothing/student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_nemtron_cascade-8b
8B • Updated • 6 -
CL-From-Nothing/student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_qwen3-1.7b
2B • Updated • 4
Ablation datasets for cutoff-based completion experiments.
-
CL-From-Nothing/kukurasu-qwen1.7b-cutoff512-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 12 -
CL-From-Nothing/kukurasu-qwen1.7b-cutoff1024-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 16 -
CL-From-Nothing/kukurasu-qwen1.7b-cutoff2048-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 14 -
CL-From-Nothing/kukurasu-nemotron8b-cutoff512-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 20k • 13
models 46
CL-From-Nothing/polaris_warmup_polaris_offline_40K-parquet_qwen3-4b_epoch_1_mask_k4096_step312
4B • Updated • 20
CL-From-Nothing/polaris_warmup_polaris_offline_40K-parquet_qwen3-4b_epoch_1_mask_step312
4B • Updated • 19
CL-From-Nothing/Qwen3-4B-GRPO-polaris-hard-step40
4B • Updated • 11
CL-From-Nothing/Qwen3-4B-OPD-polaris-hard-warmup-step40
4B • Updated • 13
CL-From-Nothing/Qwen3-4B-OPD-polaris-hard-step40
4B • Updated • 16
CL-From-Nothing/Qwen3-4B-OPD-math-hard-509-step60
4B • Updated • 16
CL-From-Nothing/Qwen3-4B-OPD-math-hard-509-step45
4B • Updated • 16
CL-From-Nothing/Qwen3-1.7B-TokenReward-Minesweeper-MixedSFT-Thinking-epoch3
2B • Updated • 19
CL-From-Nothing/Qwen3-1.7B-GRPO-Minesweeper-MixedSFT-Thinking-epoch3
2B • Updated • 56
CL-From-Nothing/Qwen3-1.7B-TokenReward-Survo-DedupRL
2B • Updated • 17
datasets 102
CL-From-Nothing/RLVE-Eval20-Qwen3-4B-SSD-N20-SFT-Train
Viewer • Updated • 16k • 36
CL-From-Nothing/RLVE-Eval20-Qwen3-1.7B-SSD-N20-SFT-Train
Viewer • Updated • 16k • 49
CL-From-Nothing/rlve-eval20-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 64k • 34
CL-From-Nothing/rlve_teacher
Viewer • Updated • 32k • 60
CL-From-Nothing/RLVE-Multi-Task-Teacher
Preview • Updated • 64
CL-From-Nothing/RLVE-Eval
Viewer • Updated • 156 • 38
CL-From-Nothing/rlve-multitask-qwen3-4b-n4-randcut512-4096x20-completed-by-qwen3-4b-thinking-r16384
Viewer • Updated • 42.8k • 19
CL-From-Nothing/rlve-multitask-qwen3-4b-rollouts-n4-tokens16384
Viewer • Updated • 3.2k • 22
CL-From-Nothing/rlve-teacher-completion-qwen3-4b-thinking
Viewer • Updated • 3k • 210
CL-From-Nothing/FrozenLake-Hard-Trajectories
Viewer • Updated • 8k • 51