GRPO/PPO Finetunes for Creative Writing
DV
AI & ML interests
Post training @ https://dphn.ai
Recent Activity
updated a dataset 1 day ago
NewEden/RL-Seed-Mix-Iter-4 published a dataset 1 day ago
NewEden/RL-Seed-Mix-Iter-4 updated a dataset 1 day ago
Delta-Vector/Tauri-RL-Styles-V2Organizations
models 112
Delta-Vector/Qwen-ckpt-100
Text Generation • Updated • 12
Delta-Vector/Rei-24B-KTO
Text Generation • 24B • Updated • 131 • 16
Delta-Vector/Dr-House-Evals
Updated
Delta-Vector/Austral-4.5B-Winton
Text Generation • 5B • Updated • 10 • 11
Delta-Vector/Nanuq-R1-9B
Text Generation • 11B • Updated • 5 • 4
Delta-Vector/Nanuq-R1-14B
Text Generation • 14B • Updated • 6 • 2
Delta-Vector/Austral-AFM-SFT
5B • Updated • 4
Delta-Vector/Elenchus
545k • Updated • 2
Delta-Vector/Austral-32B-GLM4-Winton
Text Generation • 33B • Updated • 6 • 8
Delta-Vector/Austral-GLM4-SFT
33B • Updated • 1
datasets 124
Delta-Vector/Tauri-RL-Styles-V2
Viewer • Updated • 128 • 1
Delta-Vector/CAI-critic-revision-8k-cleaned-sharegpt
Viewer • Updated • 8.1k • 17
Delta-Vector/Ursa-Armored-Core-6-Lore
Viewer • Updated • 166 • 20
Delta-Vector/wordlist
Viewer • Updated • 253 • 15
Delta-Vector/Tauri-RL-Styles
Viewer • Updated • 32 • 94
Delta-Vector/Hydrus-Olmo-3-sft-dedup-ngram-filter-r1
Viewer • Updated • 1.67M • 3
Delta-Vector/Ursa-Armored-Core-Lore-Kimi
Viewer • Updated • 286 • 6
Delta-Vector/Hydrus-Hardcode-Dphn
Viewer • Updated • 220 • 18
Delta-Vector/Hydrus-Smoltalk-3-Subset-Demarkdownified
Viewer • Updated • 92.1k • 8
Delta-Vector/Hydrus-Next-Coder-Single-turn
Viewer • Updated • 17.3k • 44