Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
11398.0
TFLOPS
10
9
17
Sumuk Shashidhar
PRO
sumuks
Follow
alozowski's profile picture
Aurelien-Morgan's profile picture
kramp's profile picture
35 followers
·
54 following
https://sumuk.org
sumukx
sumukshashidhar
sumuks
AI & ML interests
Evaluations, Reasoning, Long Term Planning
Recent Activity
updated
a dataset
about 16 hours ago
sumuks/openai-coval-dpo-standard-reward
published
a dataset
about 16 hours ago
sumuks/openai-coval-dpo-standard-reward
updated
a dataset
about 16 hours ago
sumuks/helpsteer3-standard-reward
View all activity
Organizations
sumuks
's datasets
35
Sort: Recently updated
sumuks/helpsteer3-standard-reward
Updated
about 14 hours ago
sumuks/openai-coval-dpo-standard-reward
Updated
about 15 hours ago
sumuks/openai-coval-dpo-response-aware-rubric-rewards
Updated
about 16 hours ago
sumuks/helpsteer3-response-aware-rubric-rewards
Updated
about 16 hours ago
sumuks/Litbench-Verified-with-rewards-hard-only
Viewer
•
Updated
1 day ago
•
7.9k
•
6
sumuks/Litbench-Verified-with-rewards
Viewer
•
Updated
1 day ago
•
20k
•
16
sumuks/helpsteer3
Viewer
•
Updated
4 days ago
•
37.9k
•
182
sumuks/openai-coval-dpo
Viewer
•
Updated
9 days ago
•
5.58k
•
128
sumuks/preference-atlas-rewards
Viewer
•
Updated
25 days ago
•
5.03k
•
33
sumuks/preference-atlas
Viewer
•
Updated
25 days ago
•
329k
•
108
•
1
sumuks/reward-bench-2
Viewer
•
Updated
26 days ago
•
1.87k
•
48
sumuks/helpsteer3-easy
Viewer
•
Updated
Feb 17
•
7.93k
•
29
sumuks/helpsteer-pairwise-grading
Viewer
•
Updated
Feb 12
•
51.8k
•
20
sumuks/rupo-eval-logs-helpsteer3-1
Viewer
•
Updated
Feb 10
•
1.43k
•
47
sumuks/helpsteer3-rupo
Viewer
•
Updated
Feb 10
•
38.2k
•
15
sumuks/persuasiveness_detection
Viewer
•
Updated
Feb 6
•
3.94k
•
9
sumuks/rupo-eval-humanlike-dpo-dataset-lbhr-2
Preview
•
Updated
Feb 6
•
18
sumuks/rupo-eval-humanlike-dpo-dataset-lrhb-2
Preview
•
Updated
Feb 6
•
18
sumuks/rupo-eval-humanlike-dpo-dataset-lrhb-1
Viewer
•
Updated
Feb 5
•
1k
•
12
sumuks/rupo-eval-humanlike-dpo-dataset-lbhr-1
Viewer
•
Updated
Feb 5
•
1k
•
11
sumuks/rupo-eval-humanlike-dpo-dataset-lrhb
Viewer
•
Updated
Feb 5
•
3
•
30
sumuks/rupo-eval-humanlike-dpo-dataset-lbhr
Viewer
•
Updated
Feb 5
•
142
•
211
sumuks/rupo-eval-logs-litbench-1
Preview
•
Updated
Feb 5
•
141
sumuks/rupo-eval-logs-lmarena-1
Viewer
•
Updated
Feb 5
•
775
•
57
sumuks/rupo-eval-logs-lmarena
Viewer
•
Updated
Feb 5
•
1.03k
•
43
sumuks/rupo-eval-logs-litbench
Viewer
•
Updated
Feb 5
•
75
•
12
sumuks/rupo-eval-logs
Viewer
•
Updated
Feb 5
•
100
•
10
sumuks/rupo-eval-logs-test
Viewer
•
Updated
Feb 5
•
1
•
9
sumuks/combined-synthetic-task
Viewer
•
Updated
Feb 5
•
10.6k
•
8
sumuks/persuasion
Viewer
•
Updated
Feb 1
•
14.1k
•
7
Previous
1
2
Next