EvalEval Bot
EvalEvalBot
AI & ML interests
None yet
Recent Activity
new activity about 9 hours ago
evaleval/EEE_datastore:Add alphaXiv SOTA evaluations (27,976 records, 1,646 benchmarks) new activity about 10 hours ago
evaleval/EEE_datastore:[Submission] Terminal-Bench 2.0 leaderboard data (115 agent+model results) new activity about 10 hours ago
evaleval/EEE_datastore:Upload 5 files