MergeBench

classroom

https://yifei-he.github.io/mergebench/

Recent Activity

yifeihe3 authored a paper about 8 hours ago

Semi-Supervised Reward Modeling via Iterative Self-Training

yifeihe3 authored a paper about 8 hours ago

Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks

yifeihe3 authored a paper about 8 hours ago

MergeBench: A Benchmark for Merging Domain-Specialized LLMs

authored 6 papers about 8 hours ago

Semi-Supervised Reward Modeling via Iterative Self-Training

Paper • 2409.06903 • Published Sep 10, 2024

Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks

Paper • 2410.18210 • Published Oct 23, 2024

MergeBench: A Benchmark for Merging Domain-Specialized LLMs

Paper • 2505.10833 • Published May 16, 2025 • 1

Scalable Data Synthesis for Computer Use Agents with Step-Level Filtering

Paper • 2512.10962 • Published Nov 22, 2025 • 3

Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic

Paper • 2408.13656 • Published Aug 24, 2024

Scaling Laws for Multilingual Language Models

Paper • 2410.12883 • Published Oct 15, 2024

updated 2 models about 22 hours ago

MergeBench/Llama-3.2-3B_instruction

Text Generation • 3B • Updated about 22 hours ago • 635

MergeBench/gemma-2-9b-it_math

Text Generation • 9B • Updated about 22 hours ago • 10

updated a model about 23 hours ago

MergeBench/gemma-2-9b_math

Text Generation • 9B • Updated about 23 hours ago • 15

published and updated a Space about 23 hours ago

README

updated a model about 23 hours ago

MergeBench/gemma-2-9b-it_instruction

Text Generation • 9B • Updated about 23 hours ago • 13

published and updated a dataset 10 months ago

MergeBench/safety_val

Viewer • Updated May 15, 2025 • 1k • 68

published and updated a dataset 10 months ago

MergeBench/multilingual_val

Viewer • Updated May 15, 2025 • 1k • 43

published and updated a dataset 10 months ago

MergeBench/math_val

Viewer • Updated May 15, 2025 • 1k • 280

published and updated a dataset 10 months ago

MergeBench/coding_val

Viewer • Updated May 15, 2025 • 1k • 54