Mask and You Shall Receive: Optimizing Masked Language Modeling For Pretraining BabyLMs Paper • 2510.20475 • Published Oct 23, 2025
EXECUTE: A Multilingual Benchmark for LLM Token Understanding Paper • 2505.17784 • Published May 23, 2025
Subword-Delimited Downsampling for Better Character-Level Translation Paper • 2212.01304 • Published Dec 2, 2022
EAGER: Entropy-Aware GEneRation for Adaptive Inference-Time Scaling Paper • 2510.11170 • Published Oct 13, 2025
Steering Large Language Models for Machine Translation Personalization Paper • 2505.16612 • Published May 22, 2025
QE4PE: Word-level Quality Estimation for Human Post-Editing Paper • 2503.03044 • Published Mar 4, 2025
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models Paper • 2310.10378 • Published Oct 16, 2023
Can Model Uncertainty Function as a Proxy for Multiple-Choice Question Item Difficulty? Paper • 2407.05327 • Published Jul 7, 2024