Anthropogenic Regional Adaptation in Multimodal Vision-Language Model Paper β’ 2604.11490 β’ Published 23 days ago β’ 15
Apertus LLM Collection Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages β’ 4 items β’ Updated Oct 1, 2025 β’ 349
Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability Paper β’ 2506.01789 β’ Published Jun 2, 2025 β’ 15
ReasonIR: Training Retrievers for Reasoning Tasks Paper β’ 2504.20595 β’ Published Apr 29, 2025 β’ 54
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper β’ 2503.07920 β’ Published Mar 10, 2025 β’ 101
Multilingual Large Language Models Are Not (Yet) Code-Switchers Paper β’ 2305.14235 β’ Published May 23, 2023
LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis Paper β’ 2103.15348 β’ Published Mar 29, 2021
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark Paper β’ 2406.05967 β’ Published Jun 10, 2024 β’ 6
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages Paper β’ 2406.10118 β’ Published Jun 14, 2024 β’ 32
MINERS: Multilingual Language Models as Semantic Retrievers Paper β’ 2406.07424 β’ Published Jun 11, 2024
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper β’ 2503.07920 β’ Published Mar 10, 2025 β’ 101
The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling Paper β’ 2410.09223 β’ Published Oct 11, 2024 β’ 5
The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling Paper β’ 2410.09223 β’ Published Oct 11, 2024 β’ 5
The Same But Different: Structural Similarities and Differences in Multilingual Language Modeling Paper β’ 2410.09223 β’ Published Oct 11, 2024 β’ 5 β’ 2