A lightweight explicit alignment recipe that adapts off-the-shelf VLMs into robust omni-modal embedding models. https://arxiv.org/abs/2601.03666
Haonan Chen PRO
Haon-Chen
AI & ML interests
None yet
Recent Activity
authored
a paper
1 day ago
e5-omni: Explicit Cross-modal Alignment for Omni-modal Embeddings
updated
a model
1 day ago
Haon-Chen/e5-omni-7B
updated
a model
1 day ago
Haon-Chen/e5-omni-3B
Organizations
Vidore-v2-full
SPEED
Aligned embedding data synthesis models and embedding model. Our paper: https://arxiv.org/pdf/2410.18634
MoCa
HomePage: https://haon-chen.github.io/MoCa/
mmE5
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data
-
intfloat/mmE5-mllama-11b-instruct
Zero-Shot Image Classification • 11B • Updated • 210 • 19 -
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data
Paper • 2502.08468 • Published • 15 -
intfloat/mmE5-synthetic
Viewer • Updated • 560k • 573 • 5 -
intfloat/mmE5-MMEB-hardneg
Viewer • Updated • 1.47M • 424 • 1
e5-omni
A lightweight explicit alignment recipe that adapts off-the-shelf VLMs into robust omni-modal embedding models. https://arxiv.org/abs/2601.03666
MoCa
HomePage: https://haon-chen.github.io/MoCa/
Vidore-v2-full
mmE5
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data
-
intfloat/mmE5-mllama-11b-instruct
Zero-Shot Image Classification • 11B • Updated • 210 • 19 -
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data
Paper • 2502.08468 • Published • 15 -
intfloat/mmE5-synthetic
Viewer • Updated • 560k • 573 • 5 -
intfloat/mmE5-MMEB-hardneg
Viewer • Updated • 1.47M • 424 • 1
SPEED
Aligned embedding data synthesis models and embedding model. Our paper: https://arxiv.org/pdf/2410.18634