AI & ML interests
Artificial General Intelligence
Recent Activity
View all activity
Papers
View all PapersLarge-Scale Visual Representation Model
-
DeepGlint-AI/mlcd-vit-bigG-patch14-448
Image Feature Extraction • 2B • Updated • 750 • 4 -
DeepGlint-AI/mlcd-vit-bigG-patch14-336
2B • Updated • 731 • 1 -
DeepGlint-AI/mlcd-vit-bigG-patch14-224
2B • Updated • 15 • 3 -
DeepGlint-AI/mlcd-vit-large-patch14-336
Feature Extraction • 0.3B • Updated • 140 • 10
UniME is a series of multimodal large language models trained for learning universal multimodal embedding.
-
DeepGlint-AI/UniME-Phi3.5-V-4.2B
Image-Text-to-Text • Updated • 285 • 7 -
DeepGlint-AI/UniME-LLaVA-1.6-7B
Image-Text-to-Text • 8B • Updated • 13 • 5 -
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs
Paper • 2504.17432 • Published • 41 -
DeepGlint-AI/UniME-LLaVA-OneVision-7B
Image-Text-to-Text • 8B • Updated • 117 • 3
UniME is a series of multimodal large language models trained for learning universal multimodal embedding.
-
DeepGlint-AI/UniME-Phi3.5-V-4.2B
Image-Text-to-Text • Updated • 285 • 7 -
DeepGlint-AI/UniME-LLaVA-1.6-7B
Image-Text-to-Text • 8B • Updated • 13 • 5 -
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs
Paper • 2504.17432 • Published • 41 -
DeepGlint-AI/UniME-LLaVA-OneVision-7B
Image-Text-to-Text • 8B • Updated • 117 • 3
Large-Scale Visual Representation Model
-
DeepGlint-AI/mlcd-vit-bigG-patch14-448
Image Feature Extraction • 2B • Updated • 750 • 4 -
DeepGlint-AI/mlcd-vit-bigG-patch14-336
2B • Updated • 731 • 1 -
DeepGlint-AI/mlcd-vit-bigG-patch14-224
2B • Updated • 15 • 3 -
DeepGlint-AI/mlcd-vit-large-patch14-336
Feature Extraction • 0.3B • Updated • 140 • 10