AI & ML interests
NLP, Information Retrieval, Computer Vision, Uncertainty Estimation, Trustworthy AI, Bias Estimation, Unbalanced ML, Choice Modeling, Time Series
Recent Activity
View all activity
Papers
View all Papers
Suite of Encoder models EuroBERT
-
EuroBERT/EuroBERT-210m
Fill-Mask • 0.3B • Updated • 4.97k • 78 -
EuroBERT/EuroBERT-610m
Fill-Mask • 0.8B • Updated • 1.43k • 32 -
EuroBERT/EuroBERT-2.1B
Fill-Mask • 2B • Updated • 278 • 64 -
EuroBERT: Scaling Multilingual Encoders for European Languages
Paper • 2503.05500 • Published • 80
Related paper: "Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis" (accepted at WMT 2024)
-
Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis
Paper • 2409.20059 • Published • 16 -
hgissbkh/ALMA-13B-LoRA
Text Generation • 13B • Updated • 2 -
hgissbkh/ALMA-13B-LoRA-SFT-xCOMET-QE-Multi
Text Generation • 13B • Updated • 3 -
hgissbkh/ALMA-13B-LoRA-SFT-xCOMET-QE-Multi-No-Base
Text Generation • 13B • Updated • 2
Related paper: "Should We Still Pretrain Encoders with Masked Language Modeling?"
Suite of Encoder models EuroBERT
-
EuroBERT/EuroBERT-210m
Fill-Mask • 0.3B • Updated • 4.97k • 78 -
EuroBERT/EuroBERT-610m
Fill-Mask • 0.8B • Updated • 1.43k • 32 -
EuroBERT/EuroBERT-2.1B
Fill-Mask • 2B • Updated • 278 • 64 -
EuroBERT: Scaling Multilingual Encoders for European Languages
Paper • 2503.05500 • Published • 80
Related paper: "Towards Trustworthy Reranking: A Simple yet Effective Abstention Mechanism" (accepted at TMLR 2024)
Related paper: "Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis" (accepted at WMT 2024)
-
Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis
Paper • 2409.20059 • Published • 16 -
hgissbkh/ALMA-13B-LoRA
Text Generation • 13B • Updated • 2 -
hgissbkh/ALMA-13B-LoRA-SFT-xCOMET-QE-Multi
Text Generation • 13B • Updated • 3 -
hgissbkh/ALMA-13B-LoRA-SFT-xCOMET-QE-Multi-No-Base
Text Generation • 13B • Updated • 2