Artefact

company

AI & ML interests

NLP, Information Retrieval, Computer Vision, Uncertainty Estimation, Trustworthy AI, Bias Estimation, Unbalanced ML, Choice Modeling, Time Series

Recent Activity

CharlesMoslonka updated a collection 1 day ago

LEDGER

CharlesMoslonka published a dataset 1 day ago

artefactory/ledger-market-sentiment

CharlesMoslonka updated a collection 1 day ago

LEDGER

View all activity

Papers

BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation

Learned Hallucination Detection in Black-Box LLMs using Token-level Entropy Production Rate

View all Papers

artefactory 's collections 7

LEDGER

A Long-Context Benchmark of Corporate Annual Reports for Grounded Financial Retrieval and Extraction

artefactory/ledger-long-context-multi-kpi

Updated 1 day ago • 10
artefactory/ledger-long-context-KPI-QA

Updated 1 day ago • 10
artefactory/ledger-market-sentiment

Updated 1 day ago • 10

Artefactual: LLM Hallucination detection

Learned Hallucination Detection in Black-Box LLMs using Token-level Entropy Production Rate

Paper • 2509.04492 • Published Sep 1, 2025 • 10

EuroBERT

Suite of Encoder models EuroBERT

EuroBERT: Scaling Multilingual Encoders for European Languages

Paper • 2503.05500 • Published Mar 7, 2025 • 81
EuroBERT/EuroBERT-210m

Fill-Mask • 0.3B • Updated Oct 18, 2025 • 9.49k • 84
EuroBERT/EuroBERT-610m

Fill-Mask • 0.8B • Updated Oct 18, 2025 • 4.03k • 33
EuroBERT/EuroBERT-2.1B

Fill-Mask • 2B • Updated Oct 18, 2025 • 992 • 67

Translation Alignment Analysis

Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis

Paper • 2409.20059 • Published Sep 30, 2024 • 16
artefactory/ALMA-13B-LoRA

Text Generation • 13B • Updated Jun 15, 2024 • 7
artefactory/ALMA-13B-LoRA-SFT-xCOMET-QE-Multi

Text Generation • 13B • Updated Jun 22, 2024 • 11
artefactory/ALMA-13B-LoRA-SFT-xCOMET-QE-Multi-No-Base

Text Generation • 13B • Updated Jul 25, 2024 • 10

BERT-as-a-Judge

BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation

Paper • 2604.09497 • Published Apr 10 • 29
artefactory/BERTJudge

Text Classification • 0.2B • Updated Apr 16 • 23 • 7
artefactory/BERTJudge-Free-CR

Text Classification • 0.2B • Updated Apr 16 • 3 • 1
artefactory/BERTJudge-Formatted-QCR

Text Classification • 0.2B • Updated Apr 16 • 8 • 1

MLM versus CLM for NLP tasks

Should We Still Pretrain Encoders with Masked Language Modeling?

Paper • 2507.00994 • Published Jul 1, 2025 • 81
MLMvsCLM/610m-mlm30-42k

Feature Extraction • Updated Jul 4, 2025 • 35
MLMvsCLM/610m-mlm40-42k-2000

Feature Extraction • Updated Jul 4, 2025 • 34
MLMvsCLM/610m-clm-17k-mlm40-22k

Feature Extraction • Updated Jul 4, 2025 • 34

Abstention Reranking

Towards Trustworthy Reranking: A Simple yet Effective Abstention Mechanism

Paper • 2402.12997 • Published Feb 20, 2024 • 9
artefactory/abstention-reranking-benchmark

Viewer • Updated Oct 2, 2024 • 132 • 143
MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19, 2025 • 49

LEDGER

A Long-Context Benchmark of Corporate Annual Reports for Grounded Financial Retrieval and Extraction

artefactory/ledger-long-context-multi-kpi

Updated 1 day ago • 10
artefactory/ledger-long-context-KPI-QA

Updated 1 day ago • 10
artefactory/ledger-market-sentiment

Updated 1 day ago • 10

BERT-as-a-Judge

BERT-as-a-Judge: A Robust Alternative to Lexical Methods for Efficient Reference-Based LLM Evaluation

Paper • 2604.09497 • Published Apr 10 • 29
artefactory/BERTJudge

Text Classification • 0.2B • Updated Apr 16 • 23 • 7
artefactory/BERTJudge-Free-CR

Text Classification • 0.2B • Updated Apr 16 • 3 • 1
artefactory/BERTJudge-Formatted-QCR

Text Classification • 0.2B • Updated Apr 16 • 8 • 1

Artefactual: LLM Hallucination detection

Learned Hallucination Detection in Black-Box LLMs using Token-level Entropy Production Rate

Paper • 2509.04492 • Published Sep 1, 2025 • 10

MLM versus CLM for NLP tasks

Should We Still Pretrain Encoders with Masked Language Modeling?

Paper • 2507.00994 • Published Jul 1, 2025 • 81
MLMvsCLM/610m-mlm30-42k

Feature Extraction • Updated Jul 4, 2025 • 35
MLMvsCLM/610m-mlm40-42k-2000

Feature Extraction • Updated Jul 4, 2025 • 34
MLMvsCLM/610m-clm-17k-mlm40-22k

Feature Extraction • Updated Jul 4, 2025 • 34

EuroBERT

Suite of Encoder models EuroBERT

EuroBERT: Scaling Multilingual Encoders for European Languages

Paper • 2503.05500 • Published Mar 7, 2025 • 81
EuroBERT/EuroBERT-210m

Fill-Mask • 0.3B • Updated Oct 18, 2025 • 9.49k • 84
EuroBERT/EuroBERT-610m

Fill-Mask • 0.8B • Updated Oct 18, 2025 • 4.03k • 33
EuroBERT/EuroBERT-2.1B

Fill-Mask • 2B • Updated Oct 18, 2025 • 992 • 67

Abstention Reranking

Towards Trustworthy Reranking: A Simple yet Effective Abstention Mechanism

Paper • 2402.12997 • Published Feb 20, 2024 • 9
artefactory/abstention-reranking-benchmark

Viewer • Updated Oct 2, 2024 • 132 • 143
MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19, 2025 • 49

Translation Alignment Analysis

Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis

Paper • 2409.20059 • Published Sep 30, 2024 • 16
artefactory/ALMA-13B-LoRA

Text Generation • 13B • Updated Jun 15, 2024 • 7
artefactory/ALMA-13B-LoRA-SFT-xCOMET-QE-Multi

Text Generation • 13B • Updated Jun 22, 2024 • 11
artefactory/ALMA-13B-LoRA-SFT-xCOMET-QE-Multi-No-Base

Text Generation • 13B • Updated Jul 25, 2024 • 10

AI & ML interests

Recent Activity

Papers

Team members 5

artefactory 's collections 7