impresso-project/wiki_comparable_corpus_en_de_hi_it_ko_zh Viewer • Updated about 21 hours ago • 69.2k • 12
impresso-project/ner-stacked-bert-multilingual-v1.1.0 Token Classification • 42.1M • Updated 2 days ago • 3.43k • 2
Running Multilingual Named Entity Recognition 👻 Multilingual Named Entity Recognition in Historical Data
impresso-project/halloween_workshop_ocr_robust_preview Sentence Similarity • 0.3B • Updated 4 days ago • 65
impresso-project/halloween_workshop_ocr_robust_with_lux_preview Sentence Similarity • 0.3B • Updated 4 days ago • 159
impresso-project/OCR-robust-gte-multilingual-base Sentence Similarity • 0.3B • Updated Oct 23, 2025 • 110
impresso-project/halloween_workshop_ocr_robust_with_lux_preview Sentence Similarity • 0.3B • Updated 4 days ago • 159
impresso-project/halloween_workshop_ocr_robust_preview Sentence Similarity • 0.3B • Updated 4 days ago • 65
impresso-project/histlux-paraphrase-multilingual-mpnet-base-v2 Sentence Similarity • 0.3B • Updated Jul 20, 2025 • 4
impresso-project/histlux-gte-multilingual-base Sentence Similarity • 0.3B • Updated Jul 20, 2025 • 11
impresso-project/OCR-robust-gte-multilingual-base Sentence Similarity • 0.3B • Updated Oct 23, 2025 • 110
impresso-project/histlux-gte-multilingual-base Sentence Similarity • 0.3B • Updated Jul 20, 2025 • 11
impresso-project/histlux-paraphrase-multilingual-mpnet-base-v2 Sentence Similarity • 0.3B • Updated Jul 20, 2025 • 4
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 44
PARAPHRASUS : A Comprehensive Benchmark for Evaluating Paraphrase Detection Models Paper • 2409.12060 • Published Sep 18, 2024