How to use cross-encoder/quora-roberta-large with sentence-transformers:
from sentence_transformers import CrossEncoder

model = CrossEncoder("cross-encoder/quora-roberta-large")

query = "Which planet is known as the Red Planet?"
passages = [
    "Venus is often called Earth's twin because of its similar size and proximity.",
    "Mars, known for its reddish appearance, is often referred to as the Red Planet.",
    "Jupiter, the largest planet in our solar system, has a prominent red spot.",
    "Saturn, famous for its rings, is sometimes mistaken for the Red Planet."
]

# Score each (query, passage) pair; higher scores indicate a closer match.
scores = model.predict([(query, passage) for passage in passages])
print(scores)

How to use cross-encoder/quora-roberta-large with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("cross-encoder/quora-roberta-large")
model = AutoModelForSequenceClassification.from_pretrained("cross-encoder/quora-roberta-large")

This model was trained using the SentenceTransformers CrossEncoder class.
This model was trained on the Quora Duplicate Questions dataset. It predicts a score between 0 and 1 indicating how likely it is that the two given questions are duplicates.
Note: The model is not suitable for estimating general question similarity. For example, the two questions "How to learn Java" and "How to learn Python" will receive a rather low score, because they are not duplicates.
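To make this distinction concrete, the sketch below scores a duplicate pair alongside the related-but-not-duplicate pair from the note. The rephrased Java pair is an invented example; the model card only guarantees the behavior described above:

from sentence_transformers import CrossEncoder

model = CrossEncoder("cross-encoder/quora-roberta-large")

pairs = [
    # Two rephrasings of the same question: expected to score high.
    ("How can I learn Java?", "What is the best way to learn Java?"),
    # Related topics, but different questions: expected to score low (see note above).
    ("How to learn Java", "How to learn Python"),
]
scores = model.predict(pairs)
for (q1, q2), score in zip(pairs, scores):
    print(f"{score:.3f}  {q1!r} vs. {q2!r}")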
Pre-trained models can be used like this:
from sentence_transformers import CrossEncoder
model = CrossEncoder('cross-encoder/quora-roberta-large')
scores = model.predict([('Question 1', 'Question 2'), ('Question 3', 'Question 4')])
You can also use this model without sentence_transformers, by using the Transformers AutoModelForSequenceClassification class directly:
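A minimal sketch of that route, assuming the model outputs a single logit per pair that maps to the 0-1 duplicate score through a sigmoid (the usual setup for models trained with the CrossEncoder class):

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("cross-encoder/quora-roberta-large")
model = AutoModelForSequenceClassification.from_pretrained("cross-encoder/quora-roberta-large")
model.eval()

# Encode the two questions together as a single (text, text_pair) input.
features = tokenizer(
    ["How to learn Java"],
    ["How can I learn Java?"],
    padding=True,
    truncation=True,
    return_tensors="pt",
)

with torch.no_grad():
    logits = model(**features).logits           # shape: (batch_size, 1)
    scores = torch.sigmoid(logits).squeeze(-1)  # duplicate score in [0, 1]

print(scores)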