Commit 4c9aea3 by nreimers · 1 Parent(s): bf98750
readme
README.md
ADDED
@@ -0,0 +1,18 @@
# Cross-Encoder for Quora Duplicate Questions Detection
This model was trained using the [SentenceTransformers](https://sbert.net) [Cross-Encoder](https://www.sbert.net/examples/applications/cross-encoder/README.html) class.
## Training Data
This model was trained on the [Quora Duplicate Questions](https://www.quora.com/q/quoradata/First-Quora-Dataset-Release-Question-Pairs) dataset. The model predicts a score between 0 and 1 indicating how likely the two given questions are to be duplicates.

Note: The model is not suitable for estimating the similarity of questions. For example, the two questions "How to learn Java" and "How to learn Python" will receive a rather low score, as they are not duplicates.
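
To make the score interpretation concrete, here is a minimal sketch of turning scores into duplicate / not-duplicate decisions. The helper name `label_pairs`, the threshold of 0.5, and the example scores are all illustrative assumptions; they are not outputs of the model or values from its training setup.

```python
from typing import List, Tuple


def label_pairs(
    pairs: List[Tuple[str, str]],
    scores: List[float],
    threshold: float = 0.5,  # assumed cut-off; tune it on held-out data
) -> List[Tuple[str, str, bool]]:
    """Attach a duplicate / not-duplicate decision to each question pair."""
    if len(pairs) != len(scores):
        raise ValueError("pairs and scores must have the same length")
    return [(q1, q2, score >= threshold) for (q1, q2), score in zip(pairs, scores)]


# Made-up scores for illustration only (not produced by the model)
pairs = [
    ("How to learn Java", "How to learn Python"),
    ("How do I learn Java?", "What is the best way to learn Java?"),
]
example_scores = [0.05, 0.90]
labels = label_pairs(pairs, example_scores)
```

A higher threshold trades recall for precision, so the right value depends on how costly a false duplicate is in your application.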
## Usage and Performance
Pre-trained models can be used like this:
```
from sentence_transformers import CrossEncoder

# 'model_name' is a placeholder for the model identifier
model = CrossEncoder('model_name')
# Each tuple is one question pair; predict() returns one score per pair
scores = model.predict([('Question 1', 'Question 2'), ('Question 3', 'Question 4')])
```
You can also use this model without sentence_transformers, using only the Transformers ``AutoModel`` class.
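
A minimal sketch of that approach is shown here, under several assumptions. It uses `AutoModelForSequenceClassification` (a task-specific `AutoModel` variant) rather than the bare `AutoModel`, on the assumption that a classification head is needed to obtain a score. It also assumes the checkpoint emits a single logit per pair and that a sigmoid maps that logit to the 0 to 1 range. The helper names `sigmoid` and `score_pairs` are hypothetical, and `model_name` remains a placeholder.

```python
import math
from typing import List, Tuple


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def score_pairs(model_name: str, pairs: List[Tuple[str, str]]) -> List[float]:
    """Score question pairs with plain Transformers (hypothetical helper)."""
    # Imported lazily so that sigmoid() is usable without these packages installed
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSequenceClassification.from_pretrained(model_name)
    model.eval()

    # The tokenizer encodes each (first, second) pair into one joint input
    first = [q1 for q1, _ in pairs]
    second = [q2 for _, q2 in pairs]
    features = tokenizer(first, second, padding=True, truncation=True, return_tensors="pt")

    with torch.no_grad():
        logits = model(**features).logits  # assumed shape: (num_pairs, 1)

    # Assumption: one logit per pair, mapped to a 0..1 score with a sigmoid
    return [sigmoid(value) for value in logits.squeeze(-1).tolist()]
```

Calling `score_pairs('model_name', [('Question 1', 'Question 2')])` with the real model identifier in place of `'model_name'` would then return one score per pair. If the checkpoint has more than one output label, the last step needs to be adapted (for example, a softmax over the labels).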