Commit
·
40d957d
1
Parent(s):
210a373
Update README.md
Browse files
README.md
CHANGED
@@ -40,8 +40,8 @@ the binay token classification task of predicting binding sites (and active site
|
|
40 |
The model may need more training, however it still achieves better performance on the test set in terms of loss, accuracy,
|
41 |
precision, recall, F1 score, ROC_AUC, and Matthews Correlation Coefficient (MCC) compared to the models trained on the smaller
|
42 |
dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sites_random_split_by_family) of ~209K protein sequences. Note,
|
43 |
-
this model has a high recall, meaning it is likely to detect binding sites, but it has a
|
44 |
-
false positives as well.
|
45 |
|
46 |
## Training procedure
|
47 |
|
|
|
40 |
The model may need more training, however it still achieves better performance on the test set in terms of loss, accuracy,
|
41 |
precision, recall, F1 score, ROC_AUC, and Matthews Correlation Coefficient (MCC) compared to the models trained on the smaller
|
42 |
dataset [found here](https://huggingface.co/datasets/AmelieSchreiber/binding_sites_random_split_by_family) of ~209K protein sequences. Note,
|
43 |
+
this model has a high recall, meaning it is likely to detect binding sites, but it has a precision score that is somewhat lower than the SOTA
|
44 |
+
structural models mentioned above, meaning the model may return some false positives as well.
|
45 |
|
46 |
## Training procedure
|
47 |
|