Discussion about this post

User's avatar
Martin's avatar

Do you plan on posting any PR AUC or precision metrics? AU ROC is great but when we have millions of samples with 1000 true binders the model can predict all as non-binders and have 100% accuracy. Seeing how Hermes does on your data where the metric is PR AUC or precision would tell a much better story, in my opinion. Also, any thoughts on using datasets with a higher ligand to protein ratio? Your private set has an average of 212 ligands per target (15,030/71). Some datasets have 50k+ ligands and only 200 true binders. The numbers seem slightly inflated here.

Expand full comment
1 more comment...

No posts