ROC Curves

Model Evaluation, Classification, True Positive Rate, Threshold Selection

Feeds to Scour
SubscribedAll
Scoured 40 posts in 10.0 ms

Harmfulness Directions in OLMo

 🤖Scikit-learn
lesswrong.com·

Evaluation Metrics for Regression and Classification Models

 📐Gini Coefficient  Content type: Blog
medium.com·

CaliPPer: quantifying, predicting and improving AI model performance for binding prediction

 🤖Scikit-learn  Content type: Academic
arxiv.org·

Handshake: Partner-Specific Protein-Protein Binding Site Prediction at Scale Using ProstT5 and Cross-Chain Attention

 🤖Scikit-learn  Content type: Academic
biorxiv.org·

Keeping a healthy degree of AI skepticism: Knowing the metrics that matter and asking the right questions

 📐Gini Coefficient
aasm.org·

Release v2.0.0 · leochlon/hallbayes

 ❄️NixOS  Content type: Code
github.com·

Applying the CIPHER Framework to AI Data and Annotation Pipelines in Healthcare

 🌪️Chaos Engineering  Content type: Blog
medium.com·

openpilot 0.11.1

 🐛Fuzz Testing  Content type: Blog
blog.comma.ai·

Weekly reads 1/06/26

 🎲Bootstrap Methods  Content type: News  Content type: Blog

SpliceBind: Isoform-Aware Prediction of Binding Pocket Druggability

 🤖Scikit-learn  Content type: Academic
arxiv.org·

Learning residue-level context for modeling protein-protein interactions

 🤖Scikit-learn  Content type: Academic
biorxiv.org·

BEACON: Behavioral Entropy Aggregation for Cross-Model Hallucination Detection in Large Language Models

 🎲Property-Based Testing  Content type: Academic
arxiv.org·

Logits as a new monitor for evaluation awareness

 📐Gini Coefficient
lesswrong.com··Hacker News

Predicting P-glycoprotein Substrate Status Using a Pretrained Graph Neural Network: A TDC Benchmark Study

 🤖Scikit-learn  Content type: Academic
biorxiv.org·

Routine laboratory trajectories encode the onset of organ-level complications in cancer

 📊Survival Analysis  Content type: Academic
arxiv.org·

Trait-space Monitoring for Emergent Misalignment During Supervised Finetuning

 🎲Property-Based Testing  Content type: Academic
arxiv.org·

Training Deliberative Monitors for Black-Box Scheming Detection

 🎲Property-Based Testing
lesswrong.com·

GenEyePose: Patient-Free, Knowledge-Based Saccadic Eye Movement Modeling for Digital Neurophysiologic Biomarker Development

 🤖Scikit-learn  Content type: Academic
arxiv.org·

Wearable Single-Lead ECG Detects Fine-Grained Structural Heart Disease Through Echo-Report Supervision

 🎛statistical process control  Content type: Academic
arxiv.org·

RadOT-Eval: Auditable Structured-Evidence Transport for Radiology Report Evaluation

 📊Survival Analysis  Content type: Academic
arxiv.org·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help