Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Model Evaluation
📊 Model Evaluation
Benchmarking, Performance Metrics, Testing, Validation
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
93
posts in
6.0
ms
Built and launched a research-reading and highlighting tool with Claude over a few months. Here are the things AI was surprisingly good (and bad) at.
🤖
AI
highlyt.app
·
2d
2 days ago
·
r/ClaudeAI
Actions for Built and launched a research-reading and highlighting tool with Claude over a few months. Here are the things AI was surprisingly good (and bad) at.
Welcome to
Machine
Learning
With Manya: The Ultimate Adventure Map!
🤖
AI
Content type:
Blog
medium.com
·
3d
3 days ago
Actions for Welcome to Machine Learning With Manya: The Ultimate Adventure Map!
A Controlled Study of Decoding-Time Truthfulness Methods on Instruction-Tuned LLMs
✍️
Prompt Engineering
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for A Controlled Study of Decoding-Time Truthfulness Methods on Instruction-Tuned LLMs
Hybrid vision transformer and ensemble
machine
learning
framework for automated atherosclerotic plaque classification in intravascular ultrasound imaging
🤖
AI
Content type:
Academic
nature.com
·
1d
1 day ago
Actions for Hybrid vision transformer and ensemble machine learning framework for automated atherosclerotic plaque classification in intravascular ultrasound imaging
Applying the CIPHER Framework to AI Data and Annotation Pipelines in Healthcare
⚖️
AI Governance
Content type:
Blog
medium.com
·
3d
3 days ago
Actions for Applying the CIPHER Framework to AI Data and Annotation Pipelines in Healthcare
Expert-Guided Supervised Annotation of Erythroid Differentiation in Single-Cell RNA-seq
🎛️
Fine-Tuning
Content type:
Academic
biorxiv.org
·
4d
4 days ago
Actions for Expert-Guided Supervised Annotation of Erythroid Differentiation in Single-Cell RNA-seq
Why Shrinking an AI
Model
Often Makes It More Useful
🤖
AI
siliconopera.com
·
4d
4 days ago
Actions for Why Shrinking an AI Model Often Makes It More Useful
DeMix: Debugging Training Data with Mixed Data Error Types by Investigating Influence Vectors
🎛️
Fine-Tuning
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for DeMix: Debugging Training Data with Mixed Data Error Types by Investigating Influence Vectors
🧾 Weekly Wrap Sheet (06/05/2026): Prospectuses & Platforms
🔬
Hallucination Detection
Content type:
News
Content type:
Blog
saanyaojha.substack.com
·
4d
4 days ago
·
Substack
Actions for 🧾 Weekly Wrap Sheet (06/05/2026): Prospectuses & Platforms
Generalizable self-supervised
learning
for imaging flow cytometry on multi-dataset leukocyte differential
🤖
AI
Content type:
Academic
nature.com
·
1d
1 day ago
Actions for Generalizable self-supervised learning for imaging flow cytometry on multi-dataset leukocyte differential
On the Study of Biometric Spoofing Detection using
Deep
Learning
🔬
Hallucination Detection
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for On the Study of Biometric Spoofing Detection using Deep Learning
When is Your
LLM
Steerable?
🛡
LLM safety
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for When is Your LLM Steerable?
A Reproducible and Extensible
Benchmark
of Supervised Cell Type Annotation Tools for Cytometry Data
🎛️
Fine-Tuning
Content type:
Academic
biorxiv.org
·
6d
6 days ago
Actions for A Reproducible and Extensible Benchmark of Supervised Cell Type Annotation Tools for Cytometry Data
When
Metrics
Disagree: A Meta-Analysis of Knowledge-Graph-Completion
Model
Benchmarking
🧬
Embeddings
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for When Metrics Disagree: A Meta-Analysis of Knowledge-Graph-Completion Model Benchmarking
When Does Delegation Beat Majority? A Delegation-Based Aggregator for Multi-Sample
LLM
Inference
✍️
Prompt Engineering
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for When Does Delegation Beat Majority? A Delegation-Based Aggregator for Multi-Sample LLM Inference
Multilingual Refusal Alignment for Safer
Large
Language
Models
🎯
AI Alignment
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Multilingual Refusal Alignment for Safer Large Language Models
Balancing Real and Synthetic Data for CNN-based Masonry Crack Detection
🔬
Hallucination Detection
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Balancing Real and Synthetic Data for CNN-based Masonry Crack Detection
LSTM based IoT Device Identification
💭
Context Management
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for LSTM based IoT Device Identification
Cross Paraphrastic Invariance
Learning
for Hallucination Detection
🔬
Hallucination Detection
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Cross Paraphrastic Invariance Learning for Hallucination Detection
Motion Reinforces Appearance: RGB-Skeleton Gated Residual Fusion for Micro-Gesture Online Recognition
🔬
Hallucination Detection
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for Motion Reinforces Appearance: RGB-Skeleton Gated Residual Fusion for Micro-Gesture Online Recognition
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help