Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
SOTA Models
🏆 SOTA Models
Specific
state of the art, frontier models, benchmark, model release
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
270
posts in
5.6
ms
MC-PDD: Masked Corpus-Level
Pretraining
Data Detection for Black-Box
Large
Language
Models
🧠
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for MC-PDD: Masked Corpus-Level Pretraining Data Detection for Black-Box Large Language Models
Finetuning masking challenges narrow-task evaluation of cell foundation
models
🧠
LLMs
Content type:
Academic
biorxiv.org
·
3d
3 days ago
Actions for Finetuning masking challenges narrow-task evaluation of cell foundation models
Less-relevant results
If Claude Fable stops helping you, you’ll never know
🎛️
Fine-tuning
simonwillison.net
·
12h
12 hours ago
·
Hacker News
Actions for If Claude Fable stops helping you, you’ll never know
The AI
models
finding 10,000 vulnerabilities are the same ones China is trying to copy. That is the problem.
🌐
Open Source AI
Content type:
News
thenextweb.com
·
2d
2 days ago
Actions for The AI models finding 10,000 vulnerabilities are the same ones China is trying to copy. That is the problem.
Task-Seeded Synthetic Q&A Generation for Nemotron
Pretraining
🌐
Open Source AI
Content type:
Blog
huggingface.co
·
6d
6 days ago
Actions for Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining
A generalist biomedical
vision-language
model
via multi-CLIP knowledge distillation
🧠
LLMs
Content type:
Academic
nature.com
·
13h
13 hours ago
Actions for A generalist biomedical vision-language model via multi-CLIP knowledge distillation
Train
Models
Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
⚡
Inference
Content type:
News
Content type:
Blog
developer.nvidia.com
·
1d
1 day ago
Actions for Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
New comment by bkjlblh in "Claude Fable 5"
🎛️
Fine-tuning
Content type:
Discussion
news.ycombinator.com
·
19h
19 hours ago
·
Hacker News
Actions for New comment by bkjlblh in "Claude Fable 5"
Tracing Eval-Awareness Emergence Through Training of OLMo 3
🎛️
Fine-tuning
lesswrong.com
·
2h
2 hours ago
Actions for Tracing Eval-Awareness Emergence Through Training of OLMo 3
bigattichouse/packed-twin-inference
: PTI achieves ~2× throughput using a single quantized
model
(Q5_K_M or better) by running 4 generation streams in one batched decode call. The GPU loads
model
weights
once per step and produces 4 predictions simultaneously. KV cache overhead is ~0.8 GiB total for all 4 streams. No draft
model
. No quality loss
⚡
Inference
Content type:
Code
github.com
·
1d
1 day ago
·
r/LocalLLaMA
Actions for bigattichouse/packed-twin-inference: PTI achieves ~2× throughput using a single quantized model (Q5_K_M or better) by running 4 generation streams in one batched decode call. The GPU loads model weights once per step and produces 4 predictions simultaneously. KV cache overhead is ~0.8 GiB total for all 4 streams. No draft model. No quality loss
On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.
🤖
AI Agents
venturebeat.com
·
19h
19 hours ago
Actions for On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.
SAR updates its first homegrown AI
model
- Azərtac
🌐
Open Source AI
azertag.az
·
6d
6 days ago
Actions for SAR updates its first homegrown AI model - Azərtac
BRAND ANALYSIS: I IS FOR INSTAGRAM
⚡
Inference
Content type:
Blog
nemesisglobal.substack.com
·
6d
6 days ago
·
Substack
Actions for BRAND ANALYSIS: I IS FOR INSTAGRAM
To discover new physics, AI may need to 'unlearn' the old one
🤖
AI Agents
phys.org
·
1d
1 day ago
Actions for To discover new physics, AI may need to 'unlearn' the old one
Nvidia Nemotron 3 Ultra
🎛️
Fine-tuning
research.nvidia.com
·
6d
6 days ago
·
Hacker News
Actions for Nvidia Nemotron 3 Ultra
Research Proposal: Decoupled
RISC-LLM
Architectures via Circadian Synaptic Consolidation
🎛️
Fine-tuning
aermia.com
·
3d
3 days ago
·
Hacker News
Actions for Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation
Mix, Don't Pick: Why Synthetic Corpus Composition Matters for Time Series Foundation
Model
Pretraining
🧠
LLMs
Content type:
Academic
arxiv.org
·
9h
9 hours ago
Actions for Mix, Don't Pick: Why Synthetic Corpus Composition Matters for Time Series Foundation Model Pretraining
Coverage-driven alignment - What ‘Teaching Claude Why’ can borrow from AV verification
🎼
Agent Orchestration
lesswrong.com
·
2d
2 days ago
Actions for Coverage-driven alignment - What ‘Teaching Claude Why’ can borrow from AV verification
Sorry, not sorry (Ideogram jailbroken in 1 easy step)
📐
Context Engineering
gist.github.com
·
6d
6 days ago
·
r/StableDiffusion
Actions for Sorry, not sorry (Ideogram jailbroken in 1 easy step)
OpenAI responds to White House executive order on AI governance
🌐
Open Source AI
csoonline.com
·
5d
5 days ago
Actions for OpenAI responds to White House executive order on AI governance
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help