🏆 SOTA Models
Specific
state of the art, frontier models, benchmark, model release
Scoured 279 posts in 4.9 ms
MC-PDD: Masked Corpus-Level Pretraining Data Detection for Black-Box Large Language Models
🧠 LLMs · Content type: Academic · arxiv.org · 1 day ago
Finetuning masking challenges narrow-task evaluation of cell foundation models
🧠 LLMs · Content type: Academic · biorxiv.org · 3 days ago
The Inference Alpha: Maximizing Frontier Models on AMD
⚡ Inference · Content type: Blog · digitalocean.com · 5 hours ago
The AI models finding 10,000 vulnerabilities are the same ones China is trying to copy. That is the problem.
🌐 Open Source AI · Content type: News · thenextweb.com · 2 days ago
Less-relevant results
Estimating No-CoT Task-Completion Time Horizons of Frontier AI Models
✍️ Prompt Engineering · lesswrong.com · 1 hour ago
Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining
🌐 Open Source AI · Content type: Blog · huggingface.co · 6 days ago
Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
⚡ Inference · Content type: News, Blog · developer.nvidia.com · 2 days ago
A generalist biomedical vision-language model via multi-CLIP knowledge distillation
🧠 LLMs · Content type: Academic · nature.com · 19 hours ago
Launch HN: General Instinct (YC P26) – Frontier models on edge devices
🌐 Open Source AI · Content type: Discussion · news.ycombinator.com · 5 days ago · Hacker News
bigattichouse/packed-twin-inference: PTI achieves ~2× throughput using a single quantized model (Q5_K_M or better) by running 4 generation streams in one batched decode call. The GPU loads model weights once per step and produces 4 predictions simultaneously. KV cache overhead is ~0.8 GiB total for all 4 streams. No draft model. No quality loss
⚡ Inference · Content type: Code · github.com · 1 day ago · r/LocalLLaMA
If Claude Fable stops helping you, you’ll never know
🎛️ Fine-tuning · simonwillison.net · 19 hours ago · Hacker News
SAR updates its first homegrown AI model - Azərtac
🌐 Open Source AI · azertag.az · 6 days ago
On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.
🤖 AI Agents · venturebeat.com · 1 day ago
BRAND ANALYSIS: I IS FOR INSTAGRAM
⚡ Inference · Content type: Blog · nemesisglobal.substack.com · 6 days ago · Substack
Tracing Eval-Awareness Emergence Through Training of OLMo 3
🎛️ Fine-tuning · lesswrong.com · 9 hours ago
To discover new physics, AI may need to 'unlearn' the old one
🤖 AI Agents · phys.org · 1 day ago
Nvidia Nemotron 3 Ultra
🎛️ Fine-tuning · research.nvidia.com · 6 days ago · Hacker News
Mix, Don't Pick: Why Synthetic Corpus Composition Matters for Time Series Foundation Model Pretraining
🧠 LLMs · Content type: Academic · arxiv.org · 15 hours ago
Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation
🎛️ Fine-tuning · aermia.com · 3 days ago · Hacker News
Sorry, not sorry (Ideogram jailbroken in 1 easy step)
📐 Context Engineering · gist.github.com · 6 days ago · r/StableDiffusion