📊 LLM Evals
model evaluation, benchmarks, evals
Scoured 6920 posts in 27.1 ms
Some notes on unreliability of LLM APIs
andrewpwheeler.com · 6h · Discuss: Hacker News
🐛 Fuzzing
LLM-Based Evolution as a Universal Optimizer
imbue.com · 2h · Discuss: Hacker News
💬 Prompt Engineering
Optimizing LLM Inference: Sparse Activation, MoE, and Gated-MLP Efficiency
hackernoon.com · 20h
💬 Prompt Engineering
felix-clark/ndarray-glm: Rust library for linear, logistic, and generalized linear model regression
github.com · 1d · Discuss: r/rust
🦙 Local LLM
What can a 3B LLM actually do on an i5 with 8GB RAM? I benchmarked 10 real-world task categories
rishavchatterjee.com · 1d · Discuss: r/selfhosted
⚙️ Performance Profiling
Bridging the Gap: Diagnosing Online–Offline Discrepancy in Pinterest’s L1 Conversion Models
medium.com · 6h
⌚ Quantified Self
Can LLMs SAT?
blog.aiono.dev · 1d · Discuss: Lobsters
🦙 Local LLM
Reinforcement Learning for LLMs
mesuvash.github.io · 2d · Discuss: Hacker News
💬 Prompt Engineering
How to choose and implement an LLM for your healthcare product
thoughtbot.com · 3d
🤨 AI Criticism
An AI agent coding skeptic tries AI agent coding, in excessive detail
simonwillison.net · 2h
🤨 AI Criticism
Instant LLM Updates with Doc-to-LoRA and Text-to-LoRA
pub.sakana.ai · 3h · Discuss: Lobsters, Hacker News
💬 Prompt Engineering
Geekbench: Tensor G6
browser.geekbench.com · 21h · Discuss: r/Android
⚙️ Performance Profiling
Analyzing ReLUfication Limitations: Enhancing LLM Sparsity via Up Projection
hackernoon.com · 20h
💬 Prompt Engineering
Episode #286: Overcoming Testing Obstacles With Python's Mock Object Library
realpython.com · 11h
🐛 Fuzzing
How Large Language Models Learn
blog.bytebytego.com · 4d
💬 Prompt Engineering
Quo Vadis, LLM Benchmarks?
florianbrand.com · 1d · Discuss: Hacker News
⚙️ Performance Profiling
"LLMs Out of Context"
lucek.ai · 22h · Discuss: Hacker News
💬 Prompt Engineering
Best Self-Hosted LLM Leaderboard 2026 | Open-Weight Model Rankings for Enterprise
onyx.app · 1d · Discuss: Hacker News
🏠 Self-Hosting
pplx-embed: State-of-the-Art Embedding Models for Web-Scale Retrieval
research.perplexity.ai · 17h · Discuss: Hacker News
🗂️ Vector Databases
The AI Transformation Framework
zapier.com · 18h · Discuss: Hacker News
🤨 AI Criticism