Scour
🤖 LLMs
Specific: large language models, GPT, Claude, generative AI
Scoured 373 posts in 6.7 ms

Claude Fable 5 is Mythos for the masses
✍️ Prompt Engineering · Content type: Blog · techzine.eu · 2 days ago

high-performance classification API (beats GPT-5.4-mini)
✍️ Prompt Engineering · Content type: Discussion · classer.ai · 17 hours ago · Hacker News

Boris Cherny, Claude Code creator, says he stopped manually prompting AI and now writes autonomous loops to orchestrate the model
✍️ Prompt Engineering · Content type: News · digg.com · 5 days ago

Price Drop: Save 90% on ChatPlayground AI lifetime plan, and compare multiple AI models
✍️ Prompt Engineering · neowin.net · 2 days ago

Google's new open-weights model brings image-generation tricks to AI text generation
✍️ Prompt Engineering · Content type: News · theregister.com · 12 hours ago

The Anthropic leader who built Claude Code says he ditched prompting — now he just writes loops.
✍️ Prompt Engineering · thenewstack.io · 1 day ago

CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?
🎮 GPU Computing · uccl-project.github.io · 23 hours ago · Hacker News

MLPerf and the rise of latency-aware LLM benchmarking
📈 Performance Engineering · edn.com · 6 days ago

harmansingh4163-ai/ESP-32-s3-Story-maker-LLM: 15M/42M-param Llama split across two ESP32-S3s over 3 wires — too big for either chip alone. INT4, flash mmap, bit-exact verified.
✍️ Prompt Engineering · Content type: Code · github.com · 40 minutes ago · Hacker News

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
📈 Performance Engineering · Content type: Blog · blogs.nvidia.com · 1 day ago

Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation
✍️ Prompt Engineering · aermia.com · 5 days ago · Hacker News

LLM Cheat Sheet
📈 Performance Engineering · Content type: Blog · drkpxl.bearblog.dev · 13 hours ago

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
⚡ LLM Inference · zozo123.github.io · 1 day ago · Hacker News

Mi50 32GB / GFX906 - vLLM Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit
⚡ LLM Inference · huggingface.co · 8 hours ago · r/LocalLLaMA

If LLMs are all persona, whose persona are they?
✍️ Prompt Engineering · persona.earthpilot.ai · 1 day ago · Hacker News

Report: GKE Inference Gateway delivers up to 92% faster AI responses
✍️ Prompt Engineering · Content type: Blog · cloud.google.com · 3 days ago · Hacker News

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
🎮 GPU Computing · phoronix.com · 1 day ago · r/artificial

New comment by alroma90 in "Ask HN: Who wants to be hired? (June 2026)"
✍️ Prompt Engineering · Content type: Discussion · news.ycombinator.com · 20 hours ago · Hacker News

NVIDIA A100 vs RTX 4090 for AI Workloads: The Cost Per FLOP Reality
✍️ Prompt Engineering · Content type: Blog · fitservers.com · 3 days ago

Everyone Was Searching for Better AI Prompts. Then One Markdown File Changed Everything
✍️ Prompt Engineering · Content type: Blog · medium.com · 20 hours ago