🧠 LLMs: GPT, Transformers, Inference, Fine-tuning
Scoured 938 posts in 7.8 ms
Opus 4.8 Thinking keeps deteriorating on Hard Prompts English in LMArena (again)
🏗️ MLSys · arena.ai · 3 days ago · r/singularity
Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent
🤖 AI · Content type: News · spectrum.ieee.org · 7 hours ago · Hacker News
Token4Token — pay-per-token inference on Gnosis + Swarm
🤖 Inference · t4t.eth.link · 1 day ago · Hacker News
LLM Inference Engineering Room — Part 3: The Orchestration Layer
🤖 Inference · Content type: Blog · vimal-dwarampudi.medium.com · 6 days ago
Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
🤖 Inference · Content type: News, Blog · developer.nvidia.com · 2 days ago
Anthropic’s AI fearmongering isn’t what it appears to be
🤖 AI · Content type: Blog · techzine.eu · 5 days ago
lightmetal: GPU LLM Inference From a Single Java 25 JAR
🤖 Inference · Content type: Blog · adambien.blog · 1 day ago
How LLMs work | Practical Leaders
🔧 Hardware · practical-leaders.com · 5 days ago · Hacker News
I built an open-source persistent memory layer for AI coding agents
🦀 Rust · Content type: Code · github.com · 23 hours ago · r/GithubCopilot
Build a Medical Report Analyzer on Dedicated Inference with Python
☁️ Cloud · digitalocean.com · 6 days ago
Alignment Collapse Under KV Cache Quantization: Diagnosis and Mitigation
🤖 Inference · Content type: Academic · arxiv.org · 14 hours ago
Deep Learning Weekly: Issue 458
☁️ Cloud · deeplearningweekly.com · 6 days ago
Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation
🏗️ MLSys · aermia.com · 3 days ago · Hacker News
GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)
🤖 Inference · vettedconsumer.com · 4 days ago · Hacker News
BacteReason: A Reasoning Model for Antimicrobial Resistance Prediction
🏗️ MLSys · Content type: Academic · biorxiv.org · 3 days ago
How we fight GPU scarcity without compromise
🤖 Inference · Content type: Blog · equixly.com · 5 days ago · Hacker News
Best explanations of how LLMs work
🏗️ MLSys · Content type: Blog · vorushin.github.io · 3 days ago · Hacker News
AI Model for Ancient Papyri.
🏗️ MLSys · languagehat.com · 6 days ago
The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
🏗️ MLSys · Content type: Academic · arxiv.org · 1 day ago
Speculators v0.5.0: DFlash support and online training
🤖 Inference · developers.redhat.com · 6 days ago