Scour
🤖 AI
artificial intelligence, machine learning, LLM, generative AI
Scoured 430 posts in 6.8 ms
heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.
💬 LLMs · Content type: Code · github.com · 3 days ago · r/LocalLLaMA
google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation
💬 LLMs · huggingface.co · 2 days ago · r/LocalLLaMA
Claude Fable 5 is Mythos for the masses
🤖 AI in Games · Content type: Blog · techzine.eu · 1 day ago
How J.A.R.V.I.S. Became the Smartest Mind on Earth — What is an LLM?
💬 LLMs · Content type: Blog · medium.com · 3 days ago
The Transformer Architecture: A Step-by-Step Guide
✨ Generative AI · Content type: Blog · m7mdelyoussef.medium.com · 5 hours ago
Start Up No.2680: Apple to relaunch Siri *again*, jet fuel shortage hits Brazil, astrophysicists see LLM future, and more
📰 AI News · Content type: Blog · theoverspill.blog · 2 days ago
GPU Servers for Best Performance
⚙️ Game Engines · leaseweb.com · 6 days ago · DEV
The biggest local LLM on your machine is useless if it can't call a single tool, no matter how many parameters it has
💬 LLMs · xda-developers.com · 8 hours ago
On-device AI is a margin decision
💬 LLMs · Content type: Blog · ziraph.com · 7 hours ago · Hacker News
2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP
💬 LLMs · Content type: Blog · dnhkng.github.io · 3 days ago
PagedAttention vs Traditional KV Cache: How vLLM Reinvented GPU Memory for LLM Inference
💬 LLMs · Content type: Blog · medium.com · 2 days ago
Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
💬 LLMs · Content type: Blog · dnhkng.github.io · 2 days ago
What Are Tokens in LLMs?
💬 LLMs · Content type: Blog · bearisland.dev · 4 days ago · Hacker News
LLM-as-a-Discriminator: When Synthetic Tables Still Look Real
💬 LLMs · Content type: Academic · arxiv.org · 21 hours ago
Xiaomi MiMo-V2.5-Pro Just Hit 1,000 Tokens Per Second!
📰 AI News · gizchina.com · 1 day ago
Show HN: Ext-Infer
💬 LLMs · infer.displace.tech · 3 days ago · Hacker News
Here's a llama.cpp CLI Command builder.
💬 LLMs · llamabuilding.com · 2 days ago · r/LocalLLaMA
Critical Hugging Face Transformers flaw ran attacker code on a routine model load
📰 AI News · siliconangle.com · 6 days ago
Tokenminning: Because Tokenmaxxing Is a Bad Idea
💬 LLMs · tokenminning.com · 1 day ago · Hacker News
Issue #390 - The ML Engineer 🤖
📰 AI News · Content type: News, Blog · machinelearning.substack.com · 3 days ago · Substack