Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LocalLlama
reddit.com
Bleeding Llama: Critical
Unauthenticated
Memory Leak in
Ollama
(CVE-2026–7482)
cyera.com
·
9h
·
r/LocalLLaMA
,
r/netsec
US announces deals with tech
firms
for national security review of AI models before release
theguardian.com
·
10h
·
r/LocalLLaMA
Qwen3.6
27B
vs Qwen3.5
27B
vs Gemma 4
31B
: Accuracy, Latency, Memory, and Token Efficiency Tested
kaitchup.substack.com
·
10h
·
r/LocalLLaMA
ixu2486/tq
_
compat
_eval: Independent TurboQuant-compatible KV backend evaluation SDK for compressed-KV ABI testing, smoke tests, and partial attention decode experiments.
github.com
·
11h
·
r/LocalLLaMA
Comparing
the best open source
TranslateGemma
projects
metalglot.com
·
5d
·
Hacker News
,
r/LocalLLaMA
Supercharging LLM inference on Google
TPUs
: Achieving 3X
speedups
with diffusion-style speculative decoding
developers.googleblog.com
·
1d
·
Hacker News
,
Hacker News
,
r/LocalLLaMA
I made a voice controlled
Tic-Tac-Toe
game as a learning project
github.com
·
23h
·
r/LocalLLaMA
Qwen3.6 27B
FP8
runs with 200k tokens of
BF16
KV cache at 80 TPS on a single RTX 5000 PRO 48GB
huggingface.co
·
23h
·
r/LocalLLaMA
Peanut
- Text to Image Model (Open
Weights
coming soon)
xcancel.com
·
1d
·
r/LocalLLaMA
[Feature]
TurboQuant
: support hybrid models and uniform quantization by
JartX
· Pull Request #39931
github.com
·
1d
·
r/LocalLLaMA
White House
Considers
Vetting
A.I. Models Before They Are Released
nytimes.com
·
1d
·
Hacker News
,
r/LocalLLaMA
,
r/OpenAI
,
r/singularity
NVIDIA
DGX
Spark™ + Apple Mac Studio = 4x Faster LLM Inference with
EXO
1.0
blog.exolabs.net
·
28w
·
Hacker News
,
Hacker News
,
Hacker News
,
r/LocalLLaMA
,
r/LocalLLaMA
llama + spec:
MTP
Support by
am17an
· Pull Request #22673
github.com
·
1d
·
r/LocalLLaMA
[Release]
TinyMozart
v2
85M
🎶
huggingface.co
·
1d
·
r/LocalLLaMA
Deep research + report "a la McKinsey" with Hermes Agent and
qwen3.6-35b-a3b
Q6
_K.
github.com
·
1d
·
r/LocalLLaMA
50 Prozent mehr Speicher: Ryzen AI Max+ Pro 495 mit Radeon
8065S
nutzt 192
GByte
RAM
computerbase.de
·
1d
·
r/LocalLLaMA
AMD Ryzen AI Max+ PRO 495 leaks out, features Radeon
8065S
iGPU
and 192GB memory
videocardz.com
·
2d
·
r/LocalLLaMA
SearchSavior/Qwen3-TTS-OpenVINO
: From scratch qwen3 tts in pytorch, with from scratch
openvino
implementation on top.
github.com
·
2d
·
r/LocalLLaMA
SicariusSicariiStuff/Assistant
_
Pepe
_32B
huggingface.co
·
2d
·
r/LocalLLaMA
AMD and Intel Unveil ACE: New matrix
instructions
deliver a massive 16x AI performance leap over
AVX
tweaktown.com
·
5d
·
r/LocalLLaMA
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help