Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Llama
馃 Llama
Specific
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
269
posts in
6.9
ms
Show HN: Run
Llama.cpp
In-Process from Java with Project Panama FFM
deemwar-products.github.io
路
6d
6 days ago
路
Hacker News
Actions for Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
Less-relevant results
NVIDIA Accelerates Google DeepMind鈥檚 DiffusionGemma for Local AI
聽
Content type:
Blog
blogs.nvidia.com
路
18h
18 hours ago
Actions for NVIDIA Accelerates Google DeepMind鈥檚 DiffusionGemma for Local AI
"AI" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY
聽
Content type:
News
聽
Content type:
Blog
braddelong.substack.com
路
2d
2 days ago
路
Substack
Actions for "AI" Is Eating Platform Monopolist Free Cash Flow, Not the World: CHART OF THE DAY
Apples to Apples: MLX vs.
Llama.cpp
for Gemma 4 12B on an M1 16GB
聽
Content type:
Blog
ziraph.com
路
5d
5 days ago
路
Hacker News
Actions for Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB
WWDC 2026: Foundation
Models
(& Anarlog)
skushagra.com
路
2d
2 days ago
Actions for WWDC 2026: Foundation Models (& Anarlog)
2x GH200 for LLM inference, Part 2:
vLLM
, DeepSeek V4 Flash, and MTP
聽
Content type:
Blog
dnhkng.github.io
路
3d
3 days ago
Actions for 2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP
Can
Open-Source
LLM Agents Replace Static Application Security Testing Tools? An Empirical Assessment
聽
Content type:
Academic
arxiv.org
路
6h
6 hours ago
Actions for Can Open-Source LLM Agents Replace Static Application Security Testing Tools? An Empirical Assessment
Running LLM Inference on Kubernetes: What It Actually Takes
聽
Content type:
Blog
fairwinds.com
路
5d
5 days ago
Actions for Running LLM Inference on Kubernetes: What It Actually Takes
Neo-X7/Neo-AI: A fully offline AI assistant powered by
Ollama
. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
聽
Content type:
Code
github.com
路
21h
21 hours ago
路
DEV
Actions for Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
Meta
ties up with Ambani's Reliance for AI data center in India
channelnewsasia.com
路
1d
1 day ago
Actions for Meta ties up with Ambani's Reliance for AI data center in India
Why Shrinking an AI
Model
Often Makes It More Useful
siliconopera.com
路
4d
4 days ago
Actions for Why Shrinking an AI Model Often Makes It More Useful
Intelligent inference scheduling with llm-d on Red Hat AI
developers.redhat.com
路
10h
10 hours ago
Actions for Intelligent inference scheduling with llm-d on Red Hat AI
Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn鈥檛 Be This Good
聽
Content type:
Blog
towardsai.net
路
3d
3 days ago
Actions for Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn鈥檛 Be This Good
What's in the Box? A Field Guide to AI
Models
聽
Content type:
Blog
iankduncan.com
路
2d
2 days ago
Actions for What's in the Box? A Field Guide to AI Models
On-device AI is a margin decision
聽
Content type:
Blog
ziraph.com
路
16h
16 hours ago
路
Hacker News
Actions for On-device AI is a margin decision
Running
Ollama
on a 15W CPU sounded ridiculous until I got it working with decent results
xda-developers.com
路
6d
6 days ago
Actions for Running Ollama on a 15W CPU sounded ridiculous until I got it working with decent results
RakuOS fixes the one thing that annoys me most about immutable Linux distros
聽
Content type:
News
zdnet.com
路
1d
1 day ago
Actions for RakuOS fixes the one thing that annoys me most about immutable Linux distros
Google's new
open
model
DiffusionGemma generates text from noise instead of word by word
the-decoder.com
路
14h
14 hours ago
Actions for Google's new open model DiffusionGemma generates text from noise instead of word by word
Creating ADK Agent using locally running Gemma 4
聽
Content type:
Blog
medium.com
路
3d
3 days ago
Actions for Creating ADK Agent using locally running Gemma 4
Evaluating Hallucinations in Domain-Adapted
Large
Language
Models
聽
Content type:
Academic
arxiv.org
路
2d
2 days ago
Actions for Evaluating Hallucinations in Domain-Adapted Large Language Models
« Page 1
路
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help