Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Open Source AI
🔓 Open Source AI
open source models, Hermes, Mistral, local LLM, Ollama
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
736
posts in
8.5
ms
Google's new
open
model
DiffusionGemma generates text from noise instead of word by word
🔔
Plan 9
the-decoder.com
·
3h
3 hours ago
Actions for Google's new open model DiffusionGemma generates text from noise instead of word by word
Domain-Specific Small Language
Models
(Manning)
🦬
Emacs
i-programmer.info
·
7h
7 hours ago
Actions for Domain-Specific Small Language Models (Manning)
Using
Scikit-LLM
with
Open-Source
LLMs
🦬
Emacs
machinelearningmastery.com
·
6d
6 days ago
Actions for Using Scikit-LLM with Open-Source LLMs
LeLab Is
Hugging
Face
’s New Browser-Based GUI for the LeRobot Ecosystem
♊
Gemini Protocol
Content type:
News
hackster.io
·
1d
1 day ago
Actions for LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem
147th airhacks tv:
Local
LLMs, LightMetal, ZSmith Agents,
AI
Rails, Saving Tokens
♊
Gemini Protocol
Content type:
Blog
adambien.blog
·
19h
19 hours ago
Actions for 147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens
Gemma 4 QAT on 10GB Laptop:
Local
AI
with 6.7GB VRAM
🔌
Single-Board Computers
everylocalai.com
·
2h
2 hours ago
·
DEV
Actions for Gemma 4 QAT on 10GB Laptop: Local AI with 6.7GB VRAM
Using
local
LLMs for agentic coding
🦬
Emacs
Content type:
Blog
blog.alexewerlof.com
·
6d
6 days ago
Actions for Using local LLMs for agentic coding
Fixing a stuck
Ollama
runner and building a GPU watchdog
🏠
Self-Hosting
patrickmccanna.net
·
2d
2 days ago
·
Hacker News
Actions for Fixing a stuck Ollama runner and building a GPU watchdog
Why agentic
AI
needs an
open
inference
stack
🔌
Single-Board Computers
redhat.com
·
2d
2 days ago
Actions for Why agentic AI needs an open inference stack
DiffusionGemma: 4x Faster Text Generation
🖥️
Retro Computing
Content type:
News
Content type:
Blog
blog.google
·
7h
7 hours ago
·
Hacker News
,
r/LocalLLaMA
,
r/singularity
Actions for DiffusionGemma: 4x Faster Text Generation
Running
LLM
Inference
on Kubernetes: What It Actually Takes
🏠
Self-Hosting
Content type:
Blog
fairwinds.com
·
5d
5 days ago
Actions for Running LLM Inference on Kubernetes: What It Actually Takes
Neo-X7/Neo-AI
: A fully offline
AI
assistant powered by
Ollama
. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
🌐
Fediverse
Content type:
Code
github.com
·
10h
10 hours ago
·
DEV
Actions for Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
Alignment Collapse Under KV Cache
Quantization
: Diagnosis and Mitigation
🔌
Single-Board Computers
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for Alignment Collapse Under KV Cache Quantization: Diagnosis and Mitigation
What's in the Box? A Field Guide to
AI
Models
🖥️
Retro Computing
Content type:
Blog
iankduncan.com
·
1d
1 day ago
Actions for What's in the Box? A Field Guide to AI Models
Show HN: Run
Llama.cpp
In-Process from Java with Project Panama FFM
🦬
Emacs
deemwar-products.github.io
·
5d
5 days ago
·
Hacker News
Actions for Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
On-device
AI
is a margin decision
🔌
Single-Board Computers
Content type:
Blog
ziraph.com
·
5h
5 hours ago
·
Hacker News
Actions for On-device AI is a margin decision
Xiaomi MiMo-V2.5-Pro Just Hit 1,000 Tokens Per Second!
🔌
Single-Board Computers
gizchina.com
·
1d
1 day ago
Actions for Xiaomi MiMo-V2.5-Pro Just Hit 1,000 Tokens Per Second!
Unsloth Gemma 4 QAT
🖧
BSD
unsloth.ai
·
5d
5 days ago
Actions for Unsloth Gemma 4 QAT
NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
🌐
Fediverse
huggingface.co
·
2d
2 days ago
·
Hacker News
Actions for NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
AMD's Lemonade SDK For
Local
AI
Adds NVIDIA CUDA Support
🖥️
Retro Computing
phoronix.com
·
6h
6 hours ago
Actions for AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help