Open Source AI
open source models, Llama, Mistral, Hugging Face, local LLMs
Improved performance and model support with GGUF
💬 LLMs · Content type: Blog · ollama.com · 6 days ago
Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA
⚙️ MLOps · everylocalai.com · 23 hours ago · DEV
techjarves/Portable-AI-USB: A 100% offline, fully portable, zero-trace AI (Ollama + Llama 3 + AnythingLLM) that runs natively from a USB drive on Windows and Mac.
✍️ Prompt Engineering · Content type: Code · github.com · 2 days ago
Orchestrate your LLM pipeline. Locally
✍️ Prompt Engineering · llmforge.app · 3 hours ago · Hacker News
MechLens: Late Crystallization of Factual Knowledge Explains Intervention Effectiveness in Language Models
✍️ Prompt Engineering · Content type: Academic · arxiv.org · 2 days ago
What Ollama Reveals About Local AI, Agents, and Open Models
⚙️ MLOps · Content type: Blog · odsc.medium.com · 21 hours ago
Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
✍️ Prompt Engineering · deemwar-products.github.io · 6 days ago · Hacker News
MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent
💬 LLMs · Content type: Blog · bric.pe.kr · 2 days ago · DEV
6. Air-Gapped Claude Code - The Claude Code SRE Handbook
✍️ Prompt Engineering · har-ki.github.io · 3 hours ago · Hacker News
Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?
✍️ Prompt Engineering · Content type: Discussion · news.ycombinator.com · 13 hours ago · Hacker News
local llm on laptop 780M GPU using llama + gemma 4 qat
✍️ Prompt Engineering · Content type: Blog · alper.bearblog.dev · 5 days ago
LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem
🤖 AI · Content type: News · hackster.io · 2 days ago
Two old GPUs I salvaged are doing more AI work than a brand new $2000 card, and I won't be upgrading anytime soon
✍️ Prompt Engineering · xda-developers.com · 4 hours ago
I've tested so many desktop AI tools, but Hermes with Ollama is my new favorite - here's why
🧠 AI Agents · Content type: News, Tutorial · zdnet.com · 1 day ago
You don't need Copilot for code completion, try this instead
🤖 AI Coding · mistral.ai · 3 days ago · r/GithubCopilot
DiffusionGemma 26B A4B results on my 5090
✍️ Prompt Engineering · huggingface.co · 1 day ago · r/LocalLLaMA
Google launches the DiffusionGemma text diffusion model: local AI inference speed increased 4x
🤖 AI · ithome.com · 21 hours ago
Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support
⚙️ MLOps · alternativeto.net · 3 days ago
Show HN: In-browser real LLM token counter and cost estimation
✍️ Prompt Engineering · holaclaw.ai · 4 hours ago · Hacker News
lightmetal: GPU LLM Inference From a Single Java 25 JAR
✍️ Prompt Engineering · Content type: Blog · adambien.blog · 2 days ago