Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
ML
馃敭 ML
Broad
Local llm
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
489
posts in
6.1
ms
Using
Scikit-LLM
with Open-Source LLMs
聽
馃悕
Python
machinelearningmastery.com
路
6d
6 days ago
Actions for Using Scikit-LLM with Open-Source LLMs
Ollama
0.30 GPU Boost: Faster
local
Qwen
inference on NVIDIA
聽
馃殌
Model Deployment
everylocalai.com
路
8h
8 hours ago
路
DEV
Actions for Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA
defai-digital/ax-engine: Apple Silicon
LLM
runtime supporting
Gemma
4 and
Qwen
3.6 MTP modes
聽
馃
AI
聽
Content type:
Code
github.com
路
1d
1 day ago
路
Hacker News
Actions for defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes
What
Ollama
Reveals About
Local
AI, Agents, and Open Models
聽
馃殌
Model Deployment
聽
Content type:
Blog
odsc.medium.com
路
6h
6 hours ago
Actions for What Ollama Reveals About Local AI, Agents, and Open Models
How Small Can You Go? LoRA
Fine-Tuning
270M-8B Models for Merchant Information Extraction in Financial Transactions
聽
馃殌
Model Deployment
聽
Content type:
Academic
arxiv.org
路
2d
2 days ago
Actions for How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions
Unsloth
Gemma
4 QAT
聽
馃
AI
unsloth.ai
路
5d
5 days ago
Actions for Unsloth Gemma 4 QAT
I've tested so many desktop AI tools, but Hermes with
Ollama
is my new favorite - here's why
聽
馃悕
Python
聽
Content type:
News
聽
Content type:
Tutorial
zdnet.com
路
14h
14 hours ago
Actions for I've tested so many desktop AI tools, but Hermes with Ollama is my new favorite - here's why
You don't need Copilot for code completion, try this instead
聽
馃殌
Model Deployment
mistral.ai
路
2d
2 days ago
路
r/GithubCopilot
Actions for You don't need Copilot for code completion, try this instead
The biggest
local
LLM
on your machine is useless if it can't call a single tool, no matter how many parameters it has
聽
馃殌
Model Deployment
xda-developers.com
路
11h
11 hours ago
Actions for The biggest local LLM on your machine is useless if it can't call a single tool, no matter how many parameters it has
Qwen
3.6 27B AutoRound
GGUF
, need your feedback
聽
馃殌
Model Deployment
huggingface.co
路
1d
1 day ago
路
r/LocalLLaMA
Actions for Qwen 3.6 27B AutoRound GGUF, need your feedback
Improved performance and model support with
GGUF
聽
馃
AI
聽
Content type:
Blog
ollama.com
路
6d
6 days ago
Actions for Improved performance and model support with GGUF
lightmetal: GPU
LLM
Inference
From a Single Java 25 JAR
聽
馃殌
Model Deployment
聽
Content type:
Blog
adambien.blog
路
1d
1 day ago
Actions for lightmetal: GPU LLM Inference From a Single Java 25 JAR
Google鈥檚 DiffusionGemma is 4x faster than its other
Gemma
models
聽
馃幉
Synthetic Data Generation
thenewstack.io
路
11h
11 hours ago
Actions for Google鈥檚 DiffusionGemma is 4x faster than its other Gemma models
Apples to Apples: MLX vs.
Llama.cpp
for
Gemma
4 12B on an M1 16GB
聽
馃搱
Time Series Forecasting
聽
Content type:
Blog
ziraph.com
路
5d
5 days ago
路
Hacker News
Actions for Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB
What's in the Box? A Field Guide to AI Models
聽
馃
AI
聽
Content type:
Blog
iankduncan.com
路
2d
2 days ago
Actions for What's in the Box? A Field Guide to AI Models
An
LLM
that reviews your code, challenges your decisions, but never writes code for you
聽
馃殌
Model Deployment
聽
Content type:
Blog
blog.adafruit.com
路
8h
8 hours ago
Actions for An LLM that reviews your code, challenges your decisions, but never writes code for you
Ollama
0.30 delivers faster NVIDIA GPU performance and wider hardware support
聽
馃殌
Model Deployment
alternativeto.net
路
2d
2 days ago
Actions for Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support
Running
Qwen
35B MoE at 450k Context on a Single 32GB GPU
聽
馃
AI
local-llm.utop.workers.dev
路
3d
3 days ago
路
Hacker News
Actions for Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
NVIDIA Accelerates Google DeepMind鈥檚 DiffusionGemma for
Local
AI
聽
馃
AI
聽
Content type:
Blog
blogs.nvidia.com
路
12h
12 hours ago
Actions for NVIDIA Accelerates Google DeepMind鈥檚 DiffusionGemma for Local AI
Google fills out the middle with the
Gemma
4 12B
聽
馃
AI
jonpeddie.com
路
1d
1 day ago
Actions for Google fills out the middle with the Gemma 4 12B
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help