Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Open Source AI
🔓 Open Source AI
open source model, Llama, Mistral, Hugging Face
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
857
posts in
4.8
ms
google/gemma-4-12B-it-qat-q4
_0-gguf
🧠
LLMs
huggingface.co
·
5d
5 days ago
Actions for google/gemma-4-12B-it-qat-q4_0-gguf
Google fills out the middle with the
Gemma
4 12B
💬
NLP
jonpeddie.com
·
2d
2 days ago
Actions for Google fills out the middle with the Gemma 4 12B
Ollama
0.30 GPU Boost: Faster local
Qwen
inference on NVIDIA
⚙️
MLOps
everylocalai.com
·
19h
19 hours ago
·
DEV
Actions for Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA
fix(
ollama
): use provider thinking default in SDK session
factory
(#9… ·
openclaw/openclaw
@4f3c2cd
🧠
LLMs
Content type:
Code
github.com
·
4h
4 hours ago
Actions for fix(ollama): use provider thinking default in SDK session factory (#9… · openclaw/openclaw@4f3c2cd
NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local
AI
🧠
LLMs
Content type:
Blog
blogs.nvidia.com
·
23h
23 hours ago
Actions for NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
How Small Can You Go? LoRA
Fine-Tuning
270M-8B
Models
for Merchant Information Extraction in Financial Transactions
🧠
LLMs
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions
Unsloth
Gemma
4 QAT
🧠
LLMs
unsloth.ai
·
5d
5 days ago
Actions for Unsloth Gemma 4 QAT
What
Ollama
Reveals About Local
AI
, Agents, and
Open
Models
🕵️
AI Agents
Content type:
Blog
odsc.medium.com
·
17h
17 hours ago
Actions for What Ollama Reveals About Local AI, Agents, and Open Models
Google
Gemma
4 12B brings native multimodal
AI
to standard laptops
🕵️
AI Agents
4sysops.com
·
2d
2 days ago
Actions for Google Gemma 4 12B brings native multimodal AI to standard laptops
Intelligent inference scheduling with
llm-d
on Red Hat
AI
🧠
LLMs
developers.redhat.com
·
16h
16 hours ago
Actions for Intelligent inference scheduling with llm-d on Red Hat AI
Gemma
4 QAT
models
: Optimizing model compression for mobile and laptop efficiency
🧠
LLMs
Content type:
News
Content type:
Blog
blog.google
·
6d
6 days ago
·
Hacker News
Actions for Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency
MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent
🧠
LLMs
Content type:
Blog
bric.pe.kr
·
2d
2 days ago
·
DEV
Actions for MTP Isn't Always a Win: 1.95x on My 3090, but Speculative Decoding Is Hardware-Dependent
Government aims to make UK top spot for
open
source
AI
🧠
LLMs
Content type:
News
computerweekly.com
·
10h
10 hours ago
Actions for Government aims to make UK top spot for open source AI
Google unveils DiffusionGemma, delivering up to 4x faster inference on dedicated GPUs
🧠
LLMs
alternativeto.net
·
4h
4 hours ago
Actions for Google unveils DiffusionGemma, delivering up to 4x faster inference on dedicated GPUs
lightmetal: GPU
LLM
Inference From a Single Java 25 JAR
🧠
LLMs
Content type:
Blog
adambien.blog
·
2d
2 days ago
Actions for lightmetal: GPU LLM Inference From a Single Java 25 JAR
Improved performance and
model
support with GGUF
🧠
LLMs
Content type:
Blog
ollama.com
·
6d
6 days ago
Actions for Improved performance and model support with GGUF
Google's new
open
model
DiffusionGemma generates text from noise instead of word by word
🧠
LLMs
the-decoder.com
·
20h
20 hours ago
Actions for Google's new open model DiffusionGemma generates text from noise instead of word by word
Fixing a stuck
Ollama
runner and building a GPU watchdog
🤖
Autonomous Systems
patrickmccanna.net
·
2d
2 days ago
·
Hacker News
Actions for Fixing a stuck Ollama runner and building a GPU watchdog
AMD's Lemonade SDK For Local
AI
Adds NVIDIA CUDA Support
🧠
LLMs
phoronix.com
·
23h
23 hours ago
·
r/artificial
Actions for AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
The latest
Gemma
4
models
use a training trick to slash their on-device memory footprint
🧠
LLMs
androidauthority.com
·
5d
5 days ago
Actions for The latest Gemma 4 models use a training trick to slash their on-device memory footprint
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help