Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Open Source AI
馃敁 Open Source AI
open source models, llama, local LLM, hugging face
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
551
posts in
8.6
ms
zhongkaifu/TensorSharp: A C# inference engine for running
large
language
models
(LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama/OpenAI-compatible HTTP APIs for programmatic access. It supports Windows/MacOS/Linux with full GPU capability
聽
馃挰
LLMs
聽
Content type:
Code
github.com
路
6d
6 days ago
路
Hacker News
Actions for zhongkaifu/TensorSharp: A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama/OpenAI-compatible HTTP APIs for programmatic access. It supports Windows/MacOS/Linux with full GPU capability
lightmetal: GPU
LLM
Inference From a Single Java 25 JAR
聽
馃挰
LLMs
聽
Content type:
Blog
adambien.blog
路
1d
1 day ago
Actions for lightmetal: GPU LLM Inference From a Single Java 25 JAR
Qwen
3.6 27B AutoRound
GGUF
, need your feedback
聽
鈿欙笍
ML Engineering
huggingface.co
路
22h
22 hours ago
路
r/LocalLLaMA
Actions for Qwen 3.6 27B AutoRound GGUF, need your feedback
NVIDIA Accelerates Google DeepMind鈥檚 DiffusionGemma for
Local
AI
聽
馃
ai
聽
Content type:
Blog
blogs.nvidia.com
路
1h
1 hour ago
Actions for NVIDIA Accelerates Google DeepMind鈥檚 DiffusionGemma for Local AI
How Small Can You Go?
LoRA
Fine-Tuning
270M-8B Models for Merchant Information Extraction in Financial Transactions
聽
馃
ai
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions
Using
Scikit-LLM
with
Open-Source
LLMs
聽
馃挰
LLMs
machinelearningmastery.com
路
6d
6 days ago
Actions for Using Scikit-LLM with Open-Source LLMs
AMD's Lemonade SDK For
Local
AI
Adds NVIDIA CUDA Support
聽
鈿欙笍
ML Engineering
phoronix.com
路
1h
1 hour ago
Actions for AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
Ollama
0.30 delivers faster NVIDIA GPU performance and wider hardware support
聽
馃挰
LLMs
alternativeto.net
路
2d
2 days ago
Actions for Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support
The biggest
local
LLM
on your machine is useless if it can't call a single tool, no matter how many parameters it has
聽
馃挰
LLMs
xda-developers.com
路
40m
40 minutes ago
Actions for The biggest local LLM on your machine is useless if it can't call a single tool, no matter how many parameters it has
Improved performance and
model
support with
GGUF
聽
馃
ai
聽
Content type:
Blog
ollama.com
路
5d
5 days ago
Actions for Improved performance and model support with GGUF
Google Shrank
Gemma
4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn鈥檛 Be This Good
聽
鈿欙笍
ML Engineering
聽
Content type:
Blog
towardsai.net
路
2d
2 days ago
Actions for Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn鈥檛 Be This Good
DiffusionGemma: 4x Faster Text Generation
聽
馃
ai
聽
Content type:
News
聽
Content type:
Blog
blog.google
路
1h
1 hour ago
路
Hacker News
,
r/LocalLLaMA
,
r/singularity
Actions for DiffusionGemma: 4x Faster Text Generation
DiffusionGemma: The Developer Guide
聽
馃
ai
聽
Content type:
Blog
developers.googleblog.com
路
17h
17 hours ago
Actions for DiffusionGemma: The Developer Guide
Using
local
LLMs for agentic coding
聽
馃挰
LLMs
聽
Content type:
Blog
blog.alexewerlof.com
路
6d
6 days ago
Actions for Using local LLMs for agentic coding
Google fills out the middle with the
Gemma
4 12B
聽
馃
ai
jonpeddie.com
路
1d
1 day ago
Actions for Google fills out the middle with the Gemma 4 12B
Domain-Specific Small
Language
Models
(Manning)
聽
馃
ai
i-programmer.info
路
2h
2 hours ago
Actions for Domain-Specific Small Language Models (Manning)
What's in the Box? A Field Guide to
AI
Models
聽
馃
ai
聽
Content type:
Blog
iankduncan.com
路
1d
1 day ago
Actions for What's in the Box? A Field Guide to AI Models
Optimal Post-Training
Quantization
Scales and Where to
Find
Them
聽
鈿欙笍
ML Engineering
聽
Content type:
Academic
arxiv.org
路
13h
13 hours ago
Actions for Optimal Post-Training Quantization Scales and Where to Find Them
local
llm
on laptop 780M GPU using
llama
+ gemma 4 qat
聽
馃挰
LLMs
聽
Content type:
Blog
alper.bearblog.dev
路
4d
4 days ago
Actions for local llm on laptop 780M GPU using llama + gemma 4 qat
LeLab Is
Hugging
Face
鈥檚 New Browser-Based GUI for the LeRobot Ecosystem
聽
馃
ai
聽
Content type:
News
hackster.io
路
1d
1 day ago
Actions for LeLab Is Hugging Face鈥檚 New Browser-Based GUI for the LeRobot Ecosystem
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help