Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Open Source AI
🔓 Open Source AI
open source models, llama, local LLM, hugging face
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
537
posts in
9.7
ms
zhongkaifu/TensorSharp: A C# inference engine for running
large
language
models
(LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama/OpenAI-compatible HTTP APIs for programmatic access. It supports Windows/MacOS/Linux with full GPU capability
💬
LLMs
Content type:
Code
github.com
·
6d
6 days ago
·
Hacker News
Actions for zhongkaifu/TensorSharp: A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama/OpenAI-compatible HTTP APIs for programmatic access. It supports Windows/MacOS/Linux with full GPU capability
lightmetal: GPU
LLM
Inference From a Single Java 25 JAR
💬
LLMs
Content type:
Blog
adambien.blog
·
1d
1 day ago
Actions for lightmetal: GPU LLM Inference From a Single Java 25 JAR
Google fills out the middle with the
Gemma
4 12B
🤖
ai
jonpeddie.com
·
19h
19 hours ago
Actions for Google fills out the middle with the Gemma 4 12B
LeLab Is
Hugging
Face
’s New Browser-Based GUI for the LeRobot Ecosystem
🤖
ai
Content type:
News
hackster.io
·
19h
19 hours ago
Actions for LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem
How Small Can You Go?
LoRA
Fine-Tuning
270M-8B Models for Merchant Information Extraction in Financial Transactions
🤖
ai
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions
Using
Scikit-LLM
with
Open-Source
LLMs
💬
LLMs
machinelearningmastery.com
·
5d
5 days ago
Actions for Using Scikit-LLM with Open-Source LLMs
Researchers Build Self-Replicating
AI
Worm That
Operates
Entirely on
Local
, Open-Weight Models
👨💻
Andrej Karpathy
thehackernews.com
·
22h
22 hours ago
Actions for Researchers Build Self-Replicating AI Worm That Operates Entirely on Local, Open-Weight Models
Ollama
0.30 delivers faster NVIDIA GPU performance and wider hardware support
💬
LLMs
alternativeto.net
·
2d
2 days ago
Actions for Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support
google/gemma-4-12B-it-qat-q4
_
0-gguf
🤖
ai
huggingface.co
·
4d
4 days ago
Actions for google/gemma-4-12B-it-qat-q4_0-gguf
Google Shrank
Gemma
4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good
⚙️
ML Engineering
Content type:
Blog
towardsai.net
·
2d
2 days ago
Actions for Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good
I added this
open-source
tool to my
local
AI stack, and my
local
LLM finally has persistent memory
💬
LLMs
xda-developers.com
·
17h
17 hours ago
Actions for I added this open-source tool to my local AI stack, and my local LLM finally has persistent memory
Improved performance and
model
support with
GGUF
🤖
ai
Content type:
Blog
ollama.com
·
5d
5 days ago
Actions for Improved performance and model support with GGUF
What's in the Box? A Field Guide to
AI
Models
🤖
ai
Content type:
Blog
iankduncan.com
·
1d
1 day ago
Actions for What's in the Box? A Field Guide to AI Models
Token4Token — pay-per-token inference on Gnosis + Swarm
⚙️
ML Engineering
t4t.eth.link
·
23h
23 hours ago
·
Hacker News
Actions for Token4Token — pay-per-token inference on Gnosis + Swarm
Using
local
LLMs for agentic coding
💬
LLMs
Content type:
Blog
blog.alexewerlof.com
·
6d
6 days ago
Actions for Using local LLMs for agentic coding
Florian Brand, Prime Intellect research engineer, adopts
Gemma
4 E4B 6-bit
quantized
as his primary
local
Mac LLM
⚙️
ML Engineering
Content type:
News
digg.com
·
2d
2 days ago
·
Hacker News
Actions for Florian Brand, Prime Intellect research engineer, adopts Gemma 4 E4B 6-bit quantized as his primary local Mac LLM
Autonomous
AI
worm uses
local
models
to exploit networks and repair its own code
🤖
ai
4sysops.com
·
20h
20 hours ago
Actions for Autonomous AI worm uses local models to exploit networks and repair its own code
local
llm
on laptop 780M GPU using
llama
+ gemma 4 qat
💬
LLMs
Content type:
Blog
alper.bearblog.dev
·
4d
4 days ago
Actions for local llm on laptop 780M GPU using llama + gemma 4 qat
A system programmer’s guide to
LLM
inference
⚙️
ML Engineering
Content type:
Blog
blog.xiangpeng.systems
·
2d
2 days ago
·
Hacker News
Actions for A system programmer’s guide to LLM inference
LLM-as-a-Discriminator
: When Synthetic Tables Still Look Real
💬
LLMs
Content type:
Academic
arxiv.org
·
6h
6 hours ago
Actions for LLM-as-a-Discriminator: When Synthetic Tables Still Look Real
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help