Scour
🤖 AI (Broad): Llama, Qwen, OpenAI, Claude, Anthropic, GPUs, Ollama, Local LLMs
zhongkaifu/TensorSharp: A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama/OpenAI-compatible HTTP APIs for programmatic access. It supports Windows/MacOS/Linux with full GPU capability
🇨🇳 Chinese AI
Content type: Code
github.com · 6 days ago · Hacker News
Fixing a stuck Ollama runner and building a GPU watchdog
🏠 Self-Hosting
patrickmccanna.net · 1 day ago · Hacker News
ReasonAlloc: Hierarchical Decoding-Time KV Cache Budget Allocation for Reasoning Models
🇨🇳 Chinese AI
Content type: Academic
arxiv.org · 8 hours ago
Token4Token — pay-per-token inference on Gnosis + Swarm
🤖 LLM
t4t.eth.link · 1 day ago · Hacker News
Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
🤖 LLM
deemwar-products.github.io · 5 days ago · Hacker News
A system programmer’s guide to LLM inference
💬 NLP
Content type: Blog
blog.xiangpeng.systems · 2 days ago · Hacker News
defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes
🇨🇳 Chinese AI
Content type: Code
github.com · 11 hours ago · Hacker News
NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
🪝 eBPF
huggingface.co · 2 days ago · Hacker News
Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
🦉 Qwen
local-llm.utop.workers.dev · 3 days ago · Hacker News
DeepSeek enters the fight for token volume, Anthropic continues to dominate spend
🇨🇳 Chinese AI
Content type: Blog
vercel.com · 2 days ago · Hacker News
raeudigerRaeffi/riddlerun: An open source agentic end2end testing tool for your webpages
🐳 Docker
Content type: Code
github.com · 23 hours ago · Hacker News, r/OpenAI
EvalStop: Using World Feedback to Detect and Correct Reward Overoptimization in Multi-Tenant RLHF Platforms
✨ LLMs
Content type: Academic
arxiv.org · 6 days ago
Anthropic tops AI Arena rankings as it files for IPO
🎭 Claude
Content type: News, Blog
liveclip.substack.com · 5 days ago · Substack
patriceckhart/zot: Yet another coding agent harness, lightweight and written in go.
🔌 Claude Plugins
Content type: Code
github.com · 18 hours ago · Hacker News
How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions
🇨🇳 Chinese AI
Content type: Academic
arxiv.org · 1 day ago
Large companies can add a local LLM filter layer to considerably reducing their AI costs
💬 NLP
umrashrf.github.io · 4 days ago · Hacker News
GGUF vs GPTQ vs AWQ: The Plain-English Guide to LLM Quantization (and Which One to Pick)
🧠 Machine Learning
vettedconsumer.com · 3 days ago · Hacker News
Clairvoyant: Predictive SJF Scheduling to Mitigate Head-of-Line Blocking in Serial LLM Backends
📱 Edge Computing
Content type: Academic
arxiv.org · 2 days ago
Nvidia Nemotron 3 Ultra
✨ LLMs
research.nvidia.com · 6 days ago · Hacker News
Shrivastava-Aditya/boolean-algebra-engine: Deterministic boolean algebra engine — evaluates expressions, detects contradictions, audits logic rules. MCP server, NL layer, REST API, CLI, Streamlit UI.
🔌 Claude Plugins
Content type: Code
github.com · 2 days ago · Hacker News, r/LLM