Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
💬 LLMs
Specific
large language models, local LLM, ollama, open source models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
917
posts in
12.9
ms
1-bit and 1.58 bit
LLM
Benchmarking on Jetson Orin Nano Super | Bonsai LM
🔧
AI Tools
smolhub.com
·
2d
2 days ago
·
r/LocalLLaMA
Actions for 1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
Hugging
Face
Transformers
RCE flaw enables stealthy compromise via AI model configs
🔧
AI Tools
csoonline.com
·
6d
6 days ago
Actions for Hugging Face Transformers RCE flaw enables stealthy compromise via AI model configs
Less-relevant results
Machinic Psychopharmacology: Do
LLMs
Self-Medicate?
🔧
AI Tools
lesswrong.com
·
6h
6 hours ago
·
Hacker News
Actions for Machinic Psychopharmacology: Do LLMs Self-Medicate?
The Sequence Knowledge #874:
Transformers
or Not?
🦾
ROS
substackcdn.com
·
1d
1 day ago
·
Substack
Actions for The Sequence Knowledge #874: Transformers or Not?
LLM
Inference Engineering Room — Part 3: The Orchestration Layer
🔧
AI Tools
Content type:
Blog
vimal-dwarampudi.medium.com
·
6d
6 days ago
Actions for LLM Inference Engineering Room — Part 3: The Orchestration Layer
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
🔧
AI Tools
Content type:
News
newsletter.semianalysis.com
·
1d
1 day ago
·
Hacker News
Actions for DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
Neo-X7/Neo-AI
: A fully offline
AI
assistant powered by
Ollama
. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
🔧
AI Tools
Content type:
Code
github.com
·
7h
7 hours ago
·
DEV
Actions for Neo-X7/Neo-AI: A fully offline AI assistant powered by Ollama. Stores and retrieves conversations using SQLite + LanceDB vector search. No cloud. No API keys. Runs entirely on your machine.
Running
LLM
Inference on Kubernetes: What It Actually Takes
🔧
AI Tools
Content type:
Blog
fairwinds.com
·
5d
5 days ago
Actions for Running LLM Inference on Kubernetes: What It Actually Takes
Building & Benchmarking:
LLMs
on a 16GB Jetson Orin NX for Hermes Agent
🔌
Embedded Systems
Content type:
Blog
dnhkng.github.io
·
1d
1 day ago
Actions for Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
Google's new
open
model
DiffusionGemma
generates
text from noise instead of word by word
🔧
AI Tools
the-decoder.com
·
1h
1 hour ago
Actions for Google's new open model DiffusionGemma generates text from noise instead of word by word
Malicious
Hugging
Face
Models
Could Trigger Remote Code Execution
🔧
AI Tools
techrepublic.com
·
5d
5 days ago
Actions for Malicious Hugging Face Models Could Trigger Remote Code Execution
RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step
LLM
Inference
🔧
AI Tools
Content type:
Academic
arxiv.org
·
16h
16 hours ago
Actions for RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step LLM Inference
How I benchmarked a 100%
local
RAG pipeline to 9/9 (zero API keys)
🔧
AI Tools
buy.polar.sh
·
2d
2 days ago
·
DEV
Actions for How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)
Hugging
Face
Transformers
flaw enables RCE via malicious model configs
🔧
AI Tools
4sysops.com
·
3d
3 days ago
Actions for Hugging Face Transformers flaw enables RCE via malicious model configs
NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
🔧
AI Tools
huggingface.co
·
2d
2 days ago
·
Hacker News
Actions for NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
I got a Crush on this new Terminal-based
AI
coding tool
🔧
AI Tools
xda-developers.com
·
23h
23 hours ago
Actions for I got a Crush on this new Terminal-based AI coding tool
Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
🔧
AI Tools
local-llm.utop.workers.dev
·
3d
3 days ago
·
Hacker News
Actions for Running Qwen 35B MoE at 450k Context on a Single 32GB GPU
WWDC 2026: Foundation
Models
(& Anarlog)
🔧
AI Tools
skushagra.com
·
1d
1 day ago
Actions for WWDC 2026: Foundation Models (& Anarlog)
Analyzing the geometric dependence of thermoelastic Q -
factor
in micro hemispherical resonators via a data-augmented
CNN-transformer
model
🔌
Embedded Systems
Content type:
Academic
nature.com
·
5d
5 days ago
Actions for Analyzing the geometric dependence of thermoelastic Q -factor in micro hemispherical resonators via a data-augmented CNN-transformer model
16bit.com Update Summary: June 09, 2026
🔌
Embedded Systems
16bit.com
·
1d
1 day ago
Actions for 16bit.com Update Summary: June 09, 2026
Sign up or log in to see more results
Sign Up
Login
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help