Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
💬 LLMs
Specific
ChatGPT, Hermes, large language models, GPT-4, open source LLM
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
514
posts in
7.8
ms
What Are
Tokens
in
LLMs
?
🤖
AI
Content type:
Blog
bearisland.dev
·
3d
3 days ago
·
Hacker News
Actions for What Are Tokens in LLMs?
Self-hosted remote access for
Ollama
without complicated setup
📰
RSS
oab.arc-i.co.uk
·
2d
2 days ago
·
r/selfhosted
Actions for Self-hosted remote access for Ollama without complicated setup
Show HN: Run
Llama.cpp
In-Process from Java with Project Panama FFM
🤖
AI
deemwar-products.github.io
·
5d
5 days ago
·
Hacker News
Actions for Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
defai-digital/ax-engine: Apple Silicon
LLM
runtime supporting Gemma 4 and Qwen 3.6 MTP
modes
🤖
AI
Content type:
Code
github.com
·
11h
11 hours ago
·
Hacker News
Actions for defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes
NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
🌐
Web Tech
huggingface.co
·
2d
2 days ago
·
Hacker News
Actions for NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
Location: Göttingen, Germany Remote: Yes (preferred; hybrid also
fine
) Willing t...
💻
Software Dev
Content type:
Discussion
news.ycombinator.com
·
6d
6 days ago
·
Hacker News
Actions for Location: Göttingen, Germany Remote: Yes (preferred; hybrid also fine) Willing t...
Tales of an
Ollama
Honeypot (Part 3): More Traffic, More
Findings
🔐
Cybersecurity
posts.inthecyber.com
·
1d
1 day ago
Actions for Tales of an Ollama Honeypot (Part 3): More Traffic, More Findings
Train
Models
Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
🤖
AI
Content type:
News
Content type:
Blog
developer.nvidia.com
·
1d
1 day ago
Actions for Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
LLM
Inference Engineering Room — Part 3: The Orchestration Layer
🤖
AI
Content type:
Blog
vimal-dwarampudi.medium.com
·
6d
6 days ago
Actions for LLM Inference Engineering Room — Part 3: The Orchestration Layer
Melanie Mitchell: What We Get Wrong About AI
🤖
AI
yalereview.org
·
1d
1 day ago
·
Substack
,
Hacker News
,
Hacker News
Actions for Melanie Mitchell: What We Get Wrong About AI
Making Local
LLM
Go Brrr
🤖
AI
seanpedersen.github.io
·
6d
6 days ago
Actions for Making Local LLM Go Brrr
LLM-as-a-Discriminator
: When Synthetic Tables Still Look Real
🤖
AI
Content type:
Academic
arxiv.org
·
7h
7 hours ago
Actions for LLM-as-a-Discriminator: When Synthetic Tables Still Look Real
Running
Ollama
on a 15W CPU sounded ridiculous until I got it working with decent results
🤖
AI
xda-developers.com
·
5d
5 days ago
Actions for Running Ollama on a 15W CPU sounded ridiculous until I got it working with decent results
lightmetal: GPU
LLM
Inference From a Single Java 25 JAR
🤖
AI
Content type:
Blog
adambien.blog
·
1d
1 day ago
Actions for lightmetal: GPU LLM Inference From a Single Java 25 JAR
What's in the Box? A Field Guide to AI
Models
🤖
AI
Content type:
Blog
iankduncan.com
·
1d
1 day ago
Actions for What's in the Box? A Field Guide to AI Models
Running
LLM
Inference on Kubernetes: What It Actually Takes
🤖
AI
Content type:
Blog
fairwinds.com
·
4d
4 days ago
Actions for Running LLM Inference on Kubernetes: What It Actually Takes
Token4Token —
pay-per-token
inference on Gnosis + Swarm
₿
Crypto
t4t.eth.link
·
1d
1 day ago
·
Hacker News
Actions for Token4Token — pay-per-token inference on Gnosis + Swarm
The Rise of Agentic AI: What Every Engineer Should Learn
🤖
AI
Content type:
Blog
medium.com
·
1d
1 day ago
Actions for The Rise of Agentic AI: What Every Engineer Should Learn
How attackers are gaining access to
LLM
inference
🤖
AI
Content type:
Blog
intezer.com
·
6d
6 days ago
Actions for How attackers are gaining access to LLM inference
Report: GKE Inference Gateway delivers up to 92% faster AI responses
🤖
AI
Content type:
Blog
cloud.google.com
·
1d
1 day ago
·
Hacker News
Actions for Report: GKE Inference Gateway delivers up to 92% faster AI responses
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help