Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Large Language Model
🤖 Large Language Model
Specific
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
561
posts in
7.3
ms
What Are Tokens in LLMs?
🤖
AI
Content type:
Blog
bearisland.dev
·
5d
5 days ago
·
Hacker News
Actions for What Are Tokens in LLMs?
Why Your
LLM
Gets Dumber With More Context
📊
Dataset Curation
siliconopera.com
·
23h
23 hours ago
Actions for Why Your LLM Gets Dumber With More Context
147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents,
AI
Rails, Saving Tokens
🤖
AI
Content type:
Blog
adambien.blog
·
2d
2 days ago
Actions for 147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens
From Chatbot Hallucinations to Deterministic Agents: Forcing Local LLMs to Run Production-Grade…
🤖
AI
Content type:
Blog
medium.com
·
1h
1 hour ago
Actions for From Chatbot Hallucinations to Deterministic Agents: Forcing Local LLMs to Run Production-Grade…
CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?
👁️
Computer Vision
uccl-project.github.io
·
1d
1 day ago
·
Hacker News
Actions for CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?
Research Proposal: Decoupled
RISC-LLM
Architectures
via Circadian Synaptic Consolidation
👁️
Computer Vision
aermia.com
·
5d
5 days ago
·
Hacker News
Actions for Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation
The Tech Download:
Mistral
's Arthur Mensch on agentic
AI
, chips and enterprise adoption
👁️
Computer Vision
Content type:
News
cnbc.com
·
14h
14 hours ago
Actions for The Tech Download: Mistral's Arthur Mensch on agentic AI, chips and enterprise adoption
Does ChatGPT need a psychiatrist? Similarities between human psychopathology and errors in
large
language
models
📊
Dataset Curation
Content type:
Academic
nature.com
·
2d
2 days ago
·
Hacker News
Actions for Does ChatGPT need a psychiatrist? Similarities between human psychopathology and errors in large language models
France’s
Mistral
in Funding Talks at About €20 Billion Valuation
🤖
AI
Content type:
News
bloomberg.com
·
1h
1 hour ago
Actions for France’s Mistral in Funding Talks at About €20 Billion Valuation
You don't need Copilot for code completion, try this instead
🤖
AI
mistral.ai
·
4d
4 days ago
·
r/GithubCopilot
Actions for You don't need Copilot for code completion, try this instead
Ask HN: Any Local
LLM
can I run without GPU for Local Agentic workflow
AI
?
🤖
AI
Content type:
Discussion
news.ycombinator.com
·
1d
1 day ago
·
Hacker News
Actions for Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?
massimo92/spark: CLI tool for serving LLMs with
vLLM
on NVIDIA DGX Spark. One file, zero friction.
🤖
AI
Content type:
Code
github.com
·
18h
18 hours ago
·
Hacker News
Actions for massimo92/spark: CLI tool for serving LLMs with vLLM on NVIDIA DGX Spark. One file, zero friction.
lightmetal: GPU
LLM
Inference From a Single Java 25 JAR
🤖
AI
Content type:
Blog
adambien.blog
·
3d
3 days ago
Actions for lightmetal: GPU LLM Inference From a Single Java 25 JAR
NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local
AI
🤖
AI
Content type:
Blog
blogs.nvidia.com
·
1d
1 day ago
Actions for NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
DiffusionGemma: Discrete diffusion in a
large
language
model
👁️
Computer Vision
idlemachines.co.uk
·
15h
15 hours ago
·
Hacker News
Actions for DiffusionGemma: Discrete diffusion in a large language model
I ran local LLMs on my phone for a month, and now my desktop setup feels like overkill
🤖
AI
xda-developers.com
·
2h
2 hours ago
Actions for I ran local LLMs on my phone for a month, and now my desktop setup feels like overkill
Report: GKE Inference Gateway delivers up to 92% faster
AI
responses
🤖
AI
Content type:
Blog
cloud.google.com
·
3d
3 days ago
·
Hacker News
Actions for Report: GKE Inference Gateway delivers up to 92% faster AI responses
Inferoa
AI
harness claimed 90% cache savings. We ran it and measured 97.8%
🤖
AI
zozo123.github.io
·
2d
2 days ago
·
Hacker News
Actions for Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
Fine-tuning
Multi-modal LLMs with ART: Art-based Reinforcement
Training
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training
Mi50 32GB / GFX906 -
vLLM
Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit
🤖
AI
huggingface.co
·
15h
15 hours ago
·
r/LocalLLaMA
Actions for Mi50 32GB / GFX906 - vLLM Qwen 3.5 Configuration for Qwen 3.5:9B AWQ-4bit
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help