Scour
💬 LLMs
large language model, LLM, foundation model, transformer
Scoured 635 posts in 8.0 ms
You don't need Copilot for code completion, try this instead
🔓 Open Source
mistral.ai · 3 days ago · r/GithubCopilot
Why Your LLM Gets Dumber With More Context
🛡️ AI Safety
siliconopera.com · 8 hours ago
AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
🔌 Embedded Systems
phoronix.com · 1 day ago · r/artificial
What Are Tokens in LLMs?
🗺️ Mapping
Content type: Blog
bearisland.dev · 4 days ago · Hacker News
CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?
🔓 Open Source
uccl-project.github.io · 16 hours ago · Hacker News
Report: GKE Inference Gateway delivers up to 92% faster AI responses
🌐 AGI
Content type: Blog
cloud.google.com · 2 days ago · Hacker News
massimo92/spark: CLI tool for serving LLMs with vLLM on NVIDIA DGX Spark. One file, zero friction.
🔌 Embedded Systems
Content type: Code
github.com · 3 hours ago · Hacker News
Google open-sources speedy DiffusionGemma text diffusion model
🔓 Open Source
siliconangle.com · 22 hours ago
The Neutral Mask: How RLHF Provides Shallow Alignment while Leaving Partisan Structure Intact in a Large Language Model
✨ Neural Radiance Fields
Content type: Academic
arxiv.org · 2 days ago
Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support
🔌 Embedded Systems
alternativeto.net · 3 days ago
Making a Vintage LLM from Scratch
✨ Neural Radiance Fields
crlf.link · 14 hours ago · Hacker News
Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
🔌 Embedded Systems
zozo123.github.io · 1 day ago · Hacker News
Show HN: Run Llama.cpp In-Process from Java with Project Panama FFM
🔌 Embedded Systems
deemwar-products.github.io · 6 days ago · Hacker News
147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens
🔌 Embedded Systems
Content type: Blog
adambien.blog · 1 day ago
Gemma 4 QAT on 10GB Laptop: Local AI with 6.7GB VRAM
🔌 Embedded Systems
everylocalai.com · 1 day ago · DEV
MLPerf and the rise of latency-aware LLM benchmarking
👁️ Computer Vision
edn.com · 6 days ago
LLM Routing: From Strategy Selection to Production Architecture
🔌 Embedded Systems
Content type: Blog
blog.n8n.io · 1 day ago
Timing Trick Cuts Energy Used in LLM Training by Up to 14 Percent
🔌 Embedded Systems
Content type: News
spectrum.ieee.org · 1 day ago · Hacker News
Fixing a stuck Ollama runner and building a GPU watchdog
⚙️ ROS
patrickmccanna.net · 3 days ago · Hacker News
Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training
✨ Neural Radiance Fields
Content type: Academic
arxiv.org · 19 hours ago