🧠 LLMs
Specific
large language models, GPT, transformers, inference
Scoured 511 posts in 13.4 ms
Using Scikit-LLM with Open-Source LLMs
🤖 AI · machinelearningmastery.com · 6 days ago
lightmetal: GPU LLM Inference From a Single Java 25 JAR
🔍 Static Analysis · Content type: Blog · adambien.blog · 1 day ago
The Shibboleth Effect: Auditing the Cross-Lingual Distributional Skew of Large Language Models
🤖 AI · Content type: Academic · arxiv.org · 16 hours ago
LLM Routing: From Strategy Selection to Production Architecture
🤖 AI Agents · Content type: Blog · blog.n8n.io · 5 hours ago
Alvaro-Manzo/promptshift: Model-aware prompt adapter for Claude - translate any prompt to GPT, Gemini, Mistral, Llama and more
👨‍💻 AI Coding · Content type: Code · github.com · 2 days ago · r/PromptEngineering
Initial impressions of Claude Fable 5
👨‍💻 AI Coding · simonwillison.net · 20 hours ago · Hacker News
local llm on laptop 780M GPU using llama + gemma 4 qat
👨‍💻 AI Coding · Content type: Blog · alper.bearblog.dev · 4 days ago
Slack bot for the whole team, not per-seat
👨‍💻 AI Coding · Content type: Discussion · plugand.ai · 10 hours ago · Hacker News
Why LLMs (still) lack taste
🤖 AI · beyondtheprior.com · 1 day ago · Hacker News
DiffusionGemma: 4x Faster Text Generation
🤖 AI · Content type: News, Blog · blog.google · 4 hours ago · Hacker News, r/LocalLLaMA, r/singularity
NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
🤖 AI · Content type: Blog · blogs.nvidia.com · 4 hours ago
Google fills out the middle with the Gemma 4 12B
🤖 AI · jonpeddie.com · 1 day ago
Research Proposal: Decoupled RISC-LLM Architectures via Circadian Synaptic Consolidation
🤖 AI · aermia.com · 3 days ago · Hacker News
Google’s DiffusionGemma is 4x faster than its other Gemma models
🤖 Transformers · thenewstack.io · 3 hours ago
Claude Fable 5 is Mythos for the masses
🤖 AI Agents · Content type: Blog · techzine.eu · 23 hours ago
Report: GKE Inference Gateway delivers up to 92% faster AI responses
🏗️ Infrastructure · Content type: Blog · cloud.google.com · 1 day ago · Hacker News
google/gemma-4-12B-it-qat-q4_0-gguf
🤖 AI · huggingface.co · 4 days ago
The Bill Arrives: How to Manage Agentic AI Costs at Scale
🤖 Agents · Content type: Blog · cockroachlabs.com · 20 hours ago
You don't need Copilot for code completion, try this instead
🤖 Agents · mistral.ai · 2 days ago · r/GithubCopilot
A Plea to the Labs: Let the Models Diagnose.
🤖 Agentic AI · Content type: Blog · tangent.bearblog.dev · 4 hours ago · Hacker News