Skip to main content
Scour
Discover
Docs
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
💬 LLMs
Specific
large language models, GPT, Claude, foundation models
Filter Results
Timeframe
Choose a timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
1,506
posts in
16.1
ms
🤖
AI
ubuntu.com
·
1d
1 day ago
Developing web apps with local
LLM
inference
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Developing web apps with local LLM inference
🧠
Agentic AI
cerebras.ai
·
4d
4 days ago
Gemma
4 on Cerebras—The Fastest
Inference
is Now Multimodal
Covers
Home | ArtificialAnalysis.ai
Covered by
habr.com
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Gemma 4 on Cerebras—The Fastest Inference is Now Multimodal
🧠
Agentic AI
llama-dash.dev
·
18h
18 hours ago
One go-to control plane for local
inference
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for One go-to control plane for local inference
🔧
Hardware
Posts by Year
·
20h
20 hours ago
Mac Studio: The Best Local
LLM
Workstation Money Can(‘t) Buy
Covers
7 stories
See all stories this covers
including
Introducing Claude Opus 4.8
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Mac Studio: The Best Local LLM Workstation Money Can(‘t) Buy
📚
LMS
alper.bearblog.dev
·
2d
2 days ago
Activate
Gemma
4 MTP
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Activate Gemma 4 MTP
🧠
Agentic AI
arXiv
·
5d
5 days ago
From Tokens to Energy Flexibility:
Quantization-Enabled
Demand Response for Data Centers with
LLM
Inference
Workloads
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for From Tokens to Energy Flexibility: Quantization-Enabled Demand Response for Data Centers with LLM Inference Workloads
🎓
eLearning
Bloomberg
·
17h
17 hours ago
Tech Disruptors: Invisible Technologies on
RLHF
and
LLM
Training
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Tech Disruptors: Invisible Technologies on RLHF and LLM Training
🤖
AI
GitHub
·
2d
2 days ago
Show HN: Alloy – a PyTorch backend and
inference
engine for Apple Silicon
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon
🤖
AI
Claude
·
1d
1 day ago
The full
Claude
Desktop experience on AWS, Google Cloud, and Microsoft
Foundry
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for The full Claude Desktop experience on AWS, Google Cloud, and Microsoft Foundry
🚀
Tech Trends
The Sun
·
8h
8 hours ago
Gemma
Atkinson reveals she accidentally flashed her boobs to Gordon Ramsay in mortifying FaceTime blunder
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Gemma Atkinson reveals she accidentally flashed her boobs to Gordon Ramsay in mortifying FaceTime blunder
🤖
AI
XDA
·
6d
6 days ago
My local
LLM
is helping me use
Claude
more effectively, and it's the perfect one-two punch for my workflow
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for My local LLM is helping me use Claude more effectively, and it's the perfect one-two punch for my workflow
🔧
Hardware
SiliconANGLE
·
9h
9 hours ago
Inference
chip startup Groq raises $650M to grow its cloud platform
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Inference chip startup Groq raises $650M to grow its cloud platform
🤖
AI
ByteByteGo Newsletter
·
2d
2 days ago
EP219: 12 Open-source
LLMs
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for EP219: 12 Open-source LLMs
📚
LMS
digitalocean.com
·
3d
3 days ago
Efficient
LLM
Compression with SparseGPT and Wanda on GPU Cloud
Covers
NVIDIA Triton Inference Server — NVIDIA Triton Inference Server
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Efficient LLM Compression with SparseGPT and Wanda on GPU Cloud
🤖
AI
Semiconductor Engineering
·
1d
1 day ago
Tool-Assisted
LLM
Targets RTL Code Generation (UC Riverside, Futurewei)
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Tool-Assisted LLM Targets RTL Code Generation (UC Riverside, Futurewei)
🔧
Hardware
Network World
·
5d
5 days ago
Tether is shipping TurboQuant KV-cache
quantization
with Vulkan support into its QVAC SDK
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Tether is shipping TurboQuant KV-cache quantization with Vulkan support into its QVAC SDK
🧠
Agentic AI
akarouter.dev
·
1d
1 day ago
Flat per-call
LLM
API gateway (20x cheaper than
Claude
Max)
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Flat per-call LLM API gateway (20x cheaper than Claude Max)
🤖
AI
fareedkhan-dev.github.io
·
2d
2 days ago
Train
LLM
from Scratch
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Train LLM from Scratch
🤖
AI
IEEE Spectrum
·
3d
3 days ago
IEEE Rolls Out
Large
Language
Models
Virtual Training Course
Covers
4 stories
See all stories this covers
including
How to Compress DICOM (.dcm) Images from 1.4 MB to KB Using Python?
Covered by
contextmaestro.com
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for IEEE Rolls Out Large Language Models Virtual Training Course
📚
LMS
Hacker News
·
1d
1 day ago
Good results
fine
tuning
a local
LLM
like Qwen 3:0.6B to categorize questions
Discussed on
Hacker News
Love
Like
Not for me
Save
See related topics
Feeds
Share
Report
Off Topic
Harmful Content
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Block Domain
Actions for Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous post
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Discover
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help
Like
Save
Not for me
Report