Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
AI
🤖 AI
Broad
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
2281
posts in
4.3
ms
SLUUG Talk: Demystifying
Large
Language
Models
on Linux
🤖
ML
Content type:
Code
github.com
·
3d
3 days ago
·
DEV
Actions for SLUUG Talk: Demystifying Large Language Models on Linux
Report: GKE
Inference
Gateway delivers up to 92% faster
AI
responses
🏗️
Data Engineering
Content type:
Blog
cloud.google.com
·
1d
1 day ago
·
Hacker News
Actions for Report: GKE Inference Gateway delivers up to 92% faster AI responses
Using Probabilistic Programs to Train Inductive Reasoning in
Large
Language
Models
🔀
Transformers
Content type:
Academic
arxiv.org
·
19h
19 hours ago
Actions for Using Probabilistic Programs to Train Inductive Reasoning in Large Language Models
Ollama 0.30 GPU Boost: Faster local
Qwen
inference
on NVIDIA
🧭
Vector Databases
everylocalai.com
·
3h
3 hours ago
·
DEV
Actions for Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA
AI
inference
: what it is and why it matters for product managers
🔀
Transformers
marcabraham.com
·
2d
2 days ago
Actions for AI inference: what it is and why it matters for product managers
Inferoa
AI
harness claimed 90% cache savings. We ran it and measured 97.8%
📈
Time Series
zozo123.github.io
·
13h
13 hours ago
·
Hacker News
Actions for Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%
Using
Scikit-LLM
with Open-Source LLMs
🔧
Feature Engineering
machinelearningmastery.com
·
6d
6 days ago
Actions for Using Scikit-LLM with Open-Source LLMs
NVIDIA Accelerates Google
DeepMind
’s
DiffusionGemma
for Local
AI
🔀
Transformers
Content type:
Blog
blogs.nvidia.com
·
7h
7 hours ago
Actions for NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
A system programmer’s guide to
LLM
inference
🤖
ML
Content type:
Blog
blog.xiangpeng.systems
·
2d
2 days ago
·
Hacker News
Actions for A system programmer’s guide to LLM inference
Stop Wasting GPU Budget: Autoscaling
AI
Inference
on Kubernetes with KEDA
📈
Time Series
cloudnativenow.com
·
2d
2 days ago
Actions for Stop Wasting GPU Budget: Autoscaling AI Inference on Kubernetes with KEDA
Build a Medical Report Analyzer on Dedicated
Inference
with Python
🔀
Transformers
digitalocean.com
·
6d
6 days ago
Actions for Build a Medical Report Analyzer on Dedicated Inference with Python
Fine
tuning
classification in Elixir
🤖
ML
elixirstatus.com
·
2d
2 days ago
Actions for Fine tuning classification in Elixir
Using local LLMs for agentic coding
🔀
Transformers
Content type:
Blog
blog.alexewerlof.com
·
6d
6 days ago
Actions for Using local LLMs for agentic coding
End-to-end encrypted ML
inference
with Amazon SageMaker
AI
and FHE
🔧
Feature Engineering
Content type:
Blog
aws.amazon.com
·
2d
2 days ago
Actions for End-to-end encrypted ML inference with Amazon SageMaker AI and FHE
Discrete
Diffusion
Modelling
by Estimating the Ratios of the Data Distribution
🤖
ML
Content type:
News
Content type:
Blog
leetarxiv.substack.com
·
1d
1 day ago
·
Substack
,
r/programming
Actions for Discrete Diffusion Modelling by Estimating the Ratios of the Data Distribution
How LLMs work | Practical Leaders
🔀
Transformers
practical-leaders.com
·
6d
6 days ago
·
Hacker News
Actions for How LLMs work | Practical Leaders
Why LLMs (still) lack taste
🎮
Reinforcement Learning
beyondtheprior.com
·
1d
1 day ago
·
Hacker News
Actions for Why LLMs (still) lack taste
Conversational
AI
vs
generative
AI
: What's the difference?
🔀
Transformers
twilio.com
·
6d
6 days ago
Actions for Conversational AI vs generative AI: What's the difference?
LeLab Is
Hugging
Face
’s New Browser-Based GUI for the LeRobot Ecosystem
🔀
Transformers
Content type:
News
hackster.io
·
1d
1 day ago
Actions for LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem
lightmetal: GPU
LLM
Inference
From a Single Java 25 JAR
🤖
ML
Content type:
Blog
adambien.blog
·
1d
1 day ago
Actions for lightmetal: GPU LLM Inference From a Single Java 25 JAR
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help