AI Engineering
AI engineer, ML pipelines, model deployment, inference
mirkolenz/llmhop: Tiny, stateless Go router that dispatches OpenAI-compatible requests to single-model vLLM and sglang backends with zero external dependencies
🧠
LLMs
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
AI Governance Tools: How To Achieve Compliance and Visibility
⚖️
AI Ethics
Content type:
Blog
blog.n8n.io
·
4h
4 hours ago
Central Bank strengthens data governance for AI solutions
⚙️
MLOps
Content type:
News
en.apa.az
·
1d
1 day ago
How we fight GPU scarcity without compromise
🧠
LLMs
Content type:
Blog
equixly.com
·
5d
5 days ago
·
Hacker News
Youssof Altoukhi (@Youssofal_)
🧠
LLMs
xcancel.com
·
2d
2 days ago
·
r/LocalLLaMA
Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good
🧠
LLMs
Content type:
Blog
towardsai.net
·
2d
2 days ago
Google's new open model DiffusionGemma generates text from noise instead of word by word
🤖
ai
the-decoder.com
·
25m
25 minutes ago
heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.
🧠
LLMs
Content type:
Code
github.com
·
3d
3 days ago
·
r/LocalLLaMA
CLP: Collocation-Length Prediction for Zero-Loss Adaptive Multi-Token Inference
🧠
LLM Inference
Content type:
Academic
arxiv.org
·
15h
15 hours ago
Unlocking AI flexibility in Europe: A guide to cross-region inference for EU data processing and model access
⚙️
MLOps
Content type:
Blog
aws.amazon.com
·
2d
2 days ago
Understanding Agentic AI Infrastructure
⚖️
AI Ethics
Content type:
Blog
mirantis.com
·
21h
21 hours ago
Six Proto6 Vulnerabilities in protobuf.js Expose Node.js Apps to RCE and DoS
🧠
LLMs
thehackernews.com
·
14h
14 hours ago
ADATA Memory and Storage Products at Computex 2026
🤖
ai
techpowerup.com
·
5d
5 days ago
For Robotaxis, Safety Must Be Built In, Not Bolted On
🧠
LLMs
Content type:
Blog
blogs.nvidia.com
·
45m
45 minutes ago
KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++
🤖
Machine Learning
Content type:
Code
github.com
·
3d
3 days ago
·
Hacker News
The Practitioner’s Guide to AgentOps
🧠
LLMs
machinelearningmastery.com
·
2d
2 days ago
Intel aims Crescent Island at inference
🧠
LLM Inference
jonpeddie.com
·
6d
6 days ago
ASTRA-sim 3.0: Next-Level Distributed Machine Learning Simulations via High-Fidelity GPU and Infrastructure Modeling
🧠
LLM Inference
Content type:
Academic
arxiv.org
·
15h
15 hours ago
Qualcomm Announces On-Device AI Claw Ecosystem Plan
🧠
LLM Inference
autonews.gasgoo.com
·
2d
2 days ago
FinOps FOCUS specification becomes the common language for AI cost accountability
🕵️
Fraud Detection
siliconangle.com
·
1d
1 day ago