🐿️ Scour
Browse
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🏗️ AI Infrastructure
Model Serving, GPU Clusters, Inference Optimization, MLOps
Filter Results
Timeframe
Hot
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
4979
posts in
39.9
ms
Optimizing LLM inference on Amazon SageMaker AI with BentoML’s LLM- Optimizer
aws.amazon.com
·
3d
🤖
AI Inference
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
TheRemyyy/neurox-ai: Neuromorphic Computing System GPU-accelerated, spiking neural network platform targeting 1-10M neurons with biological accuracy and real-time performance.
github.com
·
7h
·
Discuss:
Hacker News
🧠
Neuromorphic Chips
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Introducing the XLab AI Security Guide
lesswrong.com
·
5h
🛡️
Computer Security
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
🦉 From Broken Models to Living Systems: My Journey Building AI Without a GPU
dev.to
·
1d
·
Discuss:
DEV
📱
Edge AI
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
A tiny AI supercomputer for your desk
youtube.com
·
1d
·
Discuss:
r/hardware
⚡
Hardware Acceleration
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
AI Infrastructure Basics: How MCP Works
newsletter.systemdesign.one
·
1d
·
Discuss:
r/programming
🤖
AI Coding Tools
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Show HN: Why is ML inference still so ad-hoc in practice?
news.ycombinator.com
·
1d
·
Discuss:
Hacker News
🚀
MLOps
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Your Team Uses AI. Why Aren't You 10x Faster?
bits.logic.inc
·
3h
·
Discuss:
Hacker News
🤖
AI Coding Tools
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Yann LeCun’s VL-JEPA: The breakthrough that gives AI a "Mind's Eye" (instead of just a mouth).
hisohan.substack.com
·
4h
·
Discuss:
Substack
📱
Edge AI
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Christmas came with a pleasant GPU surprise
pechotierra.bearblog.dev
·
15h
⚡
Hardware Acceleration
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Show HN: Chat-DeepAI – DeepSeek pricing and getting-started guides (fan project)
chat-deepai.com
·
8h
·
Discuss:
Hacker News
📱
Edge AI
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
The Transformer Architecture: A Deep Dive into How LLMs Actually Work
dev.to
·
2h
·
Discuss:
DEV
🤖
Transformers
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
wwes4/AI_Accel_1.5x: AI acceleration framework for ~1.5x speedups in mid-sized models via tension-based pruning. Built utilizing xAI's Grok.
github.com
·
1d
·
Discuss:
Hacker News
🔥
PyTorch
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
GoMLX: Accelerating Machine Learning with Go, GPUs, and TPUs
dev.to
·
9h
·
Discuss:
DEV
🔥
Burn
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
AIAuditTrack: A Framework for AI Security system
arxiv.org
·
2d
🤖
AI Inference
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Inferal Workspace Architecture: How We Work at Inferal
gist.github.com
·
2d
·
Discuss:
Hacker News
🔄
Operational Transforms
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
The 2025 Guide to Machine Learning
ibm.com
·
1d
·
Discuss:
Hacker News
📱
Edge AI
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Microservices work perfectly fine while you’re just returning simple JSON. But the moment you start real-time token streaming from multiple AI agents simultaneously — distributed architecture turns…
linkedin.com
·
20h
·
Discuss:
r/programming
🎯
Microservices
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Building a Local-First RAG Engine for AI Coding Assistants
dev.to
·
8h
·
Discuss:
DEV
🤖
AI Coding Tools
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
How IntelliNode Automates Complex Workflows with Vibe Agents
towardsdatascience.com
·
9h
🤖
AI agents
Preview
Share
Show Feeds
Block Domain
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Loading...
Loading more...
Page 2 »