🐿️ Scour
Browse
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
📉 Model Quantization
Model Compression, Inference Optimization, Edge Deployment, Performance
Filter Results
Timeframe
Hot
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Regression Is All You Need
blog.tilderesearch.com
·
1d
·
Discuss:
Hacker News
🧱
Chunking
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Building intelligent physical AI: From edge to cloud with Strands Agents, Bedrock AgentCore, Claude 4.5, NVIDIA GR00T, and Hugging Face LeRobot
aws.amazon.com
·
17h
📱
Edge AI
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Meta-Optimized Continual Adaptation for bio-inspired soft robotics maintenance in hybrid quantum-classical pipelines
dev.to
·
4h
·
Discuss:
DEV
🧠
Neuroplasticity
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Day 5: 21 Days of Building a Small Language Model: Data
reddit.com
·
21h
·
Discuss:
r/LocalLLaMA
📱
Edge AI
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Fine-tuning Gemma 3 for mobile
opensource.googleblog.com
·
15h
·
Discuss:
Hacker News
📱
Edge AI
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Building a RAG Server with PostgreSQL – Part 3: Deploying Your RAG API
pgedge.com
·
1d
·
Discuss:
Hacker News
📊
Data Pipelines (ETL)
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
CVHub520/X-AnyLabeling
github.com
·
2d
📸
Visual Regression Testing
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Launch HN: Mentat (YC S16) – Controlling LLMs with Runtime Intervention
playground.ctgt.ai
·
3d
·
Discuss:
Hacker News
🧩
LLM Integration
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
How much does it cost to generate an image?
flopsandfinance.substack.com
·
2d
·
Discuss:
Substack
📱
Edge AI
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Rcarmo/Python-FastAPI-trmnl-server: Lightweight TRMNL BYOS server
github.com
·
2h
·
Discuss:
Hacker News
⚡
FastAPI
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Reverse-Engineering the RK3588 NPU: Hacking Memory Limits to run massive Vision Transformers
amohan.dev
·
1d
·
Discuss:
r/LocalLLaMA
💾
Retro Computing
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
There are things that AIs understand and no human can
jovex.substack.com
·
19h
·
Discuss:
Substack
🛡️
AI Security
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Deploying NVIDIA Dynamo & LMCache for LLMs: Installation, Containers, and Integration
dev.to
·
2d
·
Discuss:
DEV
🧩
LLM Integration
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
When Deep Learning Meets the Devil's Wheel: RL for European Roulette (Part 1)"
dev.to
·
11h
·
Discuss:
DEV
📱
Edge AI
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
A Glance at GPU Goodness in Java: LLM Inference with TornadoVM
javaadvent.com
·
2d
·
Discuss:
Hacker News
💸
Affordable LLMs
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
External Semantic Memory Architecture for Multi-Agent LLM Systems
github.com
·
3d
·
Discuss:
DEV
💸
Affordable LLMs
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Implementing nanochat using AMD’s MI300X hardware and dev credits.
theatomsofai.substack.com
·
4d
·
Discuss:
r/LocalLLaMA
💸
Affordable LLMs
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
The Fluid Substrate: Streaming 1TB Models from NVMe via Io_uring
zenodo.org
·
3d
·
Discuss:
Hacker News
🦭
Podman
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Can a 14B Model Match a 100B+ Model? We Fine-Tuned 8 Models to Find Out
orq.ai
·
3d
·
Discuss:
Hacker News
🔧
DSPy
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Edge-to-Cloud Swarm Coordination for heritage language revitalization programs with embodied agent feedback loops
dev.to
·
1d
·
Discuss:
DEV
🔄
Autonomous Agents
Preview
Share
Show Feeds
Report Post
Harmful Content
Off Topic
Low Quality
Spam
Misleading
Duplicate
Wrong Language
Loading...
Loading more...
Page 2 »