Edge AI

edge inference, on-device AI, NVIDIA Jetson, TensorRT

FlexNPU: Transparent NPU Virtualization for Dynamic LLM Prefill-Decode Co-location

 🇨🇳Chinese AI  Content type: Academic
arxiv.org

Integrate on-device AI models into your app using Core AI - WWDC26 - Videos

 🤖AI

localcodeai/localcode: Turn natural language into CLI commands using Apple's on-device AI

 💻Terminal Tools  Content type: Code
github.com · Hacker News

Apple rebuilt its on-device AI stack at WWDC 2026

 🧠Machine Learning  Content type: Blog
ziraph.com · Hacker News

Apple Silicon's on-device AI bet hasn't moved – only the chip range that runs it

 🤖AI

From Human Guidance to Autonomy: Agent Skill System for End-to-End LLM Deployment on Spatial NPUs

 🦉Qwen  Content type: Academic
arxiv.org

Spacecoin Signs $100M Deal to Deploy DePIN Satellites (1 minute read)

 📱Edge AI Optimization
threadreaderapp.com

apple/coreai-models: Model export recipes, Python primitives, and Swift runtime utilities for on-device AI

 🐍Python  Content type: Code
github.com · Hacker News

Search sound libraries with natural language, on-device AI

 🔎Semantic Search

HydraCIL: Decoupled Class-Incremental Learning through Prototype-Guided Multi-Head Classifiers

 🧠Machine Learning  Content type: Academic
arxiv.org

Apple Announces Liquid Glass Improvements and Transparency Slider

 🪄Prompt Engineering  Content type: News

Uncle Sam considers buying a seat on the Titanic

 🎭Claude  Content type: News

OpenCV 5.0 Released With Rewritten DNN Engine, Built-In LLM & VLM Support

 🧠Machine Learning
phoronix.com · Hacker News

hashwnath/KMCP: Open-source MCP server for your docs. Zero LLM at query time. docker compose up and go.

 🔧Agent Tooling  Content type: Code
github.com · Hacker News

CFRNet: Cycle-Consistent Fixed-Point Training for Real-Time Blind Face Restoration on Consumer Embedded NPUs

 🔢BitNet Inference  Content type: Academic
arxiv.org

Your Lambda isn't leaking memory — your metrics are lying to you

 🪝eBPF  Content type: Blog

Introducing the Third Generation of Apple’s Foundation Models

 🧠Machine Learning

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 LLMs  Content type: Code
github.com · Hacker News

STEPS: Semantic-Contract-Guided Scheduling for LLM-Assisted Natural-Language-Driven Edge AI Services

 📱Edge AI Optimization  Content type: Academic
arxiv.org

anthonypjshaw/doom-onnx

 🪝eBPF
