Edge AI Optimization

Feeds to Scour
SubscribedAll
Scoured 17 posts in 10.5 ms

LLMCodec: Adapting Video Codecs for Efficient Weight Compression of Large Language Models

 🤖LLM  Content type: Academic
arxiv.org·
Less-relevant results

Uncle Sam considers buying a seat on the Titanic

 Edge AI  Content type: News

Spacecoin Signs $100M Deal to Deploy DePIN Satellites (1 minute read)

 Edge AI
threadreaderapp.com
·

HydraCIL: Decoupled Class-Incremental Learning through Prototype-Guided Multi-Head Classifiers

 🧠Machine Learning  Content type: Academic
arxiv.org·

STEPS: Semantic-Contract-Guided Scheduling for LLM-Assisted Natural-Language-Driven Edge AI Services

 Edge AI  Content type: Academic
arxiv.org·

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

 🤖AI

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

 ⚗️Knowledge Distillation  Content type: News  Content type: Blog
blog.google··Hacker News

A 65 nm Multi-Modal Bayesian Inference Engine with 16.3 fJ/Sample Calibration-Free GRNG for Risk-Aware At-Home Skin Lesion Screening

 🧠Machine Learning  Content type: Academic
arxiv.org·

Accuracy-Configurable Floating-Point Multiplier Design for SRAM-Based Compute-in-Memory

 Edge AI  Content type: Academic
arxiv.org·

Show HN: I nerfed our coding agents on purpose

 🤝Human-AI Collaboration  Content type: Discussion

Jeff Bezos Is Funding a Wild Hunt for the Brain’s ‘Core Algorithm’

 🧠Machine Learning

A 65 nm Trustworthy Hypoglycemia Forecasting Engine Achieving 11.3 nJ per Inference

 Edge AI  Content type: Academic
arxiv.org·

Magenta RealTime 2: Open and Local Live Music Models

 LLMs

Amazon Ring's Familiar Faces – Perfect Privacy and Environmental Storm

 🛡️Privacy  Content type: Blog

GenAutoML: An Agentic Framework for Dynamic Architecture Generation and Optimization in Time-Series Analysis

 🧠Machine Learning  Content type: Academic
arxiv.org·

Do Transformers Need Three Projections? Systematic Study of QKV Variants

 LLMs  Content type: Academic
arxiv.org··Hacker News

BIDENT: Heterogeneous Operator-level Mapping for Efficient Edge Inference

 Edge AI  Content type: Academic
arxiv.org·

No more posts from hop1.ng.1357's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help