QMC: Efficient SLM Edge Inference via Outlier-Aware Quantization and Emergent Memories Co-Design
arxiv.org·1h
Edge AI: The future of AI inference is smarter local compute
infoworld.com·2d
IGAA: Intent-Driven General Agentic AI for Edge Services Scheduling using Generative Meta Learning
arxiv.org·1d
Artificial Intelligence
radiofreemobile.com·23h
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
machinelearning.apple.com·1d
Qdrant - Vector Database
qdrant.tech·1d
I replaced my ChatGPT subscription with a 12GB GPU and never looked back
xda-developers.com·10h
Co-optimization Approaches For Reliable and Efficient AI Acceleration (Peking University et al.)
semiengineering.com·12h
Learning from Models
rodney.bearblog.dev·1d
Why AI Needs GPUs and TPUs: The Hardware Behind LLMs
blog.bytebytego.com·2d