WebAssembly, On-device Models, TensorFlow Lite, Mobile Inference

A Thesis and Playbook for Edge AI
ondeviceguy.substack.com·1d·
Discuss: Substack
Serverless
Flag this post
Continuous Autoregressive Language Models
shaochenze.github.io·3h·
Discuss: Hacker News
🧭Vector Databases
Flag this post
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·2d
🦀Rust
Flag this post
Topographical sparse mapping: A training framework for deep learning models
sciencedirect.com·9h·
Discuss: Hacker News
🧭Vector Databases
Flag this post
Generalizing Test-Time Compute-Optimal Scaling as an Optimizable Graph
huggingface.co·2h·
Discuss: Hacker News
🧭Vector Databases
Flag this post
KTransformers Open Source New Era: Local Fine-tuning of Kimi K2 and DeepSeek V3
reddit.com·17h·
Discuss: r/LocalLLaMA
Serverless
Flag this post
Enabling Trillion-Parameter Models on AWS EFA
research.perplexity.ai·6h·
Discuss: Hacker News
Serverless
Flag this post
GEN-0: SoTA 10B+ Foundation Model for Robotics with Harmonic Reasoning
generalistai.com·10h·
Discuss: Hacker News
🤖AI
Flag this post
Inside Pinecone: Slab Architecture
pinecone.io·13h·
Discuss: Hacker News
Serverless
Flag this post
Context Engineering with Real-Time, Processed Data
confluent.io·4h·
Discuss: Hacker News
Serverless
Flag this post
Choosing the best AI coding agent for Bitrise
bitrise.io·8h·
Discuss: Hacker News
🤖AI
Flag this post
Writing an LLM from scratch, part 27 – what's left, and what's next?
gilesthomas.com·1d·
Discuss: Hacker News
Serverless
Flag this post
Cutting LLM Batch Inference Time in Half: Dynamic Prefix Bucketing at Scale
daft.ai·13h·
Discuss: Hacker News
Serverless
Flag this post
Small Vs. Large Language Models
semiengineering.com·1d·
Discuss: Hacker News, r/LLM
🤖AI
Flag this post
Show HN: ReadMyMRI DICOM native preprocessor with multi model consensus/ML pipes
github.com·7h·
Discuss: Hacker News
Serverless
Flag this post
Most Gen AI Players Remain 'Far Away' from Profiting: Interview with Andy Wu
library.hbs.edu·10h·
Discuss: Hacker News
🤖AI
Flag this post
The Case That A.I. Is Thinking
newyorker.com·1d·
Discuss: Hacker News
🤖AI
Flag this post
I Taught an AI to Dream
blog.minibase.ai·13h·
Discuss: Hacker News
🤖AI
Flag this post
LangChain vs LangGraph: A Beginner’s Guide to Building Smarter AI Workflows
hackernoon.com·1d
🤖AI
Flag this post
Open Source Context-Aware PII Classifier
corp.roblox.com·10h·
Discuss: Hacker News
🤖AI
Flag this post