WebAssembly, On-device Models, TensorFlow Lite, Mobile Inference

A Thesis and Playbook for Edge AI
ondeviceguy.substack.com·1d·
Discuss: Substack
Serverless
Flag this post
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·1d
🦀Rust
Flag this post
KTransformers Open Source New Era: Local Fine-tuning of Kimi K2 and DeepSeek V3
reddit.com·4h·
Discuss: r/LocalLLaMA
Serverless
Flag this post
Writing an LLM from scratch, part 27 – what's left, and what's next?
gilesthomas.com·15h·
Discuss: Hacker News
Serverless
Flag this post
The Case That A.I. Is Thinking
newyorker.com·1d·
Discuss: Hacker News
🤖AI
Flag this post
Small Vs. Large Language Models
semiengineering.com·1d·
Discuss: Hacker News, r/LLM
🤖AI
Flag this post
Show HN: Refusal-Aware Logical Framework for LLMs
github.com·1h·
Discuss: Hacker News
🤖AI
Flag this post
Real-time stock volatility prediction with deep learning on a time-series DB
medium.com·8h·
Discuss: Hacker News
🧭Vector Databases
Flag this post
LangChain vs LangGraph: A Beginner’s Guide to Building Smarter AI Workflows
hackernoon.com·1d
🤖AI
Flag this post
Show HN: Oodle – Unified Debugging with OpenSearch and Grafana
blog.oodle.ai·56m·
Discuss: Hacker News
Serverless
Flag this post
Ranking LLMs based on 180k French votes (French government's AI arena)
comparia.beta.gouv.fr·3h·
Discuss: Hacker News
🤖AI
Flag this post
Beyond Standard LLMs
magazine.sebastianraschka.com·3h·
Discuss: Hacker News, r/LLM
🤖AI
Flag this post
OlmoEarth: A new state-of-the-art Earth observation foundation model family
allenai.org·2h·
Discuss: Hacker News
🤖AI
Flag this post
The race to train AI robots how to act human in the real world
latimes.com·6h·
Discuss: Hacker News
🤖AI
Flag this post
AI Uses Functions to Fetch Real Data (Not Just Chat)
farukalpay.substack.com·3h·
Discuss: Substack
🤖AI
Flag this post
Why stop at 1M tokens when you can have 10M?
news.ycombinator.com·5h·
Discuss: Hacker News
🦀Rust
Flag this post
Why agents do not write most of our code – a reality check
octomind.dev·22h·
Discuss: Hacker News
🦀Rust
Flag this post
Open Sourcing Kubetorch
run.house·1d·
Discuss: Hacker News
Serverless
Flag this post
Hybrid-Attention models are the future for SLMs
inference.net·14h·
Discuss: Hacker News
🦀Rust
Flag this post
Lazy loading isn't the magic pill to fix AI Inference
tensorfuse-docs.mintlify.dev·2h·
Discuss: Hacker News
Serverless
Flag this post