Model Quantization, ONNX Runtime, Embedded Inference, TinyML

Most people rarely use AI, and dark personality traits predict who uses it more
psypost.org·12h
🏗️AI Infrastructure
The Linus Method: How we simplified RFC reviews
devashish.me·2d
☁️Serverless Rust
Free Software Hasn't Won
dorotac.eu·2h
📂open source
Krish Naik: Complete RAG Crash Course With Langchain In 2 Hours
dev.to·22h
🏗️AI Infrastructure
Ready or not, enterprises are betting on AI
techcrunch.com·1d
🧠AI
Tech With Tim: How to Build AI Agents in Python
dev.to·1d
🤖AI agents
Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization
arxiv.org·5d
💻Local LLMs
MultiCNKG: Integrating Cognitive Neuroscience, Gene, and Disease Knowledge Graphs Using Large Language Models
arxiv.org·3d
💻Local LLMs
Cluster Paths: Navigating Interpretability in Neural Networks
arxiv.org·3d
🏗️AI Infrastructure
The billion-dollar infrastructure deals powering the AI boom
techcrunch.com·2d
🏗️AI Infrastructure
How to Teach Large Multimodal Models New Skills
arxiv.org·2d
💻Local LLMs
Krish Naik: Agentic AI 3.0 - Live Ultimate RAG Bootcamp Course Announcement
dev.to·1d
🤖AI agents
What is a Large Language Model (LLM)
dev.to·2d
💻Local LLMs
Structured Cognition for Behavioral Intelligence in Large Language Model Agents: Preliminary Study
arxiv.org·4d
🏗️AI Infrastructure
AI-Driven Predictive Maintenance Optimization via Federated Learning in Semiconductor Fabrication
dev.to·4d
🏗️AI Infrastructure
The 9 Best CLIs with Artificial Intelligence
dev.to·23h
🤖AI agents
AsyncSpade: Efficient Test-Time Scaling with Asynchronous Sparse Decoding
arxiv.org·2d
Tokio