💻 Local LLMs - nmarshall · Scour

CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs

arxiv.org·4d

📝Parser Combinators

Harmonia: Algorithm-Hardware Co-Design for Memory- and Compute-Efficient BFP-based LLM Inference

arxiv.org·5d

GAN Augmentation: Augmenting Training Data using Generative Adversarial Networks

dev.to·3d·

Discuss: DEV

**Pulse‑Sequence Tuning for Fault‑Tolerant Exponentiation in Shor’s Algorithm on Transmon Qubits**

dev.to·2d·

Discuss: DEV

⚙️CPU Microarchitecture

Is artificial general intelligence already here? A new case that today's LLMs meet key tests

techxplore.com·3d

🏗️AI Infrastructure

AI Inference Pipelines – Building Low-Latency Systems With gRPC

youtube.com·5d

🏗️AI Infrastructure

Fast Autoscheduling for Sparse ML Frameworks

ajroot.pl·5d·

Discuss: Hacker News, r/Compilers

First Proof | Research-Level Math for AI Evaluation

1stproof.org·4d·

Discuss: Hacker News

🧩Constraint Programming

Give Your Agent a Language Server

blog.gorewood.games·3d

💬Language Servers

Human-like Search for Modern Applications

anvitra.ai·2d·

Discuss: Hacker News

🎯Vector Databases

Does "AI-Ready Data" simply mean "Good Data Modeling"?

motherduck.com·4d

🏗️AI Infrastructure

Issues with AI: Toxic Dependencies

blog.mathieui.net·4d·

Discuss: Hacker News

🤖AI Coding Tools

Kubernetes Operator for automated Jupyter Notebook validation in MLOps pipelines

reddit.com·3d·

Discuss: r/kubernetes

Local Agent Bench: Test 11 small LLMs on tool-calling judgment, on CPU, no GPU

github.com·3d·

Discuss: Hacker News, r/LocalLLaMA

🏗️AI Infrastructure

Designing MCP tool schemas that LLMs understand

news.ycombinator.com·1d·

Discuss: Hacker News

📐Data Modeling

Zuck, ConnectU, and intellectual property in the age of vibecoding

pechotierra.bearblog.dev·4d

A Ralph Loop for Reading: Beating GPT 5.2 with a 4k Context Window (and 4 GPUs)

stevehanov.ca·4d

💾Cache Optimization

Meta’s Next-Generation LLM ‘Avocado’ Surpasses Top Open-Source Models in Pretraining Alone

kmjournal.net·4d·

Discuss: Hacker News

🏠Self-hosted AI

Life at the Edge

asadk.com·3d·

Discuss: Hacker News

📡Edge Computing

Style tips for less experienced developers coding with AI

honnibal.dev·4d·

Discuss: Hacker News

🤖AI Coding Tools

Loading more...