⚡ ONNX Runtime - miterion · Scour

Self-hosted AI research engine using SearXNG + Ollama

github.com·8h·

Discuss: r/selfhosted

Show HN: Fighting the War Against Expensive Reinforcement Learning

cadenza-landing-qtu7gbjwb-akshparekh123-3457s-projects.vercel.app·22h·

Discuss: Hacker News

🤖AI Coding Tools

MDST Engine: run GGUF models in the browser with WebGPU/WASM

mdst.app·1d·

Discuss: Hacker News

How We Built the Fastest Kimi K2.5 on Artificial Analysis

baseten.co·1d·

Discuss: Hacker News

🏎️TensorRT

Configuration-to-Performance Scaling Law with Neural Ansatz

arxiv.org·1d

🏎️TensorRT

Prime Intellect Lab: a full-stack platform for training your own models

primeintellect.ai·1d·

Discuss: Hacker News

🤖AI Coding Tools

AI-Chat with Strava — Developing an LLM-Integration with MCP

pub.towardsai.net

·1h

🤖AI Coding Tools

Gibbs Measures from Deep Shaped Multilayer Perceptrons

link.aps.org·16h

📊Gradient Accumulation

How Andrej Karpathy Built a Working Transformer in 243 Lines of Code

analyticsvidhya.com·16h

📜TorchScript

AI Analytics Platforms

trendhunter.com·14h

🤖AI Coding Tools

Optimal timing for superintelligence

marginalrevolution.com·5h

👁️Attention Optimization

Leading Inference Providers Cut AI Costs by up to 10x With Open Source Models on NVIDIA Blackwell

blogs.nvidia.com·13h

🏎️TensorRT

Show HN: The Algorithm's Favorite Child

chatbotkit.com·14h·

Discuss: Hacker News

🧩Attention Kernels

SotA ARC-AGI-2 Results with REPL Agents

symbolica.ai·20h·

Discuss: Hacker News

🤖AI Coding Tools

Yori – Isolating AI Logic into "Semantic Containers" (Docker for Code)

news.ycombinator.com·1h·

Discuss: Hacker News

🤖AI Coding Tools

Link-checking with generative AI

natemeyvis.com·1d

🤖AI Coding Tools

MiniMaxAI MiniMax-M2.5 has 230b parameters and 10b active parameters

openhands.dev·8h·

Discuss: r/LocalLLaMA

⏱️Benchmarking

EyesOff: Why Some Models Quantize Better Than Others

ym2132.github.io·1d·

Discuss: Hacker News

📉Model Quantization

DeepSeek R1 on Localhost: Building a Private Coding Assistant for $0

dev.to·1d·

Discuss: DEV

🤖AI Coding Tools

CCBench: How do agents perform on codebases that aren't part of training data?

ccbench.org·7h·

Discuss: Hacker News

🤖AI Coding Tools

Loading more...