🔄 ONNX - miterion · Scour

snapllm/snapllm: 🔥 🔥 Alternative to Ollama 🔥 🔥 multi-model <1ms LLM switching

github.com·1h·

Discuss: Hacker News

🏎️TensorRT

Leaning Into the Coding Interview: Lean 4 vs Dafny cage-match

ntaylor.ca·42m·

Discuss: Lobsters, Hacker News

🔍Type Checkers

LLM Optimization: From Research to Production

dev.to·5h·

Discuss: DEV

Building an Embedding API with Rust, Arm, and EmbeddingGemma on AWS Lambda

sobolev.substack.com·1d·

Discuss: Substack

Show HN: PolyMCP – Orchestrate AI agents across Python tools and MCP servers

news.ycombinator.com·1d·

Discuss: Hacker News

Compiling High-Level Neural Network Specifications into VNN-LIB Queries

arxiv.org·1d

⚡ONNX Runtime

A Practical Guide to Multi-Model AI Workflows

dev.to·21h·

Discuss: DEV

🤖AI Coding Tools

Show HN: Langasync – Use OpenAI/Anthropic Batch APIs with LangChain Chains

github.com·5h·

Discuss: Hacker News

📜TorchScript

Introducing Dedicated Container Inference: Delivering 2.6x faster inference for custom AI models

together.ai·2d

⚡ONNX Runtime

Is anyone smarter than me able to help with this?

forum.godotengine.org·3h·

Discuss: r/godot

🔍Type Checkers

Completed Hyperparameter Transfer across Modules, Width, Depth, Batch and Duration

machinelearning.apple.com·1d

🎓Model Distillation

Computer Vision Agent

npmjs.com·8h·

Discuss: Hacker News

⚡ONNX Runtime

multi-model weather with uncertainty and activity insights

klimly.com·8h·

Discuss: Hacker News

⚡ONNX Runtime

Show HN: A small embeddable Datalog engine in Zig

news.ycombinator.com·6h·

Discuss: Hacker News

⚡ONNX Runtime

exascaleproject.org·18m

⚡ONNX Runtime

AI Agent Mimics Scientific Reasoning To Uncover Hidden Equations

quantumzeitgeist.com·20h

⚡ONNX Runtime

Running an experiment with Claude Code overnight

blog.nolank.ca·2h

🤖AI Coding Tools

Olmix: A framework for data mixing throughout LM development

allenai.org·1d

⚡ONNX Runtime

SWE-rebench Jan 2026: GLM-5, MiniMax M2.5, Qwen3-Coder-Next, Opus 4.6, Codex Performance

swe-rebench.com·1d·

Discuss: r/LocalLLaMA

⏱️Benchmarking

HiFloat4 Format for Language Model Inference

arxiv.org·1d

📉Model Quantization

Loading more...