Model Interchange, Cross-framework, Inference Runtime, Model Export

I made a tensor runtime & inference framework in C (good for learning how inference works)
github.com·18h
📜TorchScript
Enhanced Richardson Extrapolation via Adaptive Kernel Regression and Uncertainty Quantification
dev.to·6h·
Discuss: DEV
🏎️TensorRT
How to Use Multimodal AI Models With Docker Model Runner
docker.com·6h
🚀MLOps
LangChain vs LangGraph: A Beginner’s Guide to Building Smarter AI Workflows
hackernoon.com·4h
🤖AI Coding Tools
We found embedding indexing bottleneck in the least expected place: JSON parsing
nixiesearch.substack.com·3h·
Discuss: Substack
🐕Ruff
Reasoning Models Sometimes Output Illegible Chains of Thought
arxiv.org·14h
🎓Model Distillation
Makefile vs. YAML: Modernizing verification simulation flows
edn.com·9h
🏗️Build Optimization
Deploy an LLM inference service on OpenShift AI
developers.redhat.com·12h
ONNX Runtime
Understanding Federated Learning: Best Practices for Implementing Privacy-Preserving AI in C# Projects
dev.to·11h·
Discuss: DEV
🔗NCCL
Beating XLoader at Speed: Generative AI as a Force Multiplier for Reverse Engineering
research.checkpoint.com·6h
🐕Ruff
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
sebastianraschka.com·16h·
Discuss: r/LLM
👁️Attention Optimization
Can-t stop till you get enough
cant.bearblog.dev·1d·
Discuss: Hacker News
📜TorchScript
ClipTagger-12B VLM: Frame Captioning Tutorial
dev.to·1d·
Discuss: DEV
🏎️TensorRT
Incremental Compilation in Recursive‑Descent Parser (Roslyn)
langdev.stackexchange.com·1d·
Discuss: Hacker News
🐕Ruff
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·18h·
Discuss: Hacker News
🎯GPU Kernels
Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench
github.com·7h
🤖AI Coding Tools
GSoC 2025: Introducing an ABI Lowering Library
blog.llvm.org·19h
📊Profiling Tools
Relation-Aware Bayesian Optimization of DBMS Configurations Guided by Affinity Scores
arxiv.org·14h
ONNX Runtime