Model Interchange, Cross-framework, Inference Runtime, Model Export

I made a tensor runtime & inference framework in C (good for learning how inference works)
github.comยท4hยท
๐Ÿ“œTorchScript
Flag this post
Reasoning Models Sometimes Output Illegible Chains of Thought
arxiv.orgยท41m
๐ŸŽ“Model Distillation
Flag this post
ClipTagger-12B VLM: Frame Captioning Tutorial
dev.toยท13hยท
Discuss: DEV
๐ŸŽ๏ธTensorRT
Flag this post
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
sebastianraschka.comยท2hยท
Discuss: r/LLM
๐Ÿ‘๏ธAttention Optimization
Flag this post
Can-t stop till you get enough
cant.bearblog.devยท11hยท
Discuss: Hacker News
๐Ÿ“œTorchScript
Flag this post
Incremental Compilation in Recursiveโ€‘Descent Parser (Roslyn)
langdev.stackexchange.comยท10hยท
Discuss: Hacker News
๐Ÿ•Ruff
Flag this post
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.ioยท4hยท
Discuss: Hacker News
๐ŸŽฏGPU Kernels
Flag this post
Automated Anomaly Detection & Root Cause Analysis in Complex System Simulations via Adaptive Bayesian Networks
dev.toยท1dยท
Discuss: DEV
โšกONNX Runtime
Flag this post
A Practitioner's Guide to Kolmogorov-Arnold Networks
arxiviq.substack.comยท11hยท
Discuss: Substack
๐Ÿ“‰Model Quantization
Flag this post
Relation-Aware Bayesian Optimization of DBMS Configurations Guided by Affinity Scores
arxiv.orgยท41m
โšกONNX Runtime
Flag this post
When Five Dumb AIs Beat One Smart AI: The Case for Multi-Agent Systems
ksramalakshmi.medium.comยท19hยท
Discuss: r/LocalLLaMA
๐Ÿค–AI Coding Tools
Flag this post
Weak-To-Strong Generalization
lesswrong.comยท1d
๐Ÿ“‰Model Quantization
Flag this post
TinyML is the most impressive piece of software you can run on any ESP32
xda-developers.comยท2d
โšกONNX Runtime
Flag this post
Synthesized Generative Modeling via Graph-Constrained Semantic Embedding
dev.toยท13hยท
Discuss: DEV
๐ŸŽ“Model Distillation
Flag this post
ZkML Breakthrough: 13B Models Verified in 15 Minutes
lightcapai.medium.comยท13hยท
Discuss: Hacker News
๐ŸŽฏTensor Cores
Flag this post
Active transfer learning for structural health monitoring
arxiv.orgยท41m
๐ŸŽ“Model Distillation
Flag this post
Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
arxiv.orgยท41m
๐ŸงฎcuDNN
Flag this post
From product to system network challenges in system of systems lifecycle management
arxiv.orgยท41m
๐Ÿ’กLSP
Flag this post