I made a tensor runtime & inference framework in C (good for learning how inference works)
๐TorchScript
Flag this post
Reasoning Models Sometimes Output Illegible Chains of Thought
arxiv.orgยท41m
๐Model Distillation
Flag this post
I'm the author of LocalAI (the local OpenAI-compatible API). We just released v3.7.0 with full Agentic Support (tool use!), Qwen 3 VL, and the latest llama.cpp
๐กLSP
Flag this post
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
๐๏ธAttention Optimization
Flag this post
Can-t stop till you get enough
๐TorchScript
Flag this post
<p>**Abstract:** Accurate characterization of geothermal fluids and subsurface reservoirs is critical for efficient and sustainable energy extraction. Tradition...
freederia.comยท11h
๐MLOps
Flag this post
Automated Anomaly Detection & Root Cause Analysis in Complex System Simulations via Adaptive Bayesian Networks
โกONNX Runtime
Flag this post
Relation-Aware Bayesian Optimization of DBMS Configurations Guided by Affinity Scores
arxiv.orgยท41m
โกONNX Runtime
Flag this post
When Five Dumb AIs Beat One Smart AI: The Case for Multi-Agent Systems
๐คAI Coding Tools
Flag this post
Weak-To-Strong Generalization
lesswrong.comยท1d
๐Model Quantization
Flag this post
TinyML is the most impressive piece of software you can run on any ESP32
xda-developers.comยท2d
โกONNX Runtime
Flag this post
Synthesized Generative Modeling via Graph-Constrained Semantic Embedding
๐Model Distillation
Flag this post
Active transfer learning for structural health monitoring
arxiv.orgยท41m
๐Model Distillation
Flag this post
Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals
arxiv.orgยท41m
๐งฎcuDNN
Flag this post
From product to system network challenges in system of systems lifecycle management
arxiv.orgยท41m
๐กLSP
Flag this post
Loading...Loading more...