I made a tensor runtime & inference framework in C (good for learning how inference works)
📜TorchScript
Flag this post
Enhanced Richardson Extrapolation via Adaptive Kernel Regression and Uncertainty Quantification
🏎️TensorRT
Flag this post
How to Use Multimodal AI Models With Docker Model Runner
docker.com·6h
🚀MLOps
Flag this post
LangChain vs LangGraph: A Beginner’s Guide to Building Smarter AI Workflows
hackernoon.com·4h
🤖AI Coding Tools
Flag this post
Reasoning Models Sometimes Output Illegible Chains of Thought
arxiv.org·14h
🎓Model Distillation
Flag this post
Makefile vs. YAML: Modernizing verification simulation flows
edn.com·9h
🏗️Build Optimization
Flag this post
Deploy an LLM inference service on OpenShift AI
developers.redhat.com·12h
⚡ONNX Runtime
Flag this post
Beating XLoader at Speed: Generative AI as a Force Multiplier for Reverse Engineering
research.checkpoint.com·6h
🐕Ruff
Flag this post
I'm the author of LocalAI (the local OpenAI-compatible API). We just released v3.7.0 with full Agentic Support (tool use!), Qwen 3 VL, and the latest llama.cpp
💡LSP
Flag this post
Gated DeltaNet (Linear Attention variant in Qwen3-Next and Kimi Linear)
👁️Attention Optimization
Flag this post
Can-t stop till you get enough
📜TorchScript
Flag this post
ClipTagger-12B VLM: Frame Captioning Tutorial
🏎️TensorRT
Flag this post
AI coding transforms data engineering: How dltHub's open-source Python library helps developers create data pipelines for AI in minutes
venturebeat.com·4h
🤖AI Coding Tools
Flag this post
Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench
🤖AI Coding Tools
Flag this post
GSoC 2025: Introducing an ABI Lowering Library
blog.llvm.org·19h
📊Profiling Tools
Flag this post
Relation-Aware Bayesian Optimization of DBMS Configurations Guided by Affinity Scores
arxiv.org·14h
⚡ONNX Runtime
Flag this post
Loading...Loading more...