Model Deployment, Cross-framework, Inference Engine, Optimization

A QOJ week
blog.mitrichev.ch·1d·
🔍Type Checkers
Flag this post
Great, now even malware is using LLMs to rewrite its code, says Google, as it documents new phase of 'AI abuse'
pcgamer.com·21h
🤖AI Coding Tools
Flag this post
AWS S3 Vectors at scale: Real performance numbers at 10 million Vectors
dev.to·23h·
Discuss: DEV
✂️CUTLASS
Flag this post
SAP’s AI Model “sap-rpt-1” is a Research Project, Not a Revolution
pub.towardsai.net·6h
🎓Model Distillation
Flag this post
Migration Case: From Azkaban to DolphinScheduler
dev.to·1d·
Discuss: DEV
🤖Automation
Flag this post
Unlock Multi-Domain NLP: Adapt Pre-trained Models Without the Heavy Lifting
dev.to·1d·
Discuss: DEV
🎓Model Distillation
Flag this post
How to Design Efficient Memory Architectures for Agentic AI Systems
pub.towardsai.net·2d
Flash Attention
Flag this post
Deep Dive in Transparent Proxy Code
dev.to·27m·
Discuss: DEV
📜TorchScript
Flag this post
Unlock the Power of GANs: Train with Tiny Datasets!
dev.to·2d·
Discuss: DEV
📊Gradient Accumulation
Flag this post
The Art of Luminous Code: A Journey with Dynamic `import()` in Node.js
dev.to·3d·
Discuss: DEV
💡LSP
Flag this post
Build software sustainably in the AI era
cloud.google.com·1d
🤖AI Coding Tools
Flag this post
Building Syllabi – Agentic AI with Vercel AI SDK, Dynamic Tool Loading, and RAG
dev.to·4d·
Discuss: DEV
🤖AI Coding Tools
Flag this post
MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-AugmentedGeneration Systems
dev.to·6h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
## Adaptive Multi-Heuristic Intrusion Detection for Collaborative Welding Robot Networks
freederia.com·14h
🤖AI Coding Tools
Flag this post
Evaluating Generative AI as an Educational Tool for Radiology Resident Report Drafting
arxiv.org·1d
🤖AI Coding Tools
Flag this post
CoCoVa: Chain of Continuous Vision-Language Thought for Latent Space Reasoning
arxiv.org·2d
🧮cuDNN
Flag this post
Teaching AI to Take Initiative – Building a Self-Thinking App with LangGraph and Ollama
dev.to·2d·
Discuss: DEV
🤖AI Coding Tools
Flag this post
What's the stack for going from a fine-tune on vLLM to a simple, paid public API?
reddit.com·1d·
Discuss: r/LocalLLaMA
🚀MLOps
Flag this post