Running MiniMax-M2 locally - Existing Hardware Advice
reddit.com·2h·
Discuss: r/LocalLLaMA
🔧PTX
Flag this post
How Data 360 Vector Search Delivers Near Real-Time Intelligence on 90% of Enterprise Data
engineering.salesforce.com·1d
🤖AI Coding Tools
Flag this post
We found embedding indexing bottleneck in the least expected place: JSON parsing
nixiesearch.substack.com·1d·
Discuss: Substack
🐕Ruff
Flag this post
Algorithmic Complexity Reduction via Quantized State Space Search
dev.to·1h·
Discuss: DEV
📉Model Quantization
Flag this post
Attention Is All You Need for KV Cache in Diffusion LLMs
paperium.net·14h·
Discuss: DEV
🎯Tensor Cores
Flag this post
GrowthHacker: Automated Off-Policy Evaluation Optimization Using Code-Modifying LLM Agents
arxiv.org·14h
🤖AI Coding Tools
Flag this post
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·1d·
Discuss: Hacker News
🎯GPU Kernels
Flag this post
Hyper-Dimensional Bayesian Optimization for Enhanced Statistical Process Control
dev.to·3d·
Discuss: DEV
⏱️Benchmarking
Flag this post
Top Open Source Tools for Kubernetes ML: From Development to Production
dev.to·2h·
Discuss: DEV
🚀MLOps
Flag this post
Physics informed machine learning based predictive control for intelligent operation of edge datacenters
sciencedirect.com·2d
🎓Model Distillation
Flag this post
ASAN: A conceptual architecture for a self-creating, energy-efficient AI system
github.com·1d·
Discuss: Hacker News
🚀MLOps
Flag this post
The Learning Loop and LLMs
martinfowler.com·4h·
Discuss: Hacker News
🤖AI Coding Tools
Flag this post
Reforging the ReScript Build System
rescript-lang.org·2h·
🏗️Build Optimization
Flag this post
Efficiency vs. Alignment: Investigating Safety and Fairness Risks in Parameter-Efficient Fine-Tuning of LLMs
arxiv.org·14h
🔄ONNX
Flag this post
Enforcing Architecture in an Agent-Driven Codebase
phoebe.work·1d·
Discuss: Hacker News
🏗️Build Optimization
Flag this post
Fine-Tuning an AI – Part I
zwischenzugs.com·1h
🤖AI Coding Tools
Flag this post
Automated Defect Prediction via Cross-Entropy Regularized Graph Neural Networks for Microservice Architectures
dev.to·17h·
Discuss: DEV
ONNX Runtime
Flag this post
Writing an LLM from scratch, part 26 – evaluating the fine-tuned model
gilesthomas.com·23h·
Discuss: Hacker News
💡LSP
Flag this post
AI Infrastructure as Code - Automating AI Model Deployment and Scaling in Cloud Environments
dev.to·8h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
Engineering.ai: A Platform for Teams of AI Engineers in Computational Design
arxiv.org·14h
🔄ONNX
Flag this post