⚙️ MLOps - hop1.ng.1357 · Scour

AI Observability for Large Language Model Systems: A Multi-Layer Analysis of Monitoring Approaches from Confidence Calibration to Infrastructure Tracing 🛡️AI Safety

DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles 📱Edge AI Optimization

lmsys.org·5d·Hacker News

carlovalenti/TRiP: A complete transformer engine in C — inference, training, chat, vision. ✨Gemini

github.com·1d·Hacker News, r/C_Programming

Building Semantic Version Control in Rust ⚙️Compilers

therohansharma.com·5d·Hacker News

Progressive Semantic Communication for Efficient Edge-Cloud Vision-Language Models ⚡Edge AI

How we built the most performant DeepSeek V3.2, MiniMax-M2.5 and Qwen 3.5 397B on DigitalOcean NVIDIA HGX™ B300 GPU Droplets 📱Edge AI Optimization

digitalocean.com·2d

RaMP: Runtime-Aware Megakernel Polymorphism for Mixture-of-Experts 📱Edge AI Optimization

Rcarmo/gte-go: Golang inference for the GTE Small embedding model 🤖LLM

github.com·5d·Hacker News

Identifying the Achilles' Heel: An Iterative Method for Dynamically Uncovering Factual Errors in Large Language Models 🤖LLM

FlowBot: Inducing LLM Workflows with Bilevel Optimization and Textual Gradients ✨LLMs

PAINT: Partial-Solution Adaptive Interpolated Training for Self-Distilled Reasoners ⚗️Knowledge Distillation

Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora ✨LLMs

Benchmarking the Safety of Large Language Models for Robotic Health Attendant Control 🛡️AI Safety

A Survey on Split Learning for LLM Fine-Tuning: Models, Systems, and Privacy Optimizations ✨LLMs

Efficient, VRAM-Constrained xLM Inference on Clients 📱Edge AI Optimization

Scalable Inference Architectures for Compound AI Systems: A Production Deployment Study 📱Edge AI Optimization

LLM Psychosis: A Theoretical and Diagnostic Framework for Reality-Boundary Failures in Large Language Models ✨LLMs

Optimization of Model Splitting, Placement, and Chaining for Multi-hop Split Learning and Inference 📱Edge AI Optimization

LAF-Based Evaluation and UTTL-Based Learning Strategies with MIATTs 🧠Machine Learning

ClawGym: A Scalable Framework for Building Effective Claw Agents 🕹️Agentic AI

Log in to enable infinite scrolling