⚙️ MLOps
Keywords: model serving, inference, ML pipelines, model monitoring
Scoured 149,949 posts in 11.3 ms
Dockerizing ML Models: A Data Engineer's Guide to Model Serving
🧠 LLMs · medium.com · 4d

Why Most ML Models Die After the Notebook (And How to Fix It)
🧠 LLMs · photokheecher.medium.com · 19h

AgentOpt v0.1 Technical Report: Client-Side Optimization for LLM-Based Agent
🤖 AI Engineering · arxiv.org · 1d

Building Scalable AI Workflows with Vertex AI Pipelines
🤖 AI Engineering · medium.com · 2h

MLOps in 2026: What Is It and Why Should You Care?
🤖 AI Engineering · flexiana.com · 23h

Inference Arena – new benchmark of local inference and training
📊 Benchmarking · kvark.github.io · 4d · Hacker News

Benchmarking LLMs with Marimo Pair
🧠 LLMs · ericmjl.github.io · 14h · Hacker News

The case for Model-as-a-Service over self-managed inference
🧠 LLMs · news.ycombinator.com · 3d · Hacker News

Model Packaging Tools Every MLOps Engineer Should Know
🧠 LLMs · freecodecamp.org · 3d

benchmarking inference of popular models on consumer hardware
📊 Benchmarking · inferena.tech · 5d · Hacker News

I Built a Production MLOps Platform from Scratch: Kubeflow, Kafka, Terraform, and Live on GCP
☸️ Kubernetes · medium.com · 6d

Overcoming inference challenges
🤖 AI Engineering · redhat.com · 3d

LLM inference engine from scratch in C++
🧠 LLMs · anirudhsathiya.com · 4d · Hacker News

Show HN: Pre-training, fine-tuning, and evals platform
🤖 AI Engineering · oumi.ai · 6d · Hacker News

vLLM introduces memory optimizations for long-context inference
🧠 LLMs · github.com · 5d · Hacker News

Automate Your Data + ML Pipelines With Apache Airflow
🤖 AI Engineering · gitanjalisoni.medium.com · 5d

Fast Heterogeneous Serving: Scalable Mixed-Scale LLM Allocation for SLO-Constrained Inference
🧠 LLMs · arxiv.org · 8h

Awesome Open Source AI
🔬 AI Research · awesomeosai.com · 5d · r/SideProject

Blink: CPU-Free LLM Inference by Delegating the Serving Stack to GPU and SmartNIC
🧠 LLMs · arxiv.org · 8h

ai-infos/vllm-gfx906-mobydick: A high-throughput and memory-efficient inference and serving engine for LLMs - Optimized for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60
🧠 LLMs · github.com · 4d · r/LocalLLaMA