🤖 AI Engineering - aaaaa · Scour

mirkolenz/llmhop: Tiny, stateless Go router that dispatches OpenAI-compatible requests to single-model vLLM and sglang backends with zero external dependencies

🧠LLMs Code

github.com··Hacker News

AI Governance Tools: How To Achieve Compliance and Visibility

⚖️AI Ethics Blog

Central Bank strengthens data governance for AI solutions

⚙️MLOps News

How we fight GPU scarcity without compromise

🧠LLMs Blog

equixly.com··Hacker News

Youssof Altoukhi (@Youssofal_)

xcancel.com··r/LocalLLaMA

Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good

🧠LLMs Blog

towardsai.net·

Google's new open model DiffusionGemma generates text from noise instead of word by word

the-decoder.com

·

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

🧠LLMs Code

github.com··r/LocalLLaMA

CLP: Collocation-Length Prediction for Zero-Loss Adaptive Multi-Token Inference

🧠LLM Inference Academic

Unlocking AI flexibility in Europe: A guide to cross-region inference for EU data processing and model access

⚙️MLOps Blog

aws.amazon.com·

Understanding Agentic AI Infrastructure

⚖️AI Ethics Blog

Six Proto6 Vulnerabilities in protobuf.js Expose Node.js Apps to RCE and DoS

thehackernews.com·

ADATA Memory and Storage Products at Computex 2026

techpowerup.com·

For Robotaxis, Safety Must Be Built In, Not Bolted On

🧠LLMs Blog

blogs.nvidia.com·

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

🤖Machine Learning Code

github.com··Hacker News

The Practitioner’s Guide to AgentOps

machinelearningmastery.com·

Intel aims Crescent Island at inference

🧠LLM Inference

jonpeddie.com·

ASTRA-sim 3.0: Next-Level Distributed Machine Learning Simulations via High-Fidelity GPU and Infrastructure Modeling

🧠LLM Inference Academic

Qualcomm Announces On-Device AI Claw Ecosystem Plan

🧠LLM Inference

autonews.gasgoo.com·

FinOps FOCUS specification becomes the common language for AI cost accountability

🕵️Fraud Detection

siliconangle.com·

Sign up or log in to see more results

Log in to enable infinite scrolling