🤖 AI - barisamiw · Scour

SLUUG Talk: Demystifying Large Language Models on Linux

🤖ML Code

github.com · DEV

Report: GKE Inference Gateway delivers up to 92% faster AI responses

🏗️Data Engineering Blog

cloud.google.com · Hacker News

Using Probabilistic Programs to Train Inductive Reasoning in Large Language Models

🔀Transformers Academic

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

🧭Vector Databases

everylocalai.com · DEV

AI inference: what it is and why it matters for product managers

🔀Transformers

marcabraham.com

Inferoa AI harness claimed 90% cache savings. We ran it and measured 97.8%

📈Time Series

zozo123.github.io · Hacker News

Using Scikit-LLM with Open-Source LLMs

🔧Feature Engineering

machinelearningmastery.com

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

🔀Transformers Blog

blogs.nvidia.com

A system programmer’s guide to LLM inference

🤖ML Blog

blog.xiangpeng.systems · Hacker News

Stop Wasting GPU Budget: Autoscaling AI Inference on Kubernetes with KEDA

📈Time Series

cloudnativenow.com

Build a Medical Report Analyzer on Dedicated Inference with Python

🔀Transformers

digitalocean.com

Fine tuning classification in Elixir

elixirstatus.com

Using local LLMs for agentic coding

🔀Transformers Blog

blog.alexewerlof.com

End-to-end encrypted ML inference with Amazon SageMaker AI and FHE

🔧Feature Engineering Blog

aws.amazon.com

Discrete Diffusion Modelling by Estimating the Ratios of the Data Distribution

🤖ML News Blog

leetarxiv.substack.com · Substack, r/programming

How LLMs work | Practical Leaders

🔀Transformers

practical-leaders.com · Hacker News

Why LLMs (still) lack taste

🎮Reinforcement Learning

beyondtheprior.com · Hacker News

Conversational AI vs generative AI: What's the difference?

🔀Transformers

LeLab Is Hugging Face’s New Browser-Based GUI for the LeRobot Ecosystem

🔀Transformers News

lightmetal: GPU LLM Inference From a Single Java 25 JAR

🤖ML Blog

adambien.blog
