🧠 LLMs - tfriedel · Scour

LLM Cheat Sheet

✍️Prompt Engineering Blog

drkpxl.bearblog.dev·

lightmetal: GPU LLM Inference From a Single Java 25 JAR

⚙️MLOps Blog

adambien.blog·

local llm on laptop 780M GPU using llama + gemma 4 qat

⚙️MLOps Blog

alper.bearblog.dev·

Google open-sources speedy DiffusionGemma text diffusion model

📝Active Learning

siliconangle.com·

Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering

⚙️MLOps Academic

Ask HN: Is it feasible to run a model on device for complete privacy?

⚙️MLOps Discussion

news.ycombinator.com··Hacker News

WWDC 2026: Foundation Models (& Anarlog)

skushagra.com·

Mother sues OpenAI: chat logs show GPT-4o discussed suicide with her daughter

✍️Prompt Engineering

Intelligent inference scheduling with llm-d on Red Hat AI

✍️Prompt Engineering

developers.redhat.com·

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

everylocalai.com··DEV

LangChain Explained: Understanding Models, Prompts, Chains, Memory, Indexes, and Agents

🤖AI Agents Blog

towardsai.net·

LLM Routing: From Strategy Selection to Production Architecture

⚙️MLOps Blog

Introducing the Third Generation of Apple’s Foundation Models

machinelearning.apple.com··Hacker News, r/apple

Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training

🤖AI Agents Academic

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

🔌MCP Blog

adambien.blog·

google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation

huggingface.co··r/LocalLLaMA

fix(opencode-go): add qwen plus tiered pricing (#91351)

⚙️MLOps Code

·

Google’s DiffusionGemma is 4x faster than its other Gemma models

📝Active Learning

thenewstack.io·

What's in the Box? A Field Guide to AI Models

⚙️MLOps Blog

iankduncan.com·

Apple's Foundation Models can now use third-party LLMs (Claude, Gemini) [video]

developer.apple.com··Hacker News

Log in to enable infinite scrolling