LLMs

large language models, foundation models, transformer, GPT

Feeds to Scour
SubscribedAll
Scoured 1043 posts in 7.9 ms

LLM Cheat Sheet

 ✍️Prompt Engineering  Content type: Blog
drkpxl.bearblog.dev·

lightmetal: GPU LLM Inference From a Single Java 25 JAR

 ⚙️MLOps  Content type: Blog
adambien.blog·

local llm on laptop 780M GPU using llama + gemma 4 qat

 ⚙️MLOps  Content type: Blog
alper.bearblog.dev·

Google open-sources speedy DiffusionGemma text diffusion model

 📝Active Learning
siliconangle.com·

Comprehensive evaluation of LLM capabilities for interpretation and analysis of genome-scale metabolic models in metabolic engineering

 ⚙️MLOps  Content type: Academic
biorxiv.org·

Ask HN: Is it feasible to run a model on device for complete privacy?

 ⚙️MLOps  Content type: Discussion

WWDC 2026: Foundation Models (& Anarlog)

 🔌MCP
skushagra.com·

Mother sues OpenAI: chat logs show GPT-4o discussed suicide with her daughter

 ✍️Prompt Engineering
ppc.land·

Intelligent inference scheduling with llm-d on Red Hat AI

 ✍️Prompt Engineering
developers.redhat.com·

Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA

 ⚙️MLOps
everylocalai.com··DEV

LangChain Explained: Understanding Models, Prompts, Chains, Memory, Indexes, and Agents

 🤖AI Agents  Content type: Blog
towardsai.net·

LLM Routing: From Strategy Selection to Production Architecture

 ⚙️MLOps  Content type: Blog
blog.n8n.io·

Introducing the Third Generation of Apple’s Foundation Models

 🤖AI Agents

Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training

 🤖AI Agents  Content type: Academic
arxiv.org·

147th airhacks tv: Local LLMs, LightMetal, ZSmith Agents, AI Rails, Saving Tokens

 🔌MCP  Content type: Blog
adambien.blog·

google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation

 ⚙️MLOps

fix(opencode-go): add qwen plus tiered pricing (#91351)

 ⚙️MLOps  Content type: Code
github.com
·

Google’s DiffusionGemma is 4x faster than its other Gemma models

 📝Active Learning
thenewstack.io·

What's in the Box? A Field Guide to AI Models

 ⚙️MLOps  Content type: Blog
iankduncan.com·

Apple's Foundation Models can now use third-party LLMs (Claude, Gemini) [video]

 🔌MCP

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help