🧠 LLMs - kevincrane · Scour

Google's new open-weights model brings image-generation tricks to AI text generation

🤖AI Engineering News

theregister.com··Hacker News

LLM are universal simulators

🛡️AI Safety

invertedpassion.com··Hacker News

Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

🤝AI Agents Discussion

news.ycombinator.com··Hacker News

DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200

🤖AI Engineering News

newsletter.semianalysis.com··Hacker News·Cited by 1 article

Franklin Templeton, BNP Paribas see tokenization boosting EU's capital efficiency

🖥️Backend Development

cointelegraph.com·

Acoda: Adversarial Code Obfuscation for Defending against LLM-based Analysis

🔍RAG Academic

How LLMs Actually Work: A Friendly Map for Humans • oreoro

oreoro.github.io··Hacker News

Making FlashAttention-4 faster for inference

📐System Design Blog

modal.com··Hacker News

TradFi advisors want stablecoins, tokenization over Bitcoin: Bitwise

📐System Design

cointelegraph.com·

Running Qwen 35B MoE at 450k Context on a Single 32GB GPU

📐System Design

local-llm.utop.workers.dev··Hacker News·Cited by 1 article

LLM Cheat Sheet

🔍RAG Blog

drkpxl.bearblog.dev·

TOON: Beyond JSON for LLMs

🤖AI Engineering Blog

towardsai.net·

NVIDIA RTX Pro 6000 Blackwell: 96GB GDDR7 and the End of VRAM Anxiety

🤖AI Engineering Blog

fitservers.com·

Does ChatGPT need a psychiatrist? Similarities between human psychopathology and errors in large language models

🔍RAG Academic

nature.com··Hacker News

My Notes on the Progression from Context to Prompt to Harness engineering in making GPT LLMs Useful: (TUESDAY) MAMLMs

🤖AI Engineering News Blog

braddelong.substack.com

Show HN: BeamWeaver – LangChain/DeepAgents-style agents and workflows for Elixir

🤖AI Engineering Code

github.com··Hacker News·Cited by 1 article

ChatGPT easily bypasses its own guardrails; all LLMs are inherently unsafe

🛡️AI Safety Blog

NVIDIA A100 vs RTX 4090 for AI Workloads: The Cost Per FLOP Reality

📐System Design Blog

fitservers.com·

Token4Token — pay-per-token inference on Gnosis + Swarm

🤖AI Engineering

t4t.eth.link··Hacker News

Latest technical articles & videos.

🤖AI Engineering

certdepot.net·
