💬 LLMs - sarah · Scour

🎓eLearning baseten.co·

Baseten raised a $1.5B Series F and achieved a $13B valuation

Discussed on Hacker News

📚LMS marble.onl·

There is minimal downside to switching to open models

Covered by tldr.tech

Discussed on Hacker News

🤖AI ianbarber.blog·

LLMs Are Complicated Now

Discussed on Hacker News

🤖AI lmsys.org·

DFlash and Spec V2 Decoding (14 minute read)

Covers 6 stories including Looking for a self-hosted alternative to Modal.com for running ML workloads

Discussed on Hacker News

🤖AI GitHub·

Show HN: Alloy – a PyTorch backend and inference engine for Apple Silicon

Discussed on Hacker News

🧠Agentic AI moorcheh.ai·

Information-Theoretic Vector Search Is Having Its Moment

Covered by GitHub

Discussed on Hacker News

🔧Hardware auriko.ai·

Quantifying LLM Cost Savings from Cache-Aware Inference Routing

Discussed on Hacker News

🔧Hardware XDA·

I tested Google's new Gemma 4 12B on my 8GB GPU, and now I don't want to go back to smaller models

Discussed on Hacker News

🧠Agentic AI Hacker News·

The AI Conundrum: We are living in highly subsidized, interesting times

Discussed on Hacker News

🔧Hardware Anyscale blog posts·

High Performance Distributed Inference with Ray Serve LLM

Covered by Google Cloud Blog

Discussed on Hacker News

🧠Agentic AI aircityshops.com·

Zero Weights Graph Language Engine (MSE-GLM)

Discussed on Hacker News

🏫AI in Education nextweekai.com·

How to Build ChatGPT from Scratch: Understanding LLMs Step by Step

Covered by JavaScript Development Space

Discussed on Hacker News

🧠Agentic AI av.codes blog·

On local inference

Discussed on Hacker News

📚LMS huggingface.co·

NovaVest/VN-Noxa-v1-7B-Beta-Low

Discussed on Hacker News

📚LMS wattfare.com·

LLM API that's paid by users, not dev

Discussed on Hacker News

📚LMS GitHub·

Show HN: Loqi, a "local-first" translation tool using Ollama/llama.cpp

Covers Ollama

Discussed on Hacker News

🤖AI venturebeat.com·

Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost

Covers 8 stories including GLM-5.2 (6 minute read)

Covered by 4 sources including vettedconsumer.com, AI Changes Everything

🏫AI in Education teachmecoolstuff.com·

Fine Tuning a Tiny Local LLM to Categorize Questions

Discussed on Hacker News

📚LMS arxiv.org·

The Benchmark Illusion: Pruned LLMs Can Pass Multiple Choice but Fail to Answer

Discussed on Hacker News

🤖AI unsloth.ai·

GLM-5.2 – How to Run Locally

Covers 2 stories including GitHub here. You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...

Covered by news.smol.ai

Discussed on Hacker News
