🤖 AI - samfundev · Scour

🌐web development GitHub·

RimantasZ/contextspy: Context profiler for LLMs and AI agents - used to introspect context contents and reduce token costs

Covers 5 stories including FastAPI

Covered by indiehacker.news

Discussed on Hacker News

🥟Bun lector.dev·

Show HN: Evaluating Local LLMs as language translators for my app

Discussed on Hacker News

🥟Bun akarouter.dev·

Flat per-call LLM API gateway (20x cheaper than Claude Max)

Discussed on Hacker News

🔧technical deep dives portal.neuralwatt.com·

Neuralwatt: Energy-based pricing for AI inference. Efficient prompts cost less

Discussed on Hacker News

🔧technical deep dives ianbarber.blog·

LLMs Are Complicated Now

Discussed on Hacker News

🥟Bun huggingface.co·

225B-A23B

Covered by news.smol.ai

Discussed on r/LocalLLaMA

🌐web development ludion.ai·

WebGPU feature detection was not enough to run small LLMs on phones

Discussed on Hacker News

⚡performance optimization youtube.comVideo·

Musician correctly predicts rise of local LLMs

Discussed on Hacker News

💻personal programming project explainers didon.appVideo·

Show HN: Didon – AI workday reports for productivity analysis

Discussed on Hacker News

🥟Bun Anyscale blog posts·

High Performance Distributed Inference with Ray Serve LLM

Covered by Google Cloud Blog

Discussed on Hacker News

🥟Bun youtu.beVideo·

Two Word docs talking to each other via local LLMs — what real use cases would you actually want?

Discussed on r/LocalLLaMA

🔧technical deep dives Electrek·

Tesla plans to sell modular AI data center hardware called ‘Megapod’

Covered by hardware.slashdot.org

Discussed on Hacker News

💻personal programming project explainers fareedkhan-dev.github.io·

Train LLM from Scratch

Discussed on Hacker News

🔧technical deep dives rocm.blogs.amd.com·

Unlocking Extreme AMD Instinct Inference with Software-Hardware Co-Optimization

Discussed on Hacker News

⚡performance optimization GitHub·

Show HN: Phileas – Local-first long-term memory for the AI you chat with

Covers 2 stories including Model Context Protocol And OAuth

Discussed on Hacker News

🥟Bun unsloth.ai·

GLM-5.2 – How to Run Locally

Covers 2 stories including GitHub here . You can follow the build instructions below as well. Change -DGGML_CUDA=ON to -DGGML_CUDA=OFF if you don't have a GPU or just want CPU inferen...

Covered by news.smol.ai

Discussed on Hacker News

🔧technical deep dives lesbarclays.substack.com·

What Is the Return on Tokens?

Covers 6 stories including Statement on the US government directive to suspend access to Fable 5 and Mythos 5

Discussed on Substack

💻personal programming project explainers speechmark.co·

On-device meeting notes for Mac (no bot, no cloud)

Discussed on Hacker News

🥟Bun konxios.com·

Show HN: Konxios a local first AI OS that connects LM Studio, Ollama and cloud

Discussed on Hacker News

🌐web development brightray.ai·

Built Uber aggregator that tracks top AI researchers and leaders

Discussed on Hacker News

Log in to enable infinite scrolling