akapaka's Feed

Show HN: Karpenter Provider for Hetzner

An open-source Karpenter provider for Hetzner Cloud: cost-optimal Kubernetes node autoscaling that launches the cheapest server type for the pending pods. Read more ›

Covers Karpenter

Discussed on Hacker News

🧠Local llm GitHub·

Running a 35B MoE model on a 2017 AMD RX 580 8GB via Vulkan (no ROCm/CUDA)

Complete guide to running local AI on AMD RX 580 8GB via Vulkan — llama.cpp, Ollama, OpenWebUI, Stable Diffusion. No CUDA. No cloud. Free. - aivisionslab-studios/rx580-local-ai-guide Read more ›

Discussed on Hacker News

⎈Helm theregister·

2,000 retired Google Pixel phones get a second life as a private cloud

You might say the system packs two kilapixels of compute Read more ›

Covers A low-carbon computing platform from your retired phones

Discussed on Hacker News

🦀Rust grigio.org·

Bun 1.4: The Controversial AI-Driven Rewrite from Zig to Rust

In May 2026, the Bun team did something the software industry has been whispering about for years: they rewrote their entire runtime from Zig to Rust. Not over the course of a year with a dedicated team. In six days. Using AI agents. At nearly a million lines of code, Read more ›

Discussed on Hacker News

📊Prometheus GitHub·

Show HN: Cerberus – PromQL/LogQL/TraceQL Query APIs Backed by ClickHouse

Drop-in Prometheus / Loki / Tempo HTTP gateway for ClickHouse. Translate PromQL, LogQL, and TraceQL into optimized CH SQL — keep Grafana, swap the backend. - tsouza/cerberus Read more ›

Covers SLSA • Supply-chain Levels for Software Artifacts

Discussed on Hacker News

🕸️WebAssembly gloathub.org·

Gloat compiles Clojure and YAMLScript to Go code, native binaries and WASM

Compile Clojure and YAMLScript to Go, native binaries, Wasm and more Read more ›

Discussed on Hacker News

🤖Machine Learning ianbarber.blog·

LLMs Are Complicated Now

Back in 2022 and 2023 there were two big branches of machine learning happening at Meta1. The LLM work that led to Llama was a clean, smooth stack of repeated Transformer modules; the recommendatio… Read more ›

Discussed on Hacker News

🦕Deno hackernoon.com·

Comparing Dependency Management Models of npm, Yarn, pnpm, Bun, and Deno

Modern JavaScript package managers solve dependency management in fundamentally different ways. npm prioritizes compatibility but creates disk bloat, pnpm optimizes storage through content-addressable linking, Yarn Berry removes node_modules with Plug’n’Play, Bun focuses on installation speed, and Deno rethinks dependency fetching entirely. The right choice depends on monorepo scale, performance, tooling, and ecosystem maturity. Read more ›

🔌Model Context Protocol microsoft.com·

AutoJack: How a single page can RCE the host running your AI agent

AutoJack is a novel exploit chain showing how a single malicious webpage can turn an AI browsing agent into a remote code execution vector on the host machine. By abusing trust in localhost, missing authentication, and unsafe parameter handling, attackers can trigger arbitrary process execution through AutoGen Studio’s MCP WebSocket. The research highlights a broader pattern - when agents can browse untrusted content and access local services, traditional boundaries like localhost are no long... Read more ›

Covered by 5 sources including This Week In 4n6, thehackernews.com

Discussed on Hacker News

📝SQLite WAL hackernoon.com·

The Architecture of Local-First AI Memory: No Cloud, No Keys, No Read-Time LLMs

PMB is a local-first memory system for AI agents that stores knowledge in SQLite and LanceDB, avoids LLM calls on the read path, and prioritizes fast, deterministic retrieval. This article explores the storage model, asynchronous write path, hybrid retrieval architecture, memory lifecycle management, and the design principles behind persistent agent memory that remains fully under user control. Read more ›

🧠LLM Inference Anyscale blog posts·

High Performance Distributed Inference with Ray Serve LLM

Learn how Ray Serve LLM + vLLM stack achieves up to 24x higher throughput with direct streaming, HAProxy integration, and a new vLLM Ray executor backend. Read more ›

Covered by Google Cloud Blog

Discussed on Hacker News

🏠Self-Hosting hgsocket.com·

Gsocket Alternative, Hgsocket

Like gsocket.io but self-hosted. Remote shell to any machine — no public IP, no port forwarding, no VPN. One command install. Read more ›

Discussed on Hacker News

🧠Local llm huggingface.co·

Cheapest way to run GLM 5.x locally that's not a unified memory system?

We’re on a journey to advance and democratize artificial intelligence through open source and open science. Read more ›

Discussed on r/LocalLLaMA

☸️Kubernetes postfinance.github.io·

TOPF

Early Stage Project TOPF has just been released and is in an early stage of development\. While it is actively used at PostFinance, APIs, configuration formats, and CLI flags may change between releases\. Feedback and contributions are welcome — please open an issue if you run into problems or have suggestions\. Get Started ## What TOPF does TOPF is a single binary that handles the full lifecycle of a Talos cluster: - **Apply configurations** with pre-flight health checks, dry-run diffs... Read more ›

Covers Talos Linux

Discussed on Hacker News

👁️Observability jakkaru.de·

Disclosure of Vulnerabilities in Fox ESS Cloud Infrastructure

Jakkaru proactively defeats threats with penetration testing, ensuring confidence in your business’s future Read more ›

Discussed on Hacker News

🤖Qwen anbeeld.com·

7900XTX 24GB vram, can finally fit Q6K+MTP with Qwen 3.6 27B at 131k context

Tests on Qwen 3.6 27B show why TurboQuant is overrated but saved by TCQ, q5 deserves more attention, and symmetric q8 might be a waste of VRAM. Read more ›

Discussed on r/LocalLLaMA

🦀Rust microsoft.github.io·

Rust for C#/.NET Developers

This is a (non-comprehensive) guide for C# and .NET developers that are completely new to the Rust programming language. Some concepts and constructs translate fairly well between C#/.NET and Rust, but which may be expressed differently, whereas others are a radical departure, like memory management. This guide provides a brief comparison and mapping of those constructs and concepts with concise examples. Read more ›

Discussed on Hacker News

🕸️WebAssembly guaracloud.github.io·

Purple Wolf – A fast, verifiable WAF for Traefik

Traefik WASM WAF with signed artifacts, SBOMs, benchmark evidence, and monitor-first Kubernetes rollout. Read more ›

Discussed on Hacker News

⎈Helm diogocapela.com·

Stop reaching for microservices. You are not Netflix

June 17, 2026 Read more ›

Discussed on Hacker News

🤖Machine Learning arxiv.org·

Behind Python: The Languages That Power AI

Python dominates AI development, yet the numerical work behind frameworks like PyTorch and NumPy is executed in C, C++, or Rust. When a developer must implement an algorithm without such libraries -- because none exists, the target is resource-constrained, or a new system is being built -- which language should they choose? This paper answers that question empirically. Five algorithms covering data mining (k-means), machine learning (k-NN), neur... Read more ›

Discussed on Hacker News