ierdna's Feed · Scour

the art of programming why i was wrong about llm

llm can't replace competence Read more ›

Some Complexity Results for Robustness Verification for Binarized Neural Networks

This paper studies the computational complexity of verification problems for Binarized Neural Networks (BNNs), where activations (and sometimes weights) are binary. We analyze two problems: satisfiability and robustness under uniform image occlusion. We show that BNN satisfiability is NP-complete via a reduction from Boolean satisfiability problem (SAT), and that uniform occlusion induces a piecewise-constant structure in the network output, ena... Read more ›

🏠selfhosting Sébastien Dubois·

Docker Compose for AI

Docker Compose now treats AI models as first-class application components via a top-level `models:` element, so you can wire models, agents, and tools into one declarative file and bring them up with a single `docker compose up`. The required field is `model:` (the OCI artifact pulled and run by Doc Read more ›

🏗️Systems Design medium.com

ML System Design interview questions I wish I knew before my interviews

I’ve sat in machine learning System Design interviews where the candidate jumped straight into building a distributed feature store with… Read more ›

🔬science Inquiry Into Inquiry·

Information = Comprehension × Extension • Preamble

Perhaps the best perspective from which to being the connection between the theory of signs and the theory of inquiry into its proper focus is Peirce’s own Theory of Information, which he began setting forth in lectures at Harvard and … → Read more ›

🤖LLMs The Decoder

Sam Altman says a whole generation of researchers held AI back by underestimating what scaling could do

At a Stanford talk, Sam Altman defended LLM scaling and hit back at skeptics, saying a whole generation of researchers slowed the field by underestimating what scaling could do. He cited OpenAI's recent disproof of a mathematical conjecture as evidence. The article appeared first on . Read more ›

🏠selfhosting techrepublic.com·

Ditch the Cloud: 5 Best Self-Hosted Photo Video-Gallery Options

Looking to store your media privately? Explore five top self-hosted photo and video gallery solutions for full control and customization. The post appeared first on <a href=" Read more ›

💻computer science arxiv.org·

The algebra of Krom logic programs

This paper investigates the algebraic structure of Krom logic programs, consisting only of facts and rules with at most one body atom. We show that sequential composition endows the class of Krom programs with a natural monoid structure and that this structure admits rich algebraic extensions to Krom seminearrings, Krom quemirings, Krom-Conway seminearrings, and Krom-Conway omegaseminearrings. Furthermore, we establish explicit generating sets a... Read more ›

🤖LLMs tokoscope.com·

Automatic LLM token compression and cost monitoring in 2 lines

Audit, compress and monitor your LLM token usage. Most teams cut API costs 40-60% with two lines of code. Read more ›

Discussed on Hacker News

🏗️Systems Design arxiv.org·

Architecture Carbon Tool v3: Enabling Sustainability-aware Silicon System Design Exploration

As the carbon cost of manufacturing and operating semiconductor devices has come into sharper focus, sustainability has gradually emerged as a new system architecture design metric. Like power and performance modeling tools, enabling sustainability-aware silicon systems design and optimizations will require a new generation of electronic design automation and architectural modeling tools. Towards this end, we present an update to the Architectur... Read more ›

🏠selfhosting medium.com

Building an Active Directory IAM and Access Control Homelab with Windows Server

To strengthen my practical understanding of identity and access management, I built an Active Directory homelab using Windows Server 2019… Read more ›

🤖LLMs Silicon Opera·

Context Windows Are Bigger and Dumber Than You Think

The Number You See Is Not What You Get When Anthropic announced Claude’s 200,000-token context window, or when Google unveiled Gemini 1.5 Pro with a million-token window, the coverage treated it as straightforward progress. More tokens in, more capability out. The framing makes intuitive sense: if a model can see more text at once, it should be able to reason about more text at once. This is not quite right. Context window size and context window effectiveness are two different things, and th... Read more ›

🏗️Systems Design arxiv.org·

AoiZora: Topology-Aware Auto-Parallel Optimization for Inference of Diffusion Transformers

Video diffusion has quickly grown into a key generative serving workload, yet producing each clip demands many denoising iterations over large spatio-temporal latents, which puts low-latency inference out of reach on a single device. A denoising step is therefore typically distributed across multiple accelerators, and TPU sub-slices have become an attractive and practical fabric for doing so. Current auto-parallel systems, however, search almost... Read more ›

🏠selfhosting RasterWeb!·

TrueNAS Update

Things are working, and sometimes that's a lot... Read more ›

Covers Jellyfin: The Free Software Media System

🤖LLMs medium.com

LLM Evaluation 101: Why You Can't Test an LLM Like You Test Your Code

Software testing is deterministic. LLMs aren't. Here's the mental shift you need before you can evaluate any LLM application. Read more ›

🤖LLMs Fortune

NBC’s Tom Llamas climbed from 15-year-old intern to the top anchor chair—and still isn’t satisfied: ‘If you’re not growing, you’re dying’

NBC Nightly News anchor Tom Llamas shares his career advice for Gen Z, work-life balance philosophy, and why success starts with hustle. Read more ›

🏠selfhosting XDA·

My home server now hosts my entire ebook collection, and this free container makes managing everything painless

It turned my home server into a personal digital library Read more ›

🤖LLMs medium.com

Retrieval Augmented Generation (RAG) in Large Language Model(LLMs)

Explore how Retrieval Augmented Generation (RAG) is revolutionizing the precision of responses from large language models (LLMs) such as… Read more ›

🏠selfhosting lemmy.world·

Why We Chose Self-Hosting Over Cloud APIs

submitted by While building PolyTalk, one of the biggest decisions we faced was whether to rely on cloud APIs or keep everything self-hosted. At first, cloud services seemed like the obvious choice. They make it easy to get started and remove a lot of operational overhead. But the deeper we got into the project, the more we realized that self-hosting wasn’t just a deployment preference, it was a requirement for many of the use cases we were exploring. A few things we learned along the way: Ru... Read more ›

🤖LLMs Elezea by Rian van der Merwe·

I am dreading our LLM-written incident report future

LLM-generated incident reports pose unique dangers because their plausible-sounding errors lack immediate verification mechanisms that catch problems in code… Read more ›