I’ve been working since they came out, and finally, they’re surprisingly good now. I have a 2022 M2 Mac with 64 GB RAM and 1TB storage and I’ve used , as well as a number of other Qwen variants like across like raw llama.cpp with llama-cpp-python Ollama llamafiles and LM Studio Where are local models now? Early on, models were slow, hard to use, and just not that accurate for most programming tasks. The idea that local models were severely lagging behind was largely true until, for me, the re... Read more ›
Linux 7.2 has finally eliminated the strncpy API from the Linux kernel. The strncpy() function for copying up to a specified number of bytes has long been deprecated and after six years of work and hundreds of patches, no more users of the strncpy within the Linux kernel remained that it has now been eliminated... Read more ›
An “export control directive” for Anthropic’s Fable and Mythos models highlights the chaotic, fast-changing state of AI regulation. Read more ›
One of the reasons little progress was made on the Power Macintosh emulation in MAME for a long time is that it’s very tedious to debug. There’s a lot of code surface, it’s written in 3 languages (PowerPC, emulated 680×0, and compiled FORTH), and I’m not as familiar with the innards of the newer stuff like the Code Fragment Manager as I am with the behavior of the 680×0 codebase. So, this being 2026, I asked Claude Code if it could control and debug MAME. It came back with “yes, with limitati... Read more ›
— , The White House Is Ratcheting Up Its War Against Anthropic Tags: <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" Read more ›
We've all heard people say that Qwen is near-Sonnet level, or near-Opus, but I have receipts and am here to be transparent with you. Read more ›
First, I want to tell you how exactly I got to this point and why I started researching different options for handling asynchronous I/O on Linux… Last year, my students and I built a reverse proxy server called TinyGate. It was super simple, worker-based, and it basically worked well. Of course, I didn’t expect it to be very fast, but it was an educational project, and since we’d made a real, kind of production-ready tool, I was really proud of it. But my students weren’t as happy as I was - ... Read more ›
prefeitura-rio/Rio-3.5-Open-397B is presented as an original 397B model trained by IplanRIO. It is not. Its weights are a direct element-wise merge of our model, Nex, with the official Qwen3.5-397B... Read more ›
Systemd 261 is out as stable today with a number of new features and ready to coincide with H2'2026 Linux distributions... Read more ›
Tense calls between Anthropic's CEO and administration officials on Friday underscore how the White House is wrestling with advanced AI models. Read more ›
AI helping pharmaceutical researchers query decades of information buried in PDF reports Read more ›
How KV cache compression - from MQA and GQA to MLA and linear-attention hybrids - quietly unlocked the long context windows that make modern agentic LLMs possible. Read more ›
I don't know about good, I use a lot of local models and they're still pretty painful to run locally Read more ›
- nothing really impressive, but definitely a helpful tool for a maintainer. I think I would be using it much more, if I didn't have to spend a lot of my time on reviewing PRs. Currently, I have a very lightweight harness - the pi agent with everything stripped (pi -nc --offline) and to align it a bit with my style. — , Hacker News comment on by Boykis Tags: <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" Read more ›