caesarlsy's Feed

Running local models is good now

I’ve been working since they came out, and finally, they’re surprisingly good now. I have a 2022 M2 Mac with 64 GB RAM and 1TB storage and I’ve used , as well as a number of other Qwen variants like across like raw llama.cpp with llama-cpp-python Ollama llamafiles and LM Studio Where are local models now? Early on, models were slow, hard to use, and just not that accurate for most programming tasks. The idea that local models were severely lagging behind was largely true until, for me, the re... Read more ›

Covers 9 stories including Pi.dev: There are many coding agents, but this one is mine

Covered by 10 sources including Simon Willison's Newsletter, lemmy.ml

Discussed on Hacker News, Hacker News, and Lobsters

🐧Computing Systems Phoronix·

Linux Finally Eliminates The strncpy API After Six Years Of Work, 360+ Patches

Linux 7.2 has finally eliminated the strncpy API from the Linux kernel. The strncpy() function for copying up to a specified number of bytes has long been deprecated and after six years of work and hundreds of patches, no more users of the strncpy within the Linux kernel remained that it has now been eliminated... Read more ›

Discussed on Hacker News

⚖️AI Regulation The Conversation·

Why the US government shut down Anthropic’s latest Claude AI model

An “export control directive” for Anthropic’s Fable and Mythos models highlights the chaotic, fast-changing state of AI regulation. Read more ›

Covers 5 stories including Statement on the US government directive to suspend access to Fable 5 and Mythos 5

Covered by 4 sources including Anil Dash, oreilly.com

🎨GPU Computing rbelmont.mameworld.info·

I need your clothes, your boots, and your motorcycle

One of the reasons little progress was made on the Power Macintosh emulation in MAME for a long time is that it’s very tedious to debug. There’s a lot of code surface, it’s written in 3 languages (PowerPC, emulated 680×0, and compiled FORTH), and I’m not as familiar with the innards of the newer stuff like the Code Fragment Manager as I am with the behavior of the 680×0 codebase. So, this being 2026, I asked Claude Code if it could control and debug MAME. It came back with “yes, with limitati... Read more ›

Discussed on Hacker News

🤨AI Skepticism simonwillison.net·

Quoting Matteo Wong, The Atlantic

— , The White House Is Ratcheting Up Its War Against Anthropic Tags: <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" Read more ›

Covered by Simon Willison's Newsletter

🤖AI and Tactical Agents Alex Ellis' Blog·

Local Qwen isn't a worse Opus, it's a different tool

We've all heard people say that Qwen is near-Sonnet level, or near-Opus, but I have receipts and am here to be transparent with you. Read more ›

Covered by 4 sources including lemmy.ml, tldr.tech

Discussed on Hacker News, Lobsters, and r/LocalLLaMA

🐧Computing Systems sibexi.co·

Epoll vs. Io_uring in Linux

First, I want to tell you how exactly I got to this point and why I started researching different options for handling asynchronous I/O on Linux… Last year, my students and I built a reverse proxy server called TinyGate. It was super simple, worker-based, and it basically worked well. Of course, I didn’t expect it to be very fast, but it was an educational project, and since we’d made a real, kind of production-ready tool, I was really proud of it. But my students weren’t as happy as I was - ... Read more ›

Discussed on Hacker News

⚖️AI Regulation Marcus on AI·

What Washington must do

”The only way out is through” Read more ›

Covered by naked capitalism, The Torment Nexus

Discussed on Substack

🤖AI and Tactical Agents GitHub·

Rio-3.5-Open-397B ≈ 0.6 x Nex-N2_pro + 0.4 x Qwen · Issue #4 · nex-agi/Nex-N2

prefeitura-rio/Rio-3.5-Open-397B is presented as an original 397B model trained by IplanRIO. It is not. Its weights are a direct element-wise merge of our model, Nex, with the official Qwen3.5-397B... Read more ›

Covers 2 stories including Qwen3.5-397B-A17B is out!!

Covered by 5 sources including DEV Community, akitaonrails.com

Discussed on Hacker News

🐧Computing Systems Phoronix·

systemd 261 Released With New systemd-sysinstall OS Installer, IMDSD & Storagectl

Systemd 261 is out as stable today with a number of new features and ready to coincide with H2'2026 Linux distributions... Read more ›

Covers Systemd v261 Released

Discussed on Hacker News

⚖️AI Regulation Business Insider

Inside the whirlwind 24 hours that led the White House to slap export controls on Anthropic

Tense calls between Anthropic's CEO and administration officials on Friday underscore how the White House is wrestling with advanced AI models. Read more ›

Covers Inside the whirlwind 24 hours that led the White House to slap export controls on Anthropic

Covered by 4 sources including DEV Community, kite.kagi.com

Discussed on Hacker News

🤖AI and Tactical Agents martinfowler.com·

Building Reliable Agentic AI Systems

AI helping pharmaceutical researchers query decades of information buried in PDF reports Read more ›

Covers The think tool: Enabling Claude to stop and think in complex tool use situations

Covered by baeldung.com

Discussed on Hacker News, Hacker News, and Hacker News

🤖AI and Tactical Agents Martin Alderson·

A brief history of KV cache compression developments

How KV cache compression - from MQA and GQA to MLA and linear-attention hybrids - quietly unlocked the long context windows that make modern agentic LLMs possible. Read more ›

Covers TurboQuant: Redefining AI efficiency with extreme compression

🤖AI and Tactical Agents Hacker News·

Running local models is good now

I don't know about good, I use a lot of local models and they're still pretty painful to run locally Read more ›

Covers 2 stories including antirez/ds4: DeepSeek 4 Flash local inference engine for Metal

Covered by 5 sources including Simon Willison's Newsletter, simonwillison.net

Discussed on Hacker News

🤖AI and Tactical Agents simonwillison.net·

Quoting Georgi Gerganov

- nothing really impressive, but definitely a helpful tool for a maintainer. I think I would be using it much more, if I didn't have to spend a lot of my time on reviewing PRs. Currently, I have a very lightweight harness - the pi agent with everything stripped (pi -nc --offline) and to align it a bit with my style. — , Hacker News comment on by Boykis Tags: <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" <a href=" Read more ›

Covers 2 stories including Running local models is good now