🦙 Llama - abdus

Blog

medium.com

Would a prepaid pass for a coding agent solve a real need or is it just my itch?

codehamr.com··r/SideProject

[AINews] FrontierCode: Benchmarking for Code Quality over Slop

News

latent.space

google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation

huggingface.co··r/LocalLLaMA

DiffusionGemma: The Developer Guide

Blog

developers.googleblog.com·

Calibration Drift Under Reasoning: How Chain-of-Thought Budgets Induce Overconfidence in Large Language Models

Academic

arxiv.org·

techjarves/Portable-AI-USB: A 100% offline, fully portable, zero-trace AI (Ollama + Llama 3 + AnythingLLM) that runs natively from a USB drive on Windows and Mac.

Code

github.com·

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

News Blog

blog.google··Hacker News

DiffusionGemma: 4x Faster Text Generation

News Blog

blog.google··Hacker News, r/LocalLLaMA, r/singularity

Burning 2.1M Tokens Version of Misadventures in Vibe-Programming: LAUGH OF THE DAY

substackcdn.com··Substack

How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)

buy.polar.sh··DEV

Omnifs: APIs and data sources as files you can ls, cat, grep, and pipe

omnifs.dev··Hacker News

Build a local voice agent with Red Hat OpenShift AI

developers.redhat.com·

How we fight GPU scarcity without compromise

Blog

equixly.com··Hacker News

Shrinking a Neural Network Often Makes It Smarter

siliconopera.com·

Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training

Academic

arxiv.org·

Study Shows AI Can Pass The Turing Test More Reliably Than Humans

bgr.com·

[AINews] not much happened today

News

latent.space

I built an open-source persistent memory layer for AI coding agents

Code

github.com··r/GithubCopilot

local AI agents for Cursor with pre-tuned marketplace/commu

PagedAttention vs Traditional KV Cache: How vLLM Reinvented GPU Memory for LLM Inference

Would a prepaid pass for a coding agent solve a real need or is it just my itch?

[AINews] FrontierCode: Benchmarking for Code Quality over Slop

google/gemma-4-31B-it · fix: chat template — null handling, reasoning preservation, turn-tag balance, input validation

DiffusionGemma: The Developer Guide

Calibration Drift Under Reasoning: How Chain-of-Thought Budgets Induce Overconfidence in Large Language Models

techjarves/Portable-AI-USB: A 100% offline, fully portable, zero-trace AI (Ollama + Llama 3 + AnythingLLM) that runs natively from a USB drive on Windows and Mac.

Gemma 4 QAT models: Optimizing model compression for mobile and laptop efficiency

DiffusionGemma: 4x Faster Text Generation

Burning 2.1M Tokens Version of Misadventures in Vibe-Programming: LAUGH OF THE DAY

How I benchmarked a 100% local RAG pipeline to 9/9 (zero API keys)

Omnifs: APIs and data sources as files you can ls, cat, grep, and pipe

Build a local voice agent with Red Hat OpenShift AI

How we fight GPU scarcity without compromise

Shrinking a Neural Network Often Makes It Smarter

Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training

Study Shows AI Can Pass The Turing Test More Reliably Than Humans

[AINews] not much happened today

I built an open-source persistent memory layer for AI coding agents