🦙 Ollama - hugonoss · Scour

Local LLMs are ready for real work 💾Local-First Software

thelurkreport.beehiiv.com·2d·r/LocalLLaMA

Find bugs in YOUR code using OpenCode, Llama.cpp and Qwen3.6 🕹️PICO-8

wtarreau.blogspot.com·3d·Lobsters, Hacker News, wtarreau.blogspot.com

BrunoArsioli/llama-optimus: Lightweight Python tool using Optuna for tuning llama.cpp flags: towards optimal tok/s for your machine 🚀Performance

github.com·6h·r/LocalLLaMA

Apple Silicon costs more than OpenRouter 🤪outlandish and economical

williamangel.net·3d·Hacker News, r/LocalLLaMA

Peekaboo documentation 🕹️PICO-8

peekaboo.sh·5d

llama : MTP clean-up by ggerganov · Pull Request #23269 💫slick production values

github.com·1d·r/LocalLLaMA

I've updated my glorified Llama fork (LLM Inference Server) for P40's to utilise MTP + TurboQuant + DFlash 🚀Performance

github.com·4d·r/LocalLLaMA

A VERY lightweight open web-search tool for smaller local LLMs 🔎Search engine

github.com·6d·Hacker News, r/LocalLLaMA

5p00kyy/club-5060ti: Practical local LLM recipes and benchmarks for RTX 5060 Ti setups 🚀Performance

github.com·5d·r/LocalLLaMA

llama: avoid copying logits during prompt decode in MTP by am17an · Pull Request #23198 🔧CAT Tools

github.com·3d·r/LocalLLaMA

Refactor: convert_hf_to_gguf.py by pwilkin · Pull Request #17114 👑Globofeudalist

github.com·5d·r/LocalLLaMA
