⚛️ React - il_monho · Scour

SSO Is More Than "Log In Once"

🖥️Fullstack Dev Code

github.com · DEV

Week 1 of building Quantamind: Ditching Electron for Rust & Tauri 🦀

🧠LLMs Code

github.com · DEV

bigattichouse/packed-twin-inference: PTI achieves ~2× throughput using a single quantized model (Q5_K_M or better) by running 4 generation streams in one batched decode call. The GPU loads model weights once per step and produces 4 predictions simultaneously. KV cache overhead is ~0.8 GiB total for all 4 streams. No draft model. No quality loss.

🧠LLMs Code

github.com · r/LocalLLaMA
