⚡ Model Efficiency - jimman · Scour

Creeping memory allocation

community.folivora.ai·2d

OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale

arxiv.org·5d

⚡LLM Optimization

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

arxiv.org·5d

⚡LLM Optimization

H100 GPU: Powering the Next Era of AI and High-Performance Computing

dev.to·4d·

Discuss: DEV

⚡LLM Optimization

Python Techniques for Complete Machine Learning Model Lifecycle Management

dev.to·4d·

Discuss: DEV

⚡LLM Optimization

Faster than Dijkstra?

systemsapproach.org·2d·

Discuss: Hacker News

✍️Prompt Engineering

Understanding the Go Runtime: The Bootstrap

internals-for-interns.com·2d·

Discuss: Hacker News, r/golang

Using light-based computing to tackle complex challenges

queensu.ca·1d·

Discuss: Hacker News

⚡LLM Optimization

Manufacturing QMS Software

samrian.com·1d·

Discuss: Hacker News

⚡LLM Optimization

Learnings from Creating a GUI Library

blog.s-schoener.com·2d·

Discuss: Lobsters, Hacker News

🛠️Developer Tools

AI is dominating the world’s memory chips. That could make phones more expensive

restofworld.org·1d·

Discuss: Hacker News

⚡LLM Optimization

Tell it your hardware, get the exact local AI model to run

localcoder.xyz·5d·

Discuss: Hacker News

Deep dive into Hierarchical Navigable Small Worlds

amandeepsp.github.io·3d·

Discuss: Hacker News, r/Zig, r/programming

⚡LLM Optimization

Human-like Search for Modern Applications

anvitra.ai·3d·

Discuss: Hacker News

⚡LLM Optimization

Understanding How GIL Affects Checkpoint Performance in PyTorch Training

shayon.dev·4d·

Discuss: Hacker News

⚡LLM Optimization

Designing and Using Combinators: The Essence of Functional Programming

cse.chalmers.se·1d·

Discuss: Hacker News

✍️Prompt Engineering

Training language models on TPUs shouldn't be scary

dogac.dev·5d·

Discuss: Hacker News

⚡LLM Optimization

Sequential Attention: Making AI models leaner and faster without sacrificing accuracy

research.google·6d·

Discuss: Hacker News, r/LocalLLaMA

⚡LLM Optimization

Why my H100 cluster idled 130 hours/month (RoCEv2 and Storage bottlenecks)

rack2cloud.com·5d·

Discuss: Hacker News

⚡LLM Optimization

Experiments in building bespoke tools with AI

knlb.dev·2d·

Discuss: Hacker News

✍️Prompt Engineering
