⚡ Model Efficiency - jimman · Scour

Claude: Speed up responses with fast mode

simonwillison.net·3d

✍️Prompt Engineering

The Top 10 Best Practices for AI/BI Dashboards Performance Optimization (Part 2)

databricks.com·6d

📊Data Visualization

AI Search Engine Performance Monitoring Systems

open.forem.com·2d·

Discuss: DEV

Balancing FP8 Computation Accuracy and Efficiency on Digital CIM via Shift-Aware On-the-fly Aligned-Mantissa Bitwidth Prediction

arxiv.org·5d

⚡LLM Optimization

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

arxiv.org·5d

⚡LLM Optimization

**Python Techniques for Complete Machine Learning Model Lifecycle Management**

dev.to·4d·

Discuss: DEV

⚡LLM Optimization

Deterministic AI: Reclaiming Predictable Latency with Rust and Zero-Cost Abstractions

dev.to·4d·

Discuss: DEV

⚡LLM Optimization

Building an AI-Native Pharma

formation.bio·5d·

Discuss: Hacker News

🔍AI Interpretability

Production pain points and coordination patterns from building a dual-orchestrator (Claude + Kimi) system on Claude Code. 8 failure modes with specs and invariants.

gist.github.com·3d·

Discuss: Hacker News

🛠️Developer Tools

How Meta turned the Linux Kernel into a planet-scale Load Balancer. Part I

softwarefrontier.substack.com·3d·

Discuss: Substack

Seedance2 – multi-shot AI video generation

genstory.app·3d·

Discuss: Hacker News

✍️Prompt Engineering

Oatmeal - Constraint propagation for fun

eli.li·3d·

Discuss: Lobsters, Hacker News

✍️Prompt Engineering

Building the Future with AI That Acts

devxt.com·3d·

Discuss: Hacker News

🔍AI Interpretability

I Built a 6 BIPS JIT in Five Months

unlikelyemphasis.substack.com·5d·

Discuss: Substack

✍️Prompt Engineering

Pony Alpha: New free 200K context model for coding, reasoning and roleplay

ponyalpha.pro·3d·

Discuss: Hacker News

✍️Prompt Engineering

Achieving Ultra-Fast AI Chat Widgets

cjroth.com·3d·

Discuss: Hacker News

✍️Prompt Engineering

NotebookLM: The AI that only learns from you

byandrev.dev·3d·

Discuss: Hacker News

⚡LLM Optimization

Writing an LLM from scratch, part 32c – Interventions: removing dropout

gilesthomas.com·5d·

Discuss: Hacker News

⚡LLM Optimization

EBM vs. LLMs: Our Kona EBM a 96% vs. 2% Sudoku Benchmark

logicalintelligence.com·5d·

Discuss: Hacker News

⚡LLM Optimization

How do you use AI coding tools at scale without losing architectural control?

contextfirst.dev·3d·

Discuss: Hacker News

✍️Prompt Engineering

Loading more...