🤖 LLM Inference - touyou · Scour

How we fight GPU scarcity without compromise

⚙️AI Infrastructure Blog

equixly.com · Hacker News

Less-relevant results

Token4Token — pay-per-token inference on Gnosis + Swarm

⚙️AI Infrastructure

t4t.eth.link · Hacker News

Making LLMs faster and more efficient across multiple languages

👁️Multimodal LLMs

techxplore.com

Build a local voice agent with Red Hat OpenShift AI

⚙️AI Infrastructure

developers.redhat.com

Making Local LLM Go Brrr

⚙️AI Infrastructure

seanpedersen.github.io

Breaking the Ice: Analyzing Cold Start Latency in vLLM

⚙️AI Infrastructure Academic

arxiv.org · Hacker News

CoreML vs TFLite: iPhone 15 Pro GPU 2.3x Faster

⚡Inference Optimization Blog Discussion

defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes

⚡Inference Optimization Code

github.com · Hacker News

3x Faster Search: Parallel Test-Time Scaling with Instructed-Retriever-1

⚡Inference Optimization Blog

databricks.com

OpenCV 5.0 Computer Vision Library Released with Rewritten DNN Engine

👁️Multimodal LLMs

OpenCV 5 release - New DNN engine with enhanced ONNX and LLM/VLM support, Intel, Arm, and RISC-V hardware optimizations - CNX Software

👁️Multimodal LLMs News

cnx-software.com

A field journal on Ray Data and Daft for multimodal data lake (14 minute read)

👁️Multimodal LLMs Blog

mehulbatra.medium.com

Intro — Sehastrajit

👁️Multimodal LLMs Blog

Where to Host Your Open-Source Model (Under 10B Parameters)

⚙️AI Infrastructure

digitalocean.com

not much happened today | AINews

⚙️AI Infrastructure

The 1-Second Timeout Hack: Running Infinite Parallel Workloads Natively on Google Apps Script

⚙️AI Infrastructure Blog

The Death of the Four Golden Signals: Designing Telemetry for Non-Deterministic Infrastructure

🔍Retrieval-Augmented Generation

RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step LLM Inference

⚙️AI Infrastructure Academic

[AINews] FrontierCode: Benchmarking for Code Quality over Slop

⚙️AI Infrastructure News

Ask HN: Is software engineering still a good career choice for new students?

⚡Inference Optimization Discussion

news.ycombinator.com · Hacker News
