Fast PEFT Serving at Scale
databricks.com·12h
🏗️LLM Infrastructure
Neural audio codecs: how to get audio into LLMs
🔢BitNet
Accelerating Hybrid Inference in SGLang with KTransformers CPU Kernels
lmsys.org·6h
⚡Hardware Acceleration
FlashInfer Bench: A Benchmark Suite for AI Systems That Improve Themselves
🏗️LLM Infrastructure
The Continual Learning Problem
🏗️LLM Infrastructure
Humans and LLMs represent sentences similarly, study finds
techxplore.com·18h
📋Text Quality
New paper! We reverse engineered the mechanisms underlying Claude Haiku’s ability to perform a simple “perceptual” task. We discover beautiful feature families ...
threadreaderapp.com·13h
🎭Claude
Requirement Adherence: Boosting Data Labeling Quality Using LLMs
uber.com·16h
🏗️LLM Infrastructure
Simple GPU Selection Tool for AI and Deep Learning
🖥GPUs
Samuel x Bhishma - Superintelligence by 2030?
lesswrong.com·14h
🛡️AI Security
Show HN: I built an LLM that never forgets – persistent user memory with RAG
🏗️LLM Infrastructure
Selecting hardware for local LLM
🏗️LLM Infrastructure
A Bayesian approach towards atomically-precise localization in fluorescence microscopy
nature.com·16h
🏗️LLM Infrastructure
My Relationship with an AI Code Editor
spin.atomicobject.com·18h
👨‍💻AI Coding
10 CMU Students Selected for Amazon AI Ph.D. Fellowship Program
cs.cmu.edu·17h
🛡️AI Safety
Identify User Journeys at Pinterest
medium.com·8h
🎯Recommendation Metrics
AI models can now be customized with far less data and computing power
techxplore.com·10h
🆕New AI
The Invisible Pollution Behind ChatGPT
pub.towardsai.net·7h
🆕New AI