Bandits in Your LLM Gateway
🤖AI
Flag this post
How to use Claude Code for big tasks without turning your code to shit
💻Software development
Flag this post
Fast and Affordable LLMs serving on Intel Arc Pro B-Series GPUs with vLLM
blog.vllm.ai·21h
🤖AI
Flag this post
Provable Benefit of Curriculum in Transformer Tree-Reasoning Post-Training
arxiv.org·16h
🤖AI
Flag this post
Self play and autocurricula in the age of agents
🤖AI
Flag this post
The 5 FREE Must-Read Books for Every LLM Engineer
kdnuggets.com·6d
🤖AI
Flag this post
Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs
arxiv.org·16h
🤖AI
Flag this post
How to Achieve 4x Faster Inference for Math Problem Solving
developer.nvidia.com·1d
🤖AI
Flag this post
Very cool blog by @character_ai diving into how they trained their proprietary model Kaiju (13B, 34B, 110B), before switching to OSS model, and spoiler: it has...
threadreaderapp.com·2h
🤖AI
Flag this post
Walking the Tightrope of LLMs for Software Development: A Practitioners' Perspective
arxiv.org·16h
💻Software development
Flag this post
Textual Self-attention Network: Test-Time Preference Optimization through Textual Gradient-based Attention
arxiv.org·16h
🤖AI
Flag this post
Everything You Need to Know About LLM Evaluation Metrics
machinelearningmastery.com·1d
🤖AI
Flag this post
DiagnoLLM: A Hybrid Bayesian Neural Language Framework for Interpretable Disease Diagnosis
arxiv.org·16h
🤖AI
Flag this post
Intelligent inference request routing for large language models
next.redhat.com·6h
🤖AI
Flag this post
Retracing the Past: LLMs Emit Training Data When They Get Lost
arxiv.org·16h
🤖AI
Flag this post
Building AI Agents in Kotlin – Part 1: A Minimal Coding Agent
blog.jetbrains.com·12h
🤖AI
Flag this post
Loading...Loading more...