InferenceMAX โ€“ open-source Inference Frequent Benchmarking
github.comยท3hยท
Discuss: Hacker News
๐Ÿ—๏ธLLM Infrastructure
Benchmarking LLM Inference on RTX 4090 / RTX 5090 / RTX PRO 6000 #2
reddit.comยท5hยท
Discuss: r/LocalLLaMA
๐Ÿ—๏ธLLM Infrastructure
HiPRAG: Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation
arxiv.orgยท19h
๐Ÿ—๏ธLLM Infrastructure
How we built a structured Streamlit Application Framework in Snowflake
about.gitlab.comยท23h
๐Ÿ”งDeveloper tools
Assuring Agent Safety Evaluations By Analysing Transcripts
lesswrong.comยท13h
๐Ÿ†LLM Benchmarking
Scaling Time-Series Data for AI Models
singlestore.comยท8h
๐ŸŽ›๏ธFeed Filtering
Supercharge your Enterprise BI: How to approach your migration to AI/BI
databricks.comยท2h
๐Ÿ—๏ธInfrastructure Economics
YouTube gets ~5% CTR lift on Shorts by replacing embedding tables with Semantic IDs
shaped.aiยท23h
๐Ÿ“ŠFeed Optimization
MECE โ€” The AI Principle Youโ€™ll Never Stop Using After Reading This
pub.towardsai.netยท12h
๐Ÿ”AI Interpretability
How different AI engines generate and cite answers
searchengineland.comยท11h
๐Ÿ“ŠFeed Optimization
Debugging Humidity: Lessons from deploying software in the physical world
physical-ai.ghost.ioยท3hยท
Discuss: Hacker News
๐ŸŒDistributed systems
Can AI Co-Design Distributed Systems? Scaling from 1 GPU to 1k
harvard-edge.github.ioยท1hยท
Discuss: Hacker News
๐ŸŒDistributed systems
How View Caching in Rails Works (2020)
honeybadger.ioยท9hยท
Discuss: Hacker News
๐Ÿ’พPrompt Caching
QUIC! Jump to User Space!
hackaday.comยท8h
โšกQUIC Protocol
Truly distributed and 5x more resilient - CockroachDB vs Oracle GDD
cockroachlabs.comยท23h
๐Ÿ˜PostgreSQL
No Bullshit Guide to Statistics prerelease
minireference.comยท5hยท
Discuss: Hacker News
๐Ÿ“ŠStatistical Ranking
LLM-Based AI Agent That Automates The Transistor Sizing Process (Univ. of Edinburgh)
semiengineering.comยท3h
๐Ÿ†•New AI
OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
arxiv.orgยท19h
๐Ÿง LLM Inference
OpenAI's newly launched Sora 2 makes AI's environmental impact impossible to ignore
techxplore.comยท12h
๐Ÿ†•New AI