Model Optimization, Inference Engines, LLM Quantization, Privacy-focused Deployments

Building AI-Powered APIs in Minutes, Not Months
dev.to·5h·
Discuss: DEV
📱Edge AI
Flag this post
MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation
arxiv.org·2d
🏗️AI Infrastructure
Flag this post
Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism
arxiv.org·5h
💻Local LLMs
Flag this post
StreetMath: Study of LLMs' Approximation Behaviors
arxiv.org·5h
💻Local LLMs
Flag this post
How Well Does RL Scale?
tobyord.com·15h·
Discuss: Hacker News
🏗️AI Infrastructure
Flag this post
One Memory Layer, Multiple Models (Claude, GPT, Llama, etc.)
github.com·5h·
Discuss: Hacker News
🏗️AI Infrastructure
Flag this post
Show HN: Everything it took to run an LLM at 10k tok/s on H200s
relace.ai·1d·
Discuss: Hacker News
📱Edge AI
Flag this post
Cross-Platform Evaluation of Reasoning Capabilities in Foundation Models
arxiv.org·5h
🏗️AI Infrastructure
Flag this post
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
arxiv.org·5h
🏗️AI Infrastructure
Flag this post
Scalable Quantum Approximate Optimization Algorithm (QAOA) Parameter Optimization via Adaptive Bayesian Hyperparameter Tuning
dev.to·14h·
Discuss: DEV
📱Edge AI
Flag this post
Generative AI, Simplicity, and Easiness
gioleppe.github.io·13h·
Discuss: Hacker News
🏗️AI Infrastructure
Flag this post
Torchforge – a PyTorch native library for scalable RL post-training
pytorch.org·22h·
Discuss: Hacker News
🏗️AI Infrastructure
Flag this post
Freephdlabor: Customizable multiagent research automation system
freephdlabor.github.io·1d·
Discuss: Hacker News
🤖AI agents
Flag this post
Model Inversion with Layer-Specific Modeling and Alignment for Data-Free Continual Learning
arxiv.org·5h
📱Edge AI
Flag this post
Quantum-Leaping Collateral: AI-Powered Optimization for the Future of Finance
dev.to·4h·
Discuss: DEV
💻Local LLMs
Flag this post
A Minimal Route to Transformer Attention
neelsomaniblog.com·1d·
Discuss: Hacker News
🏗️AI Infrastructure
Flag this post
Enhanced Knowledge Graph Reasoning via Multi-Modal Data Fusion and Automated Verification
dev.to·18h·
Discuss: DEV
🕸️Graph Databases
Flag this post