Attention Optimization, Memory Efficiency, Transformer Acceleration, IO-Aware

SeamFit: Towars Practical Smart Clothing for Automatic Exercies Logging
dl.acm.org·6h·
Discuss: Hacker News
🏎️TensorRT
Flag this post
Sweep (YC S23) is hiring to build autocomplete for JetBrains
ycombinator.com·1d·
Discuss: Hacker News
🤖AI Coding Tools
Flag this post
Using the probabilistic method to bound the performance of toy transformers by Alex Gibson
greaterwrong.com·1d
📊Gradient Accumulation
Flag this post
FastMinify - Free Client-Side JS/CSS Minifier
fastminify.com·1d·
Discuss: DEV
🚀Compiler Optimization
Flag this post
The Advent Of ‘Thinking Tokens’ Causes Unforeseen Inflationary Impact On Generative AI
forbes.com·3d
🤖AI Coding Tools
Flag this post
Links 07/11/2025: Software Patents Squashed, Stock Markets Wobble Over Slop Uncertainties
techrights.org·1d
🔄ONNX
Flag this post
New build LLaMA - Lenovo P920 base - How to make for max large context?
reddit.com·12h·
Discuss: r/LocalLLaMA
📈Occupancy Optimization
Flag this post
Combining Harmonic Sampling with the Worm Algorithm to Improve the Efficiency of Path Integral Monte Carlo
arxiv.org·1d
🔄ONNX
Flag this post
🤖Building an AI-Powered Digital Receptionist: Automating Business Communication
dev.to·10h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
Explaining Human Choice Probabilities with Simple Vector Representations
arxiv.org·2d
📊Gradient Accumulation
Flag this post
Gemini CLI: The Future of Programming and Reflections on the Impacts of AI
dev.to·1d·
Discuss: DEV
🤖AI Coding Tools
Flag this post
LLMs Talking in Secret: Direct Semantic Links for AI Collaboration by Arvind Sundararajan
dev.to·4h·
Discuss: DEV
💡LSP
Flag this post
Raspberry Pi just got a pro upgrade with Pi Vision 10.1
howtogeek.com·23h
🚀MLOps
Flag this post
CoT-Saliency: Unified Chain-of-Thought Reasoning for Heterogeneous Saliency Tasks
arxiv.org·4d
🏎️TensorRT
Flag this post
Google Debuts “Nested Learning” — A New ML Paradigm for Continual Learning
dev.to·13h·
Discuss: DEV
📊Gradient Accumulation
Flag this post
Beyond Pinecone: A Developer's Deep Dive into the Top 10 Vector Databases for GenAI in 2024
dev.to·1d·
Discuss: DEV
ONNX Runtime
Flag this post
EQ-Negotiator: Dynamic Emotional Personas Empower Small Language Models for Edge-Deployable Credit Negotiation
arxiv.org·2d
🛠Ml-eng
Flag this post
CueBench: Advancing Unified Understanding of Context-Aware Video Anomalies in Real-World
arxiv.org·4d
🧮cuDNN
Flag this post
VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models
arxiv.org·3d
🧩Attention Kernels
Flag this post