H-FA: A Hybrid Floating-Point and Logarithmic Approach to Hardware Accelerated FlashAttention
arxiv.org·7h
⚡Model Efficiency
Flag this post
GPU Pro – Master Your AI Workflow
⚡Model Efficiency
Flag this post
Intel's LLM-Scaler Updated With OpenAI's GPT-OSS Model Support
phoronix.com·1h
⚡LLM Optimization
Flag this post
Why stop at 1 million tokens when you can have 10? My journey to extreme context on a gaming GPU. [P]
⚡Model Efficiency
Flag this post
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·1d
⚡Model Efficiency
Flag this post
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
arxiv.org·1d
⚡Model Efficiency
Flag this post
Automated Anomaly Detection and Self-Calibration in CMUT Array Fabrication via Bayesian Optimization
⚡Model Efficiency
Flag this post
AMD Will Continue Game Optimization Support For Older Radeon GPU's After All
tech.slashdot.org·12h
⚡Model Efficiency
Flag this post
This Month in Ladybird – October 2025
🛠️Developer Tools
Flag this post
A Thesis and Playbook for Edge AI
⚡Model Efficiency
Flag this post
Dive into Systems
✍️Prompt Engineering
Flag this post
CueBench: Advancing Unified Understanding of Context-Aware Video Anomalies in Real-World
arxiv.org·7h
✍️Prompt Engineering
Flag this post
Labs for Broke – EKS for Pennies
⚡Model Efficiency
Flag this post
The next RISC-V processor frontier: AI
⚡Model Efficiency
Flag this post
Loading...Loading more...