Attention Is All You Need for KV Cache in Diffusion LLMs
paperium.net·2d·
Discuss: DEV
🔁Cache Coherence
Flag this post
Enabling Trillion-Parameter Models on AWS EFA
research.perplexity.ai·1d·
Discuss: Hacker News
Hardware Acceleration
Flag this post
Small Vs. Large Language Models
semiengineering.com·2d·
Discuss: Hacker News, r/LLM
📱Edge AI
Flag this post
Predicting Encoding Energy from Low-Pass Anchors for Green Video Streaming
arxiv.org·2d
🎬WebCodecs
Flag this post
A brief guide for those who slept (on AI) the last two years
github.com·16h·
Discuss: DEV
💬Prompt Engineering
Flag this post
Machine Learning Fundamentals: Everything I Wish I Knew When I Started
dev.to·4d·
Discuss: DEV
🧠Machine Learning
Flag this post
Deploying Rapid Damage Assessments from sUAS Imagery for Disaster Response
arxiv.org·2h
👁️Computer Vision
Flag this post
Attention ISN'T all you need?! New Qwen3 variant Brumby-14B-Base leverages Power Retention technique
venturebeat.com·1d
Incremental Computation
Flag this post
Decoupled Entropy Minimization
arxiv.org·2h
📊Information Theory
Flag this post
Why AI infrastructure and multi-platform compute strategy matters now?
dev.to·8h·
Discuss: DEV
Hardware Acceleration
Flag this post
[P] triplet-extract: GPU-accelerated triplet extraction via Stanford OpenIE in pure Python
reddit.com·2d·
🧮Vector Databases
Flag this post
Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards
arxiv.org·2h
🎯Reinforcement Learning
Flag this post
SurgViVQA: Temporally-Grounded Video Question Answering for Surgical Scene Understanding
arxiv.org·2h
👁️Computer Vision
Flag this post
GMoPE:A Prompt-Expert Mixture Framework for Graph Foundation Models
arxiv.org·2h
💬Prompt Engineering
Flag this post
CueBench: Advancing Unified Understanding of Context-Aware Video Anomalies in Real-World
arxiv.org·2d
💬Prompt Engineering
Flag this post
CoCoVa: Chain of Continuous Vision-Language Thought for Latent Space Reasoning
arxiv.org·1d
👁️Computer Vision
Flag this post
GAFD-CC: Global-Aware Feature Decoupling with Confidence Calibration for OOD Detection
arxiv.org·1d
👁️Computer Vision
Flag this post