GEMM Optimization

Feeds to Scour
SubscribedAll
Scoured 50 posts in 9.9 ms

From Database and Virtualized Workloads to Backup: Dell PowerEdge R4715 and R5715 for SMB Realities

 🕸️Network Fabrics
storagereview.com·

RhinoVLA Technical Report

 ⚙️MLOps  Content type: Academic
arxiv.org·

RATrain: A Resource-Aware Training Runtime for Large Language Models on Bandwidth-Constrained Heterogeneous Supercomputing Platforms

 🎮GPU Computing  Content type: Academic
arxiv.org·

AlphaQ: Calibration-Free Bit Allocation for Mixture-of-Experts Quantization

 💰Inference Cost  Content type: Academic
arxiv.org·

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

 🎮GPU Computing  Content type: Academic
arxiv.org·

null-drift: Using tokio RwLocks and bincode to build an O(1) fault-tolerant AI memory architecture.

 🔄Cache-Coherence  Content type: Code
github.com··r/rust

I built a local memory engine for AI agents using Rust, Python, and a 10,000D mathematical array.

 🔄Cache-Coherence  Content type: Code
github.com··r/SideProject

Gauss Circle Lattices with Geometric Convolutions for Synthesizing High Dimensional Image-Source Room Impulse Responses

 ☁️Cloud Infrastructure  Content type: Academic
arxiv.org·

IBL-tools/rawtohdri: rawtohdri Converts bracketed camera raw files directly to an OpenEXR format HDRI.

 💻Systems Programming  Content type: Code
github.com··Hacker News

DiffusionGemma: The Developer Guide- Google Developers Blog

 🧠Inference Engineering  Content type: Blog

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help