HPC

high performance computing, parallel processing, SIMD, efficient computation

Concepts in Practice: C++ MPI Bindings for the HPC Ecosystem. From a Standardizable Core to a Composable Interface

 🕸️Distributed Systems  Content type: Academic
arxiv.org

CUDA-Oxide 0.2 Brings Early Improvements To Pure Rust CUDA Kernels

 🖥️Computer Hardware
phoronix.com

Supercomputers alone won’t speed up discoveries without trained researchers, says NSCC chief

 🖥️Computer Hardware
channelnewsasia.com

(PR) NextSilicon to Productize Arbel RISC-V Core Into 64-Core Enterprise Processor for AI and HPC

 🖥️Computer Hardware
techpowerup.com

Framework Desktop AMD 395+ (RDNA 3.5) cannot run ComfyUI err Fix 2026

 📡LoRa  Content type: Blog
runaihome.com · DEV

SWIFT: Shallow and SIMD-Aware CKKS Functional Bootstrapping for Low-Latency

 🔢Numerical Methods
eprint.iacr.org

Big Blue’s Redbook on Storage Scale KV Cache management

 🕸️Distributed Systems  Content type: News
blocksandfiles.com

How Airbus’ supercomputers are driving the future of design

 🔢Numerical Methods  Content type: News

Flatpak 1.18 adds AMD ROCm support, improved error output, and faster Fish shell start-up

 ❄️Nix
alternativeto.net

NVIDIA Nsight Compute

 🖥️Computer Hardware
developer.nvidia.com

HydraMPP: A lightweight library for distributed massive parallel processing in Python - threading at scale.

 🕸️Distributed Systems  Content type: Academic
biorxiv.org

HFT Latency Monitoring with Probabilistic Calling Context

 📡LoRa
hftuniversity.com · Hacker News

RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.

 🖥️Computer Hardware  Content type: Code
github.com · Hacker News

Why my SIMD code was silently running as scalar, and what debugging it taught me about production environment assumptions

 🔢Numerical Methods  Content type: Blog

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

 🖥️Computer Hardware
smolhub.com · r/LocalLLaMA

Full Context on a Vulkan-Only Strix Halo: The Decode-Drop Reproduces, but the Sweet Spot Moves

 🕸️Distributed Systems

Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]

 🖥️Computer Hardware
openjdk.org · r/java

How Will the Chiplet IC Market Transform Semiconductor Design Through 2034?

 🖥️Computer Hardware  Content type: Blog

RATrain: A Resource-Aware Training Runtime for Large Language Models on Bandwidth-Constrained Heterogeneous Supercomputing Platforms

 🔢Numerical Methods  Content type: Academic
arxiv.org

Core Automation co-founder Jerry Tworek jokes that Nvidia's CUDA translates to miracles in Polish

 🖥️Computer Hardware
digg.com
