Parallel Computing

Kosovo votes again amid political deadlock, seeking EU and NATO progress

 🔀Concurrency  Content type: Video  Content type: News
aljazeera.com·

Issue 753

 🔭Observability

A Double Victory for Web Speed: Chrome Breaks Records Again on Speedometer 3.1 and Jetstream 3

 🗂️Data Structures  Content type: News  Content type: Blog
blog.google··Hacker News

AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis

 🎮CUDA  Content type: Academic
arxiv.org··Hacker News

Fast Exact Nearest-Neighbor Learning for High-Frequency Financial Time Series

 📈Investing  Content type: Academic
arxiv.org·

jeffhuen/RustyCSV: High-performance CSV parsing for Elixir. Rust NIF with SIMD acceleration, parallel parsing, and bounded-memory streaming. Drop-in NimbleCSV replacement.

 λFunctional Programming  Content type: Code
github.com··Hacker News

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

 🎮CUDA  Content type: Academic
arxiv.org·

FlashCP: Load-Balanced Communication-Efficient Context Parallelism for LLM Training

 🌐Distributed Systems  Content type: Academic
arxiv.org·

KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++

 🎮CUDA  Content type: Code
github.com··Hacker News

A remark on diagnosability verification

 🔀Concurrency  Content type: Academic
arxiv.org·

Structuring agentic AI for HPC code modernization

 🚀High Performance Computing  Content type: Academic
arxiv.org·

llama.cpp - Qwen3.6/3.5-MTP - Share your benchmarks t/s

 🖼️GPU Computing  Content type: Code
github.com··r/LocalLLaMA

docs: document release audit scripts · openclaw/openclaw@72547a1

 📈Performance  Content type: Code
github.com·

zhongkaifu/TensorSharp: A C# inference engine for running large language models (LLMs) locally using GGUF model files. TensorSharp provides a console application, a web-based chatbot interface, and Ollama/OpenAI-compatible HTTP APIs for programmatic access. It supports Windows/MacOS/Linux with full GPU capability

 🐧Linux  Content type: Code
github.com··Hacker News

test(docker): cap npm scheduler concurrency · openclaw/openclaw@023427b

 🦀Rust  Content type: Code
github.com·

LLM-Based Porting of Optimized C++ to CUDA Through Deoptimization and Reoptimization

 🚀High Performance Computing  Content type: Academic
arxiv.org·

Does anyone know what PCIe mode was used for these benchmarks?

 🖼️GPU Computing  Content type: Code
github.com··r/LocalLLaMA

SET: Stream-Event-Triggered Scheduling for Efficient CUDA Graph Pipelines

 🖼️GPU Computing  Content type: Academic
arxiv.org·

jdalang/jda-lang: Jda: A high-performance systems language bootstrapped from assembly. Beats C on sudoku & LZ77. Self-hosted compiler, no GC, built-in concurrency & ML.

 🐧Linux  Content type: Code
github.com··DEV

YouZhi: Towards High-Concurrency Financial LLMs via Adaptive GQA-to-MLA Transition

 🌲LSM Trees  Content type: Academic
arxiv.org·
