GPU Programming

Feeds to Scour
SubscribedAll
Scoured 42 posts in 9.7 ms

Communication Strategy Selection for Multi-GPU 3D FDTD with Convolutional Perfectly Matched Boundary Layers

 Computer Graphics  Content type: Academic
arxiv.org·

Does anyone know what PCIe mode was used for these benchmarks?

 💬LLMs  Content type: Code
github.com··r/LocalLLaMA

Efficient $(\alpha,\beta)$-core Computation and On-the-fly Query at Billion Scale with GPUs

 🕸️Graph Theory  Content type: Academic
arxiv.org·

On GPU Implementation for Multi-Precision Integer Division

 Hardware Acceleration  Content type: Academic
arxiv.org·

GoodQ02/goodq4all: Local-first multimodal epistemic memory for scene-level video, audio, and text intelligence.

 🔍Information Retrieval  Content type: Code
github.com··Hacker News

MusaCoder: Native GPU Kernel Generation with Full-Stack Training on Moore Threads GPU

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

 💬LLMs  Content type: Code
github.com··r/LocalLLaMA

CodegenBench: Can LLMs Write Efficient Code Across Architectures?

 🤖AI  Content type: Academic
arxiv.org··Hacker News

NVIDIA/cosmos: NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.

 🤖AI  Content type: Code
github.com·

Graph Traversal on Tensor Cores: A BFS Framework for Modern GPUs

 ⚙️Algorithms  Content type: Academic
arxiv.org·

GNStor: Design of GPU-Native High-Performance Remote All-Flash Array

 Computer Graphics  Content type: Academic
arxiv.org·

DeployBench: Benchmarking LLM Agents for Research Artifact Deployment

 Hardware Acceleration  Content type: Academic
arxiv.org·

huawei-csl/KVarN: KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.

 💬LLMs  Content type: Code
github.com··Hacker News

AutoLab: Can Frontier Models Solve Long-Horizon Auto Research and Engineering Tasks?

 🎯AI Agents  Content type: Academic
arxiv.org·

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🤖AI  Content type: Code
github.com··Hacker News

Video-Rate Streaming Stylization on a Vision-Aware MLLM-Conditioned Edit Diffusion: Asymmetric Batched Inference on a Distilled UNet + MLLM Text Encoder

 🎨Generative AI  Content type: Academic
arxiv.org·

No more posts from jhcha.oyo's subscribed feeds.

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help