🖥️ GPU Programming - jhcha.oyo · Scour

Communication Strategy Selection for Multi-GPU 3D FDTD with Convolutional Perfectly Matched Boundary Layers

✨Computer Graphics Academic

Does anyone know what PCIe mode was used for these benchmarks?

💬LLMs Code

github.com · r/LocalLLaMA

Efficient $(\alpha,\beta)$-core Computation and On-the-fly Query at Billion Scale with GPUs

🕸️Graph Theory Academic

On GPU Implementation for Multi-Precision Integer Division

⚡Hardware Acceleration Academic

GoodQ02/goodq4all: Local-first multimodal epistemic memory for scene-level video, audio, and text intelligence.

🔍Information Retrieval Code

github.com · Hacker News

MusaCoder: Native GPU Kernel Generation with Full-Stack Training on Moore Threads GPU

🎮Reinforcement Learning Academic

heterodoxin/graphkv: Graph-guided KV cache compression for memory-efficient LLM inference.

💬LLMs Code

github.com · r/LocalLLaMA

CodegenBench: Can LLMs Write Efficient Code Across Architectures?

🤖AI Academic

arxiv.org · Hacker News

NVIDIA/cosmos: NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.

🤖AI Code

Graph Traversal on Tensor Cores: A BFS Framework for Modern GPUs

⚙️Algorithms Academic

GNStor: Design of GPU-Native High-Performance Remote All-Flash Array

✨Computer Graphics Academic

DeployBench: Benchmarking LLM Agents for Research Artifact Deployment

⚡Hardware Acceleration Academic

huawei-csl/KVarN: KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. Calibration-free, one flag.

💬LLMs Code

github.com · Hacker News

AutoLab: Can Frontier Models Solve Long-Horizon Auto Research and Engineering Tasks?

🎯AI Agents Academic

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

🤖AI Code

github.com · Hacker News

Video-Rate Streaming Stylization on a Vision-Aware MLLM-Conditioned Edit Diffusion: Asymmetric Batched Inference on a Distilled UNet + MLLM Text Encoder

🎨Generative AI Academic
