🎮 GPU Microarchitecture
GPU ISA, shader cores, warp scheduling, SIMT execution
DICE: Enabling Efficient General-Purpose SIMT Execution with Statically Scheduled Coarse-Grained Reconfigurable Arrays
🎮 SIMT Execution · arxiv.org · 6d
Show HN: I built a small repertoire of different computing systems
🖥️ Hardware Architecture · computers.tugdual.fr · 1d · Hacker News
Architecting Data Pipelines for Multimodal Datasets at Scale
🎮 GPU Architecture · anyscale.com · 15h
Unlocking asynchronicity in continuous batching
🎮 SIMT Execution · huggingface.co · 23h
The case for fine-grained tracking of compute for AI
📊 AI Performance Profiling · lesswrong.com · 1d
COLORFUL iGame GeForce RTX 5070 Ultra OC Review - When Style and Performance Meet
🎮 WebGPU · tweaktown.com · 6d
Mapping NVIDIA's Full GenAI Toolchain
🟩 Nvidia · mlops.community · 1d
Scaling PCIe Controllers for AI Bandwidth: A Multistream Architecture Analysis for 64 GT/s and 128 GT/s
🏗 System Design Patterns · semiengineering.com · 1d
DLSS has turned buying a GPU into analyzing different software tiers, and I can't keep up
🖥️ Terminal Renaissance · xda-developers.com · 6d
How to achieve truly serverless GPUs
🚀 Modal · modal.com · 2d · Hacker News
Vulkan 1.4.351 Brings Six New Extensions, Including A Ray-Tracing Improvement
⚡ Real-time Rendering · phoronix.com · 3d · r/linux
A detailed algorithmic study on a reuse-aware, near memory, all-digital Ising machine
📊 Data-Oriented Design · arxiv.org · 19h
TLX: Hardware-Native, Evolvable MIMW GPU Compiler for Large-scale Production Environments
🎮 SIMT Execution · arxiv.org · 2d · Hacker News
OOM-Free Alpamayo via CPU-GPU Memory Swapping for Vision-Language-Action Models
🎮 GPU Memory · arxiv.org · 1d
Beyond Static Policies: Exploring Dynamic Policy Selection for Single-Thread Performance Optimization
⏱️ Runtime Performance Analysis · arxiv.org · 6d
Closer in the Gap: Towards Portable Performance on RISC-V Vector Processors
🖥️ Modern CPU · arxiv.org · 2d
EDA-Schema-V2: A Multimodal Schema, Open Datasets, and Benchmarks for Machine Learning in Digital Physical Design
🔌 Embedded Systems · arxiv.org · 3d
EULER-ADAS: Energy-Efficient & SIMD-Unified Logarithmic-Posit Engine for Precision-Reconfigurable Approximate ADAS Acceleration
🖥️ Hardware Architecture · arxiv.org · 3d
Towards Compute-Aware In-Switch Computing for LLMs Tensor-Parallelism on Multi-GPU Systems
🏗️ LLM Infrastructure · arxiv.org · 6d · Hacker News
TransDot: An Area-efficient Reconfigurable Floating-Point Unit for Trans-Precision Dot-Product Accumulation for FPGA AI Engines
🎯 Emulation Accuracy · arxiv.org · 3d