Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
馃敘 cuBLAS
Specific
CUDA Linear Algebra, Matrix Operations, GPU BLAS, cuBLASLt
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
5
posts in
6.8
ms
michelangeloromerochisco/ternative: Inference engine for ternary-weight LLMs with runtime LoRA - the llama.cpp of BitNet models
聽
馃攧
ONNX
github.com
路
1d
路
Hacker News
Ollama Doesn't Know Its
GPU
Is on Another Machine
聽
鈴憋笍
CUDA Events
loopholelabs.io
路
1d
路
Hacker News
Less-relevant results
Exceeding the Numerical and Performance Characteristics of IEEE-754 SGEMM with
BFloat16
Tensor
Cores
on GPUs for Scientific Computing
聽
馃幆
Tensor Cores
arxiv.org
路
2d
PyTorch, rewritten from scratch in pure Rust
聽
馃摐
TorchScript
github.com
路
6d
路
Hacker News
Luce Megakernal: Why nobody is taking about this?
聽
馃攳
Nsight
github.com
路
5d
路
r/LocalLLaMA
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help