🔄 SIMD Programming
Specific
AVX512, Vector Instructions, Loop Unrolling, Auto-vectorization
Scoured 188638 posts in 54.9 ms
Hybrid JIT-CUDA Graph Optimization for Low-Latency Large Language Model Inference
🏗️ LLM Infrastructure · arxiv.org · 5d
SpikingBrain2.0: Brain-Inspired Foundation Models for Efficient Long-Context and Cross-Platform Inference
🔢 BitNet Inference · arxiv.org · 6d
Shape of Memory: a Geometric Analysis of Machine Unlearning in Second-Order Optimizers
🔍 AI Interpretability · arxiv.org · 5d
Salca: A Sparsity-Aware Hardware Accelerator for Efficient Long-Context Attention Decoding
⚡ Hardware Acceleration · arxiv.org · 4d
Sparse-on-Dense: Area and Energy-Efficient Computing of Sparse Neural Networks on Dense Matrix Multiplication Accelerators
⚡ Hardware Acceleration · arxiv.org · 3d
FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow
🏗️ LLM Infrastructure · arxiv.org · 3d
Fix Initial Codes and Iteratively Refine Textual Directions Toward Safe Multi-Turn Code Correction
🗂️ Code Indexing · arxiv.org · 5d
PortraVec: Image-Based Portrait Vectorization with Text-Guided Manipulation
🚀 Astral · arxiv.org · 4d
Optimas: An Intelligent Analytics-Informed Generative AI Framework for Performance Optimization
🏗️ LLM Infrastructure · arxiv.org · 5d
Betting for Sim-to-Real Performance Evaluation
🏆 LLM Benchmarking · arxiv.org · 5d
Revealing NVIDIA Closed-Source Driver Command Streams for CPU-GPU Runtime Behavior Insight
🕯️ Candle ML · arxiv.org · 3d · Hacker News
Discriminator-Guided Adaptive Diffusion for Source-Free Test-Time Adaptation under Image Corruptions
0 Binary Vector Embeddings · arxiv.org · 5d
Transformer-Based Rhythm Quantization of Performance MIDI Using Beat Annotations
🔤 Tokenization · arxiv.org · 6d
GICC: A High-Performance Runtime for GPU-Initiated Communication and Coordination in Modern HPC Systems
⚡ Glommio · arxiv.org · 6d
A Systematic Post-Train Framework for Video Generation
📦 Batch Embeddings · arxiv.org · 4d
CoRE: A Fine-Grained Code Reasoning Benchmark Beyond Output Prediction
🛠️ Build Optimization · arxiv.org · 4d