🔄 SIMD Programming
Specific
AVX512, Vector Instructions, Loop Unrolling, Auto-vectorization
Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling
🔤 Tokenization
arxiv.org · 2d
[2212.08153] FiDO: Fusion-in-Decoder optimized for stronger performance and faster inference
🧩 MoE
arxiv.org · 1d
WAVE-N Specialized Video Processing NPU for Edge AI Systems
📱 Edge AI Optimization
semiwiki.com · 4d
Fast-Vollib: A Fast Implied Volatility Library for Python with PyTorch, JAX, and CUDA Fused-Kernel Backends
📊 Model Serving Economics
arxiv.org · 2d
SCOPE-FE: Structured Control of Operator and Pairwise Exploration for Feature Engineering
⚡ PGO
arxiv.org · 2d
Step-level Optimization for Efficient Computer-use Agents
🔧 Agent Tooling
arxiv.org · 2d
MEMCoder: Multi-dimensional Evolving Memory for Private-Library-Oriented Code Generation
🔄 Cache Coherence
arxiv.org · 5d
RaMP: Runtime-Aware Megakernel Polymorphism for Mixture-of-Experts
🧩 MoE
arxiv.org · 3d
Lightweight Real-Time Rendering Parameter Optimization via XGBoost-Driven Lookup Tables
📦 Batch Embeddings
arxiv.org · 4d
Hybrid JIT-CUDA Graph Optimization for Low-Latency Large Language Model Inference
🏗️ LLM Infrastructure
arxiv.org · 5d
Shape of Memory: a Geometric Analysis of Machine Unlearning in Second-Order Optimizers
🔍 AI Interpretability
arxiv.org · 5d
SpikingBrain2.0: Brain-Inspired Foundation Models for Efficient Long-Context and Cross-Platform Inference
🔢 BitNet Inference
arxiv.org · 6d
Sparse-on-Dense: Area and Energy-Efficient Computing of Sparse Neural Networks on Dense Matrix Multiplication Accelerators
⚡ Hardware Acceleration
arxiv.org · 3d
FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow
🏗️ LLM Infrastructure
arxiv.org · 3d
Salca: A Sparsity-Aware Hardware Accelerator for Efficient Long-Context Attention Decoding
⚡ Hardware Acceleration
arxiv.org · 4d
Fix Initial Codes and Iteratively Refine Textual Directions Toward Safe Multi-Turn Code Correction
🗂️ Code Indexing
arxiv.org · 5d
PortraVec: Image-Based Portrait Vectorization with Text-Guided Manipulation
🚀 Astral
arxiv.org · 4d
Optimas: An Intelligent Analytics-Informed Generative AI Framework for Performance Optimization
🏗️ LLM Infrastructure
arxiv.org · 5d
Betting for Sim-to-Real Performance Evaluation
🏆 LLM Benchmarking
arxiv.org · 5d
Discriminator-Guided Adaptive Diffusion for Source-Free Test-Time Adaptation under Image Corruptions
0 Binary Vector Embeddings
arxiv.org · 5d