Model optimizations in LLMs
STAR-KV: Low-Rank KV Cache Compression via Soft Thresholding for Adaptive Rank Control
🔧 Systems-level optimizations for LLM serving
Content type: Academic · arxiv.org · 2 days ago
Beyond Output Matching: Preserving Internal Geometry in NVFP4 LLM Distillatio
🔢 Quantization of LLMs
Content type: Academic · arxiv.org · 6 days ago
LLMCodec: Adapting Video Codecs for Efficient Weight Compression of Large Language Models
🧠 Large Language Models (LLMs)
Content type: Academic · arxiv.org · 6 days ago
BenDi: An Energy-Efficient Quasi-Stochastic Systolic Architecture for Edge Bioelectronics
📊 AI Performance Profiling
Content type: Academic · arxiv.org · 18 hours ago
FAIR-Calib: Frontier-Aware Instability-Reweighted Calibration for Post-Training Quantization of Diffusion Large Language Models
🧠 Large Language Models (LLMs)
Content type: Academic · arxiv.org · 3 days ago
SNN-MLIR: An MLIR Dialect for Compiling Neuromorphic SNNs from NIR to Bare-Metal C
📊 AI Performance Profiling
Content type: Academic · arxiv.org · 2 days ago
Semantic Grading of Written Answers in Low-Resource Language Bangla Using a Fine-Tuned Lightweight Language Model
🔢 Quantization of LLMs
Content type: Academic · arxiv.org · 18 hours ago
ColBERTSaR: Sparsified ColBERT Index via Product Quantization
🔍 Retrieval-augmented generation
Content type: Academic · arxiv.org · 6 days ago
SpectrumKV: Per-Token Mixed-Precision KV Cache Transfer for Prefill-Decode Disaggregated LLM Serving
🧠 Large Language Models (LLMs)
Content type: Academic · arxiv.org · 2 days ago
MOTOR: Learning ID-free Item Representation with Token Crossing for Embedding-based Multimodal Recommendation
🧠 Large Language Models (LLMs)
Content type: Academic · arxiv.org · 18 hours ago
Automated IEP Generation from Traditional Chinese Parent-Teacher Interviews via Corpus-Grounded Feature Diffusion
🧠 Large Language Models (LLMs)
Content type: Academic · arxiv.org · 2 days ago
Beyond Generative Decoding: Discriminative Hidden-State Readout from a Native Omni-Modal LLM for Multimodal Sentiment Analysis
🔢 Quantization of LLMs
Content type: Academic · arxiv.org · 6 days ago
Benchmarking Neural Speech Compression from a Rate-Distortion Perspective
🧠 Large Language Models (LLMs)
Content type: Academic · arxiv.org · 18 hours ago
EditSSC: Toward Editable Semantic Occupancy Scenes with Unconditional Diffusion Models
🔍 Retrieval-augmented generation
Content type: Academic · arxiv.org · 2 days ago
Large-scale empirical tuning and comparison of default optimizers for variational inference
🧠 Large Language Models (LLMs)
Content type: Academic · arxiv.org · 2 days ago
RedditPersona: A Modular Framework for Community-Conditioned LLM Adaptation from Reddit
🔢 Quantization of LLMs
Content type: Academic · arxiv.org · 6 days ago
Next-Token Prediction Learns Generalisable Representations of Sleep Physiology
🧠 Large Language Models (LLMs)
Content type: Academic · arxiv.org · 2 days ago
EgoPressDiff: Multimodal Video Diffusion for Egocentric UV-Domain Hand-Pressure Estimation
⚡ Real-time AI Systems
Content type: Academic · arxiv.org · 3 days ago
BioVid: Autoregressive Video Generation with Biological Behavior Semantic Comprehension
🧠 Large Language Models (LLMs)
Content type: Academic · arxiv.org · 2 days ago
Learned Subspace Compression for Communication-Efficient Pipeline Parallelism
🌐 Distributed LLM Systems
Content type: Academic · arxiv.org · 6 days ago