Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
LLMs
馃 LLMs
Specific
Large Language Models, GPT, Transformers
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
482
posts in
6.8
ms
Alignment Collapse Under
KV
Cache
Quantization: Diagnosis and Mitigation
聽
馃
LLM
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for Alignment Collapse Under KV Cache Quantization: Diagnosis and Mitigation
Reachability and asymptotics of Gaussian
Transformer
dynamics
聽
馃
LLM
聽
Content type:
Academic
arxiv.org
路
2d
2 days ago
Actions for Reachability and asymptotics of Gaussian Transformer dynamics
LLM-Based
Code Documentation Generation and Multi-Judge Evaluation
聽
馃
LLM
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for LLM-Based Code Documentation Generation and Multi-Judge Evaluation
The Order Matters: Sequential
Fine-Tuning
of
LLaMA
for Coherent Automated Essay Scoring
聽
馃
LLM
聽
Content type:
Academic
arxiv.org
路
1d
1 day ago
Actions for The Order Matters: Sequential Fine-Tuning of LLaMA for Coherent Automated Essay Scoring
A retrieval conditioned rebinding circuit for dynamic entity tracking in
large
language
models
聽
馃
LLM
聽
Content type:
Academic
arxiv.org
路
2d
2 days ago
Actions for A retrieval conditioned rebinding circuit for dynamic entity tracking in large language models
RedKnot: Efficient Long-Context
LLM
Serving with Head-Aware
KV
Reuse and SegPagedAttention
聽
馃
LLM
聽
Content type:
Academic
arxiv.org
路
6d
6 days ago
Actions for RedKnot: Efficient Long-Context LLM Serving with Head-Aware KV Reuse and SegPagedAttention
YouZhi: Towards High-Concurrency
Financial
LLMs
via Adaptive GQA-to-MLA Transition
聽
馃
AI Tools
聽
Content type:
Academic
arxiv.org
路
6d
6 days ago
Actions for YouZhi: Towards High-Concurrency Financial LLMs via Adaptive GQA-to-MLA Transition
Tangram: Unlocking Non-Uniform
KV
Cache
for Efficient Multi-turn
LLM
Serving
聽
馃捇
Operating Systems
聽
Content type:
Academic
arxiv.org
路
6d
6 days ago
路
Hacker News
Actions for Tangram: Unlocking Non-Uniform KV Cache for Efficient Multi-turn LLM Serving
Minimizing the Hidden Cost of Scales: Graph-Guided Ultra-Low-Bit Quantization for
Large
Language
Models
聽
馃
LLM
聽
Content type:
Academic
arxiv.org
路
6d
6 days ago
Actions for Minimizing the Hidden Cost of Scales: Graph-Guided Ultra-Low-Bit Quantization for Large Language Models
LLMCodec
: Adapting Video Codecs for Efficient Weight Compression of
Large
Language
Models
聽
馃挰
Natural Language Processing
聽
Content type:
Academic
arxiv.org
路
6d
6 days ago
Actions for LLMCodec: Adapting Video Codecs for Efficient Weight Compression of Large Language Models
SigmaScale:
LLM
Compression with SVD-based Low-Rank Decomposition and Learned Scaling Matrices
聽
馃
LLM
聽
Content type:
Academic
arxiv.org
路
3d
3 days ago
Actions for SigmaScale: LLM Compression with SVD-based Low-Rank Decomposition and Learned Scaling Matrices
Empirical Evaluation of
Large
Language
Models
for Migration of Code Fragments to Post-Quantum Cryptography
聽
馃
LLM
聽
Content type:
Academic
arxiv.org
路
3d
3 days ago
Actions for Empirical Evaluation of Large Language Models for Migration of Code Fragments to Post-Quantum Cryptography
« Page 2
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help