Context Windows

Feeds to Scour
SubscribedAll
Scoured 483 posts in 14.9 ms

Tangram: Unlocking Non-Uniform KV Cache for Efficient Multi-turn LLM Serving

 🤖LLM  Content type: Academic
arxiv.org··Hacker News

SparDA: Sparse Decoupled Attention for Efficient Long-Context LLM Inference

 🤖LLM  Content type: Academic
arxiv.org·

You Only Index Once: Cross-Layer Sparse Attention with Shared Routing

 ✍️Prompt Engineering  Content type: Academic
arxiv.org·

Signed Dual Attention: Capturing Signed Dependencies in Time Series Forecasting

 📝NLP  Content type: Academic
arxiv.org·

Rethinking LoRA Memory Through the Lens of KV Cache Compression

 🦙Ollama  Content type: Academic
arxiv.org·

Cartridges at Scale: Training Modular KV Caches over Large Document Collections

 💬LLMs  Content type: Academic
arxiv.org·

Where does Absolute Position come from in decoder-only Transformers?

 🛡️AI Security  Content type: Academic
arxiv.org·

GRAMformer: Any-Order Modality Interactions via Volumetric Multimodal Cross-Attention

 💬LLMs  Content type: Academic
arxiv.org·

Transformer-Enhanced Reinforcement Learning: Fundamentals and Applications in Communication Networks

 💬LLMs  Content type: Academic
arxiv.org·

Selective Coupling of Decoupled Informative Regions: Masked Attention Alignment for Data-Free Quantization of Vision Transformers

 💬LLMs  Content type: Academic
arxiv.org·

Do Transformers Need Three Projections? Systematic Study of QKV Variants

 📱Edge AI  Content type: Academic
arxiv.org··Hacker News

Dense Contexts Are Hard Contexts: Lexical Density Limits Effective Context in LLMs

 🤖LLM  Content type: Academic
arxiv.org··Hacker News

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help