Transformers

Feeds to Scour
SubscribedAll
Scoured 77 posts in 13.1 ms

Transformer-Enhanced Reinforcement Learning: Fundamentals and Applications in Communication Networks

馃幃Reinforcement LearningContent type: Academic
arxiv.org

SpikeDecoder: Realizing the GPT Architecture with Spiking Neural Networks

馃AI ResearchContent type: Academic
arxiv.org

Chiaroscuro Attention: Spending Compute in the Dark

馃搲Deep LearningContent type: Academic
arxiv.org

PT-WNO: Point Transformer with Wavelet Neural Operator for 3D Point Cloud Semantic Segmentation

馃搻Scaling LawsContent type: Academic
arxiv.org

Look Less, Reason More: Block-wise Attention Skipping for Efficient Multimodal LLMs

馃挰LLMsContent type: Academic
arxiv.org

Towards Tight Bounds for Streaming Attention

馃AI ResearchContent type: Academic
arxiv.org

UR-BERT: Scaling Text Encoders for Massively Multilingual TTS Through Universal Romanization and Speech Token Prediction

馃挰LLMsContent type: Academic
arxiv.org

LiteVSR: Lightweight Adaptation of Frozen Diffusion Transformers for Video Super-Resolution

鈿欙笍Model TrainingContent type: Academic
arxiv.org

Kuramoto Attention: Synchronizing Self-Attention on the Torus

馃搻Scaling LawsContent type: Academic
arxiv.org

ATT-CR: Adaptive Triangular Transformer for Cloud Removal

馃搻Scaling LawsContent type: Academic
arxiv.org

TextEconomizer: Enhancing Lossy Text Compression with Denoising Transformers and Entropy Coding

馃挰LLMsContent type: Academic
arxiv.org

SPADE: Split-and-Delay Embeddings for Autoregressive High-Granularity Calorimeter Simulation

馃AI ResearchContent type: Academic
arxiv.org

From Architecture to Output: Structural Origins of Hallucination in Large Language Models and the Amplifying Role of Data

馃挰LLMsContent type: Academic
arxiv.org

LifeSentence: Language models can encode human life course trajectories from longitudinal panel data

馃AI ResearchContent type: Academic
arxiv.org

Gated Bidirectional Linear Attention for Generative Retrieval

馃挰LLMsContent type: Academic
arxiv.org

RePAIR: Predictive Self-Supervised Representation Learning in Chess

馃幃Reinforcement LearningContent type: Academic
arxiv.org

End-to-End Context Compression at Scale

鈿欙笍Model TrainingContent type: Academic
arxiv.org

Architecture-Aware Reinforcement Learning Makes Sliding-Window Attention Competitive in Math Reasoning

鈿欙笍Model TrainingContent type: Academic
arxiv.org

GRAMformer: Any-Order Modality Interactions via Volumetric Multimodal Cross-Attention

馃AI ResearchContent type: Academic
arxiv.org

NGram-MoSE: Efficient Remote Sensing Super-Resolution via N-Gram Context and Mixture-of-Experts

馃AI ResearchContent type: Academic
arxiv.org

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help