Transformers

harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.

 🤖AI  Content type: Code
github.com··Hacker News

Audio-Visual Exchange-Aware Token Pruning for Efficient Audio-Visual Captioning

 👁️Computer Vision  Content type: Academic
arxiv.org·

Contribution Weights: A Geometrical Analysis of Self-Attention Transformers

 💬LLMs  Content type: Academic
arxiv.org·

Operator Fusion for LLM Inference on the Tensix Architecture

 🤖Machine Learning  Content type: Academic
arxiv.org·

Post-training is (Massive) Supervised Learning

 🎛️Fine-tuning  Content type: Academic
arxiv.org·

mingusb/transformer-golf: The Fully Unrolled Transformer: An experimental repository for architecture simplification and compilation. [2026]

 🧠Neural Networks  Content type: Code
github.com··Hacker News

DUET -- Dual User Embedding Transformers for Offsite Conversion Prediction

 🤖Machine Learning  Content type: Academic
arxiv.org·

Early Comparative Evaluation of Transformer Models for Multilingual Software Vulnerability Detection

 💬LLMs  Content type: Academic
arxiv.org·

Uncertainty-Aware LLM-Guided Policy Shaping for Sparse-Reward Reinforcement Learning

 🎮Reinforcement Learning  Content type: Academic
arxiv.org·

mikinko/HuggingFace_WFX: Total Commander WFX plugin for HuggingFace repos

 🤖AI  Content type: Code

Beyond Patches: Superpixel Token-based Transformers for Attribute-Specific Fashion Retrieval

 🤖AI  Content type: Academic
arxiv.org·

Chiaroscuro Attention: Spending Compute in the Dark

 Flash Attention  Content type: Academic
arxiv.org·

Transformer Based Model for Spatiotemporal Feature Learning in EEG Emotion Recognition

 🧮Complexity Theory  Content type: Academic
arxiv.org·

tenurehq/precisionMemBench: Precision-aware retrieval benchmark for LLM memory systems.

 🤖AI  Content type: Code
github.com··Hacker News

Look Less, Reason More: Block-wise Attention Skipping for Efficient Multimodal LLMs

 👁️Computer Vision  Content type: Academic
arxiv.org·

InA-Probe: Instruction-Aware Active Probing for Time Series Forecasting with LLMs

 📈Time Series Analysis  Content type: Academic
arxiv.org·

FuseFSS: Efficient Secure LLM Inference with Function Secret Sharing

 💬LLMs  Content type: Academic
arxiv.org·

When Vision Misleads, Let Location Speak: A Worldwide Image Geo-Localization Method via Location Attention Mechanism and Large Multimodal Models

 🤖AI  Content type: Academic
arxiv.org·

Inside the LLM Word Factory

 💬Natural Language Processing  Content type: Academic
arxiv.org·

TextEconomizer: Enhancing Lossy Text Compression with Denoising Transformers and Entropy Coding

 🤖AI  Content type: Academic
arxiv.org·
