Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Transformers
⚡ Transformers
Specific
Attention Mechanism, BERT, GPT, Sequence Models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
107
posts in
5.2
ms
harshuljain13/llm-inference-at-scale
: A Practitioner handbook for production
llm
serving.
🤖
AI
Content type:
Code
github.com
·
4d
4 days ago
·
Hacker News
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
know the mother tongue of your LLMs
💬
LLMs
mothertoken.inigoimaz.com
·
1d
1 day ago
·
Hacker News
Actions for know the mother tongue of your LLMs
Meta-Attention
: Teaching
Models
When Not to Answer
🤖
AI
hackernoon.com
·
16h
16 hours ago
Actions for Meta-Attention: Teaching Models When Not to Answer
Causal Semantic Alignment for
LLM-based
Time Series Forecasting
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Causal Semantic Alignment for LLM-based Time Series Forecasting
Measuring Embedding Drift: Why Hybrid Search Saves Stale
Models
.
🎯
Fine-Tuning
pub.towardsai.net
·
17h
17 hours ago
Actions for Measuring Embedding Drift: Why Hybrid Search Saves Stale Models.
The Edge
LLM
Offload Story
🤖
AI
semiengineering.com
·
6d
6 days ago
Actions for The Edge LLM Offload Story
Less-relevant results
What Does Abliteration Actually Cost?
🤖
AI
lesswrong.com
·
5d
5 days ago
Actions for What Does Abliteration Actually Cost?
Show HN:
LLM
memory without context bleed; 100% precision vs. <10% vector search
🤖
AI
tenureai.dev
·
5d
5 days ago
·
Hacker News
,
Hacker News
Actions for Show HN: LLM memory without context bleed; 100% precision vs. <10% vector search
SafeRun: Enabling Determinism in
LLM
Planning for Running
🤖
AI
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for SafeRun: Enabling Determinism in LLM Planning for Running
nex-agi/Nex-N2-mini •
Huggingface
🤖
AI
huggingface.co
·
6d
6 days ago
·
r/LocalLLaMA
Actions for nex-agi/Nex-N2-mini • Huggingface
LangChain Series #2:
Models
Explained — LLMs, Chat
Models
, and Embeddings with Practical…
🤖
AI
pub.towardsai.net
·
1d
1 day ago
Actions for LangChain Series #2: Models Explained — LLMs, Chat Models, and Embeddings with Practical…
defai-digital/ax-engine: Apple Silicon
LLM
runtime supporting Gemma 4 and Qwen 3.6 MTP
modes
🤖
AI
Content type:
Code
github.com
·
21h
21 hours ago
·
Hacker News
Actions for defai-digital/ax-engine: Apple Silicon LLM runtime supporting Gemma 4 and Qwen 3.6 MTP modes
Beyond Patches: Superpixel Token-based
Transformers
for Attribute-Specific Fashion Retrieval
🤖
AI
Content type:
Academic
arxiv.org
·
18h
18 hours ago
Actions for Beyond Patches: Superpixel Token-based Transformers for Attribute-Specific Fashion Retrieval
Reachability and asymptotics of Gaussian
Transformer
dynamics
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Reachability and asymptotics of Gaussian Transformer dynamics
Transformer
Based
Model
for Spatiotemporal Feature Learning in EEG Emotion Recognition
🧮
Complexity Theory
Content type:
Academic
arxiv.org
·
18h
18 hours ago
Actions for Transformer Based Model for Spatiotemporal Feature Learning in EEG Emotion Recognition
Contribution Weights: A Geometrical Analysis of
Self-Attention
Transformers
💬
LLMs
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Contribution Weights: A Geometrical Analysis of Self-Attention Transformers
A Mean-Field Analysis of Multi-Head
Self-Attention
under Cross-Entropy Training
📈
Optimization
Content type:
Academic
arxiv.org
·
18h
18 hours ago
Actions for A Mean-Field Analysis of Multi-Head Self-Attention under Cross-Entropy Training
Post-training
is (Massive) Supervised Learning
🎛️
Fine-tuning
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Post-training is (Massive) Supervised Learning
Parallel Causal Associative Fields: Gated Sparse Memory for Long-Context
Language
Modeling
⚡
Hardware Acceleration
Content type:
Academic
arxiv.org
·
18h
18 hours ago
Actions for Parallel Causal Associative Fields: Gated Sparse Memory for Long-Context Language Modeling
Uncertainty-Aware
LLM-Guided
Policy Shaping for Sparse-Reward Reinforcement Learning
🎮
Reinforcement Learning
Content type:
Academic
arxiv.org
·
2d
2 days ago
Actions for Uncertainty-Aware LLM-Guided Policy Shaping for Sparse-Reward Reinforcement Learning
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help