Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🤖 Transformers
Specific
transformer model, attention mechanism, BERT, GPT architecture
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
65
posts in
9.7
ms
MiniGPT: Rebuilding
GPT
from First Principles
🏗️
System Design
arxiv.org
·
2d
A primer on how large language
model
works
🧠
LLM Training
mayijie.substack.com
·
4d
·
Substack
KV Cache and Flash
Attention
with interactive diagrams
🏗️
System Design
kvcache.cobanov.dev
·
9h
·
Hacker News
How LLM Inference Works
🧠
LLM Training
arpitbhayani.me
·
6d
·
Hacker News
LLM Wiki app Chunker –
transform
documents into navigable knowledge trees
🧠
LLM Training
github.com
·
1d
·
Hacker News
Digital twin-driven fault diagnosis of power substations by
multi-modal
fusion learning
🖧
Distributed Systems
nature.com
·
1d
Vision-Language-Action
Models
Arrive
🕵️
AI Agents
semiengineering.com
·
6d
HSTU: How Meta Built a Trillion-Parameter Recommender That Actually
Scales
🔍
RAG
mlwhiz.com
·
2d
Benchmarking Subquadratic's latest
model
and SSA Kernel
🏗️
System Design
appen.com
·
6d
·
Hacker News
Understanding KV Cache: The Hidden Memory Cost of Serving LLMs
🏗️
System Design
melchi.me
·
1d
·
Hacker News
Show HN: The Name in the Bracket (a free book on naming tensor dimensions)
💻
Bioinformatics
einlang.github.io
·
2d
·
Hacker News
ViT Overfits Small Datasets: When CNNs Win by 18% mAP
🔍
RAG
tildalice.io
·
5d
Running PyTorch
Models
on Apple Silicon GPUs with the ExecuTorch MLX Delegate
🧠
LLM Training
pytorch.org
·
2d
·
Hacker News
Multiplex
networks-based
directed graph neural network for cancer driver gene identification
🧬
Genomics
journals.plos.org
·
6d
Towards local plug-and-play AI
🧠
LLM Training
adlrocha.substack.com
·
3d
·
Substack
From Sparsity to Simplicity: Enabling Simpler Sequential Replacements via Sparse
Attention
Distillation
🔍
RAG
arxiv.org
·
1d
Aether Mind – on-chain neural cognitive engine on a quantum-VQE L1
⚛️
Quantum Computing
huggingface.co
·
5d
·
Hacker News
Spin Lattices and Proteins – How state-based discretisations have enabled
modern
protein
modelling
💻
Bioinformatics
blopig.com
·
6d
sapientinc/HRM-Text: HRM-Text is a 1B text generation
model
based on the HRM
architecture
, strengthened by task completion and latent space reasoning.
🧠
LLM Training
github.com
·
1d
·
r/singularity
Less-relevant results
Market Trend Insights: The Impact of Recent Innovations on the Machine Translation Market
🧠
LLM Training
openpr.com
·
5d
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help