Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🤖 Transformers
Specific
Attention Mechanism, BERT, GPT, Language Models
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
7304
posts in
8.4
ms
Short Data, Long Context:
Distilling
Positional
Knowledge in Transformers
🗣️
Large Language Models
arxiv.org
·
2d
milanm/AutoGrad-Engine
: A complete GPT language model (training and inference) in ~600 lines of pure C#, zero dependencies
🗣️
Large Language Models
github.com
·
20h
·
Hacker News
The Hidden
Auditory
Knowledge
Inside Language Models
🗣️
Large Language Models
hackernoon.com
·
5d
Claude
Mythos
: The System Card
🗣️
Large Language Models
lesswrong.com
·
13h
Coding agents for old people
🗣️
Large Language Models
blog.tasuki.org
·
2d
·
Hacker News
LLM
inference
engine from
scratch
in C++
🗣️
Large Language Models
anirudhsathiya.com
·
4d
·
Hacker News
Show HN: We built the "LLM knowledge base"
Karpathy
described 9
yrs
ago
🗣️
Large Language Models
mythos.one
·
14h
·
Hacker News
SNN
brain-inspired gen-AI in C/C#, no external AI
libs
could be promising?
🗣️
Large Language Models
news.ycombinator.com
·
1d
·
Hacker News
DSPy
: Programming – Not
Prompting
🗣️
Large Language Models
akashtandon.in
·
1d
·
Hacker News
The
pinnacle
of
enshittification
, or Large Language Models
🗣️
Large Language Models
blogs.gentoo.org
·
4d
·
Lobsters
,
Hacker News
The Learning Firm Under
Poverty
of
Stimulus
🗣️
Large Language Models
jimiwen.substack.com
·
1d
·
Substack
Verbosity
decreases
accuracy in large language models
🗣️
Large Language Models
unite.ai
·
5d
·
Hacker News
How to
Generate
Text in One Step
∂
Automatic Differentiation
one-step-lm.github.io
·
2d
·
Hacker News
GLM-5.1
: Towards
Long-Horizon
Tasks
🗣️
Large Language Models
simonwillison.net
·
2d
·
Hacker News
Thoughts
on Large Language Models (2023)
🗣️
Large Language Models
nikola.plejic.com
·
4d
·
Hacker News
Loop, Think, &
Generalize
: Implicit Reasoning in
Recurrent-Depth
Transformers
🗣️
Large Language Models
arxiv.org
·
7h
Darwin
V6:
Diagnostic-Guided
Evolutionary Model Merging
🗣️
Large Language Models
huggingface.co
·
2d
·
Hacker News
DeepFocus-BP
: Error-Aware Adaptive Backpropagation via Dynamic Alpha-Beta Routing (Achieving 66% FLOPs Reduction with Improved Accuracy) - SOTA NLP Confirmed v3. (
Resnet
FAIL)
🧠
Neural Networks
zenodo.org
·
5d
·
Hacker News
Reading Note:
Mamba-3
and the State Space Model
Renaissance
∂
Automatic Differentiation
ngrislain.github.io
·
2d
·
Hacker News
ikermoel/VRS-Void-Rescue-System
: A new geometric rescue method for regression-based token prediction in transformers
🗣️
Large Language Models
github.com
·
14h
·
Hacker News
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help