gilesthomas.com: 8 posts in the last 30 days
Writing an LLM from scratch, part 32l -- Interventions: updated instruction fine-tuning results
gilesthomas.com · 11h · Hacker News
How an LLM becomes more coherent as we train it
gilesthomas.com · 3d · Hacker News
Writing an LLM from scratch, part 32k -- Interventions: training a better model locally with gradient accumulation
gilesthomas.com · 5d · Hacker News
Writing an LLM from scratch, part 32j -- Interventions: trying to train a better model in the cloud
gilesthomas.com · 1w · Hacker News
Writing an LLM from scratch, part 32i -- Interventions: what is in the noise?
gilesthomas.com · 1w · Hacker News
Writing an LLM from scratch, part 32h – Interventions: full fat float32
gilesthomas.com · 2w · Hacker News
Automating starting Lambda Labs instances
gilesthomas.com · 2w · Hacker News
Writing an LLM from scratch, part 32g – Interventions: weight tying
gilesthomas.com · 3w · Hacker News