DEV Community

The Man Who Summoned Ghosts | Chapter 2: The Training Stack Is Not a Secret (opens in new tab)

Discussed on DEV

From OpenAI to nanoGPT: why the training stack should feel legible, not magical. Originally published on Lei Hua's Substack. Anchors: 2023-05-23 · State of GPT @ Microsoft Build · 2023-11-23 · [1hr Talk] Intro to Large Language Models · Epigraph "99% of the compute is in pretraining. ... For applications, you want low-stakes things, with humans in the loop. Treat these models like cognitive interns." — Andrej Karpathy, State of GPT · 2023-05 The Return In the last two months of 2022, three th...

Read the original article
Sign in to keep reading the full article.

Keyboard Shortcuts

Navigation

Next / previous post
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Discover
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help