Andrej Karpathy (@karpathy) 4K likes

x.com·4w·

Discuss: X

Preview

TLDR: we can train compute optimal miniseries and relate them to GPT-2/3 via objective CORE scores, but further improvements are desirable and needed. E.g., matching GPT-2 currently needs ~$500, but imo should be possible to do <$100 with more work.

Similar Posts