Training Compute-Optimal Large Language Models
paperium.net·5d·
Discuss: DEV
📊Columnar Engines
Preview
Report Post

Artificial Intelligence

arXiv

Paperium

Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego de Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katie Millican, George van den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Jack W. Rae, Oriol Vinyals, Laurent Sifre

29 Mar 2022 • 3 min read

Training Compute-Optimal Large Language Models

AI-generated image, based on the article abstract

Quick Insight

Better AI Comes From More Data, Not Just Bigger Models

Many big AI systems got bigger while using abou…

Similar Posts

Loading similar posts...