Abstract page for arXiv paper 2203.15556: Training Compute-Optimal Large Language Models
Press ? anytime to show this help