scaling laws, compute optimal, chinchilla, model size
No more posts from Bingran's subscribed feeds.
Press ? anytime to show this help