evals, benchmarking, MMLU, model performance, evaluation metrics
No more posts from gruggiero's subscribed feeds.
Press ? anytime to show this help