evals, benchmarking, MMLU, model performance, evaluation metrics
No high-quality results found.
No more posts from gruggiero's subscribed feeds.
Press ? anytime to show this help