I made a general AI benchmark

Discussed on r/LLM

A structured benchmark for evaluating AI models on arithmetic, logic, geography, science, history, and world knowledge. The questions are open source, and a leaderboard is included.
