https://artificialanalysis.ai/evaluations/critpt
CritPt Benchmark Leaderboard | Artificial Analysis
Compare AI model performance on CritPt Benchmark Leaderboard. A benchmark designed to test LLMs on research-level physics reasoning tasks, featuring 71...
critptbenchmarkleaderboardartificialanalysis