Robuta

https://artificialanalysis.ai/evaluations/critpt
CritPt Benchmark Leaderboard | Artificial Analysis
Compare AI model performance on CritPt Benchmark Leaderboard. A benchmark designed to test LLMs on research-level physics reasoning tasks, featuring 71...