Sponsor of the Day:
Jerkmate
https://huggingface.co/datasets/CohereLabs/Global-MMLU-Lite
CohereLabs/Global-MMLU-Lite · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
hugging facecoherelabsglobalmmlulite
https://huggingface.co/datasets/TIGER-Lab/MMLU-Pro
TIGER-Lab/MMLU-Pro · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
hugging facetigerlabmmlupro
https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/discussions/110
deepseek-ai/DeepSeek-V4-Pro · Add community evaluation results for GPQA, GSM8K, HLE, MMLU-PRO,...
This PR adds community-provided evaluation results for the following benchmarks:
deepseek ai v4evaluation resultsproaddcommunity