https://huggingface.co/Qwen/Qwen3.6-35B-A3B/discussions/3
Qwen/Qwen3.6-35B-A3B · Add community evaluation results for AIME_2026, GPQA, HLE, HMMT_FEB_2026,...
This PR adds community-provided evaluation results for the following benchmarks:
qwenaddcommunityevaluationresults
https://huggingface.co/moonshotai/Kimi-K2.6/blob/main/.eval_results/gpqa_diamond.yaml
.eval_results/gpqa_diamond.yaml · moonshotai/Kimi-K2.6 at main
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
eval resultskimi k2gpqadiamondyaml
https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/discussions/110
deepseek-ai/DeepSeek-V4-Pro · Add community evaluation results for GPQA, GSM8K, HLE, MMLU-PRO,...
This PR adds community-provided evaluation results for the following benchmarks:
deepseek aiv4proaddcommunity
https://huggingface.co/Qwen/Qwen3.6-27B/discussions/2
Qwen/Qwen3.6-27B · Add community evaluation results for AIME_2026, GPQA, HLE, HMMT_FEB_2026,...
This PR adds community-provided evaluation results for the following benchmarks:
qwenaddcommunityevaluationresults
https://huggingface.co/datasets/Idavidrein/gpqa
Idavidrein/gpqa · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
hugging facegpqadatasets