Robuta

https://arxiv.org/abs/2311.12022?utm_campaign=The%20Batch&utm_medium=email&_hsenc=p2ANqtz-9A9LbM28bl3zZPLZlodjJ1cVW0bpWxauKjf8eysSahOf2LRUSH79fD10xcPEGB2LYIRASrjuCQQBBWGwCmwtjPyXoTde-EwqgpmnmEhSy_f3p_Cgs&_hsmi=353823758&utm_content=353823758&utm_source=hs_email
Abstract page for arXiv paper 2311.12022: GPQA: A Graduate-Level Google-Proof Q&A Benchmark
gpqagraduatelevelgoogleproof
https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct/discussions/332
This PR adds community-provided evaluation results for the following benchmarks:
meta llamaadd communityinstruct