https://allenai.org/evaluation-frameworks
Evaluation frameworks | Ai2
A collection of Ai2’s evaluation frameworks and benchmarks, open and accessible to compare like-for-like outcomes.
evaluation frameworks, ai2
https://boston.qcon.ai/presentation/boston2026/building-reusable-evaluation-frameworks-agentic-ai-products
QCon AI Boston 2026 | Building Reusable Evaluation Frameworks for Agentic AI Products
This talk covers methods of evaluating AI Agents, with an example of how we built evaluation frameworks for a user-facing AI Agent system that has been in...
qcon ai boston, evaluation frameworks, building, reusable, agentic