Robuta

https://docs.cotool.ai/api-reference/agents/get-llm-evaluation-metrics
Get LLM evaluation metrics - Cotool Documentation
Retrieve system LLM evaluation metrics for an agent (default evaluator: llm-judge).

https://www.sarvam.ai/blogs/evaluating-indian-language-asr
Indic ASR evaluation: beyond WER to LLM & semantic metrics | Sarvam AI
Why WER/CER misjudge Indian-language ASR when scripts mix and spellings vary. Covers LLM-WER, LLM-CER, Intent, Entity, and COMET, plus open evaluation tooling.

https://deepeval.com/guides/guides-ai-agent-evaluation-metrics
AI Agent Evaluation Metrics | DeepEval by Confident AI - The LLM Evaluation Framework
AI agent evaluation metrics are purpose-built measurements that assess how well autonomous LLM systems reason, plan, execute tools, and complete tasks. Unlike…