https://www.comet.com/site/products/opik/
Open-Source LLM Evaluation Platform | Opik by Comet
Nov 4, 2025 - Opik is an end-to-end LLM evaluation platform designed to help AI developers test, ship, and continuously improve LLM-powered applications.
open source llmevaluationopik
https://arize.com/
LLM Observability & Evaluation Platform
Unified LLM Observability and Agent Evaluation Platform for AI Applications—from development to production.
llm observability evaluation
https://openmark.ai/why
Why Benchmark AI Models? | OpenMark - LLM Evaluation Platform
Mar 9, 2026 - Compare AI model pricing and performance. Benchmark LLMs on YOUR task — not generic tests. Find the best AI model with real API costs, deterministic scoring,...
llm evaluation platformai
https://langwatch.ai/
LangWatch: AI Agent Testing and LLM Evaluation Platform
LangWatch is an AI agent testing, LLM evaluation, and LLM observability platform. Test agents with simulated users, prevent regressions, and debug issues.
llm evaluation platformai