Review gate

Final scoring is withheld because this record is currently Needs verification. Official links, pricing, license, and publication summary need manual review before indexing.

Tool fit

CategoryAI Observability Evaluation
Pricing modelFreemium
Open sourceYes
Self-hostableYes

Primary repository

RepoArize-ai/phoenix
Stars10,063
LanguagePython
Last pushedJun 9, 2026

Mapped use cases

AI Observability And Evaluation

AI Observability Evaluation

Evaluation, tracing, prompt testing, and monitoring tools for LLM applications.

Use caseNeeds verification

Related repositories

Arize-ai/phoenix

Python

AI Observability & Evaluation

RepositoryAuto enrichedPythonActive signal

Related skills

Data Ingestion

Data

Loading, cleaning, chunking, and normalizing documents or structured data.

Role: Recommended

SkillNeeds verificationDataRequired

Debugging

Quality

Finding root causes through logs, reproduction, isolation, and targeted tests.

Role: Recommended

SkillNeeds verificationQualityRequired

Evaluation

Quality

Testing LLM quality, retrieval quality, task completion, and regression behavior.

Role: Recommended

SkillNeeds verificationQualityRecommended