Arize AI Azure+Arize Observability Evaluations

Azure+Arize Observability Evaluations

agentsarize-tutorialsLLMPython

Export

Run Notebooks

Contents

No cells yet

Add cells to see them here

Azure AI Foundry and Arize for Agent Observability and Evaluation

Reference: Azure AI Foundry - LangChain Integration

This notebook demonstrates how to:

Build a LangChain multi-chain agent on Azure AI Foundry while tracing all operations to Arize for observability
Leverage Azure AI Evaluators to evaluate LLM behavior
Log evaluation results to Arize for visibility

Prerequisites:

[ ]

Set up OpenTelemetry instrumentation to send traces to Arize for observability.

[ ]

A multi-chain agent: producer (generates content) and a verifier (validates content).

[ ]

Traces will be generated and sent to Arize on each agent run

[ ]

Optional: Test evaluator call

[ ]

Export traces from Arize and run the hate and unfairness evaluator on all rows

[ ]

Traces will have evaluation label, score and explanation attached

[ ]

Some things to do next:

Curate datasets to drive prompt optimization or fine tuning jobs
Send regressions to labeling queues for human annotators to curate golden datasets
Create custom metrics, monitors from evaluation labels