Evaluation
Overview
Overview of Gentrace’s evaluation system.
Gentrace’s agent evaluation operates in three steps:
1. Create a dataset with test cases for your AI pipeline
2. Run an experiment using unit tests and/or dataset tests (see the sketch after this list)
3. Analyze results with Gentrace Chat and derivations to extract insights and monitor performance
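To make these steps concrete, here is a minimal TypeScript sketch of step 2 as a unit-test-style experiment. `experiment()` is referenced elsewhere in these docs; the exact names and signatures of `init()`, `evalOnce()`, and the `summarizeArticle()` pipeline function are assumptions for illustration only, so check the Experiments and Unit tests pages for the current API.

```typescript
// Minimal sketch, assuming the v2 TypeScript SDK exposes init(), experiment(),
// and evalOnce() roughly as shown; see the Experiments and Unit tests pages
// for the exact API.
import { init, experiment, evalOnce } from 'gentrace';

// Authenticate the SDK (assumed initialization shape).
init({ apiKey: process.env.GENTRACE_API_KEY });

const PIPELINE_ID = 'your-pipeline-id'; // placeholder pipeline identifier

// Hypothetical pipeline under test; replace with your real AI pipeline call.
async function summarizeArticle(text: string): Promise<string> {
  return `Summary of: ${text.slice(0, 40)}...`;
}

// Step 2: run an experiment that contains a single unit test.
experiment(PIPELINE_ID, async () => {
  await evalOnce('summarizes-short-article', async () => {
    // Call the pipeline and return its output; the result is recorded
    // against this experiment in Gentrace for analysis (step 3).
    return await summarizeArticle('A short test article about evaluation.');
  });
});
```

Dataset tests follow the same pattern, but iterate over the test cases in a dataset instead of a single hand-written input; see the Dataset tests page for details.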
Next steps
- Set up experiments to run systematic evaluations
- Create datasets to organize your test cases
- Use derivations to analyze your results