Gentrace’s agent evaluation operates in three steps:
  1. Create a dataset with test cases for your AI pipeline
  2. Run an experiment using unit tests and/or dataset tests
  3. Analyze results with Gentrace Chat and derivations to extract insights and monitor performance

Next steps