Class TaskCompletionEvaluator

java.lang.Object
dev.dokimos.core.BaseEvaluator
dev.dokimos.core.evaluators.agents.TaskCompletionEvaluator
All Implemented Interfaces:
Evaluator

public class TaskCompletionEvaluator extends BaseEvaluator
Evaluates whether an AI agent completed the user's requested tasks.

This black-box evaluator uses a judge LLM to analyze the dialog between the user and the agent and checks it against a predefined task list. The score is the fraction of tasks judged complete, from 0.0 (none completed) to 1.0 (all completed).
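
The scoring arithmetic can be illustrated with a minimal, self-contained sketch. It is not the dokimos implementation: the class name TaskCompletionScoreSketch, the score method, and the list of boolean verdicts are all hypothetical, and the handling of an empty task list (scored as 0.0 here) is an assumption the source does not specify. The sketch only shows how per-task verdicts from a judge would reduce to a fraction between 0.0 and 1.0.

```java
import java.util.List;

// Hypothetical sketch: none of these names come from the dokimos API.
final class TaskCompletionScoreSketch {

    // Returns the fraction of tasks whose verdict is "completed".
    // Assumption: an empty task list scores 0.0; the real evaluator may differ.
    static double score(List<Boolean> verdicts) {
        if (verdicts.isEmpty()) {
            return 0.0;
        }
        long completed = verdicts.stream().filter(Boolean::booleanValue).count();
        return (double) completed / verdicts.size();
    }

    public static void main(String[] args) {
        // Three predefined tasks; the judge marked two of them as completed.
        List<Boolean> verdicts = List.of(true, true, false);
        System.out.println(score(verdicts));
    }
}
```

In this sketch each task counts equally, so an agent that finishes one of two requested tasks would score 0.5 regardless of which task it finished.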

  • Method Details