Class TaskCompletionEvaluator
java.lang.Object
dev.dokimos.core.BaseEvaluator
dev.dokimos.core.evaluators.agents.TaskCompletionEvaluator
All Implemented Interfaces:
Evaluator
Evaluates whether an AI agent completed the user's requested tasks.
This is a black-box evaluator: a judge LLM analyzes the dialog between user and agent and compares it against a predefined task list. The score is the fraction of tasks completed (0.0–1.0).
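The scoring rule can be illustrated with a minimal sketch. It is not the library's implementation: the class name `TaskCompletionScoreSketch`, the `score` method, and the handling of an empty task list are hypothetical, and the per-task verdicts are assumed to have already been produced by the judge LLM.

```java
import java.util.List;

/**
 * Illustrative sketch only; not part of dev.dokimos. Shows how a score equal to
 * the fraction of completed tasks could be derived from per-task verdicts.
 */
final class TaskCompletionScoreSketch {

    private TaskCompletionScoreSketch() {}

    /**
     * @param completed one verdict per task in the predefined task list;
     *                  true means the judge considered the task completed
     * @return completed tasks divided by total tasks, in the range 0.0 to 1.0
     */
    static double score(List<Boolean> completed) {
        if (completed.isEmpty()) {
            return 0.0; // assumption: an empty task list scores 0.0
        }
        long done = completed.stream().filter(Boolean::booleanValue).count();
        return (double) done / completed.size();
    }
}
```

With three of four tasks judged complete, `TaskCompletionScoreSketch.score(List.of(true, true, false, true))` returns 0.75.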
Nested Class Summary
Nested Classes
static class: Builder for constructing the evaluator.
Method Summary
builder(): Creates a new builder for constructing the evaluator.
Methods inherited from class dev.dokimos.core.BaseEvaluator
evaluate, evaluateAsync, evaluateAsync, name, threshold
Method Details
builder
Creates a new builder for constructing the evaluator.
Returns:
a new builder