Class ToolDescriptionReliabilityEvaluator
java.lang.Object
dev.dokimos.core.BaseEvaluator
dev.dokimos.core.evaluators.agents.ToolDescriptionReliabilityEvaluator
- All Implemented Interfaces:
Evaluator
Evaluates tool description quality using a mix of rule-based checks and optional LLM checks.
Performs 13 checks across two categories:
Rule-based (always run):
input_arguments_clarity: Each parameter has a "description" keyinput_arguments_types: Each parameter has a "type" keymax_num_input_arguments: Total params ≤ maxInputArgs (default 5)max_optional_input_arguments: Optional params ≤ maxOptionalArgs (default 3)
LLM-based (require judge):
general_structure: Description includes purpose, inputs, and outputhas_examples: Description includes usage exampleshas_usage_notes: Description includes notes/limitations/caveatsintent_over_implementation: Communicates what, not howclarity: Avoids obscure/ambiguous termsredundancy: Avoids redundant informationinput_arguments_enum: Applicable args include enumeration valuesinput_arguments_format: Applicable args include format specsreturn_statement_quality: Output information is clearly described
Without a judge LLM, only the 4 rule-based checks run. Score is based on checks that actually ran.
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic classBuilder for constructing the evaluator. -
Method Summary
Modifier and TypeMethodDescriptionbuilder()Creates a new builder for constructing the evaluator.Methods inherited from class dev.dokimos.core.BaseEvaluator
evaluate, evaluateAsync, evaluateAsync, name, threshold
-
Method Details
-
builder
Creates a new builder for constructing the evaluator.- Returns:
- a new builder
-