Verification and Stopping

Reliable loops need two separate decisions: whether the latest work is acceptable, and whether the loop should continue at all. `adk-loop-lab` keeps both decisions explicit and deterministic by default.

Deterministic evaluators

The reusable deterministic evaluators cover the kinds of checks that should never depend on model confidence:

schema validation
exact matches
numeric ranges such as word count
required items
forbidden patterns

Example 1 uses this directly for word count, concrete-example detection, generation-versus-verification checks, and unsupported citation phrasing.

Model-based evaluators

Not every quality dimension is reducible to a rule. The project still supports model judges for clarity, coherence, and qualitative feedback, and they participate through the same EvaluationResult contract. The reusable composite evaluator supports deterministic veto behavior, while the generic controller currently applies a simpler all-pass rule directly.

Composite policies

The evaluation layer supports several policies:

DETERMINISTIC_VETO
ALL_REQUIRED
WEIGHTED_SCORE
QUORUM

The default helper policy matters most: deterministic veto means a failing hard check blocks success even if the model judge likes the result. In the current generic controller, richer composite policies exist but are not yet the default execution path.

Stopping conditions

The controller can emit decision outcomes such as:

SUCCESS
FAILED
BLOCKED
BUDGET_EXHAUSTED
STAGNATED
CONTINUE

The stopping-policy decision order stays straightforward:

stop on fatal error
succeed if all criteria are met
stop on budget exhaustion
stop on stagnation
stop on duration exhaustion
otherwise continue

How verification feeds stopping

Verification updates progress score, evaluation history, and stagnation tracking. Stopping then reads those deterministic signals plus the current budget state. That split is important: evaluators judge the last action; the stopping policy judges the run as a whole.

Next: Failure Modes.