Experiment 001

Judged-open pocket penalty.

A public-safe dossier for the first M3 decision-quality trial. The exact operational artifacts remain private or delayed; this page publishes the hypothesis, boundary, and evidence contract.

Status: draft public briefSource class: manual summaryPublication class: delayed public candidate

Hypothesis

A narrow judged-debate OPEN_LONG context appears weaker than surrounding decision pockets. Applying a small additive rerank penalty to marginal candidates in that context should demote weak opens into safer alternatives or improve target-pocket quality while preserving stronger judged-open behavior.

Target context

Allowed intervention

Not allowed

Decision gate

Promote or continue tuning only if the target pocket contracts or improves in quality, stronger judged-open buckets do not degrade, HOLD-rate movement is localized and explainable, and audit metadata is complete.

Kill or revert if judged-open quality worsens, stronger buckets degrade, action distribution shifts outside the target context, metadata is ambiguous, or the change behaves like a stealth global policy rewrite.