WisdomBench is a public benchmark line for evaluating longitudinal failure learning, recovery, grader drift, and wisdom-oriented behavior under bounded evidence.
Concept Entry
WisdomBench
WisdomBench is a public benchmark line for evaluating longitudinal failure learning, recovery, grader drift, and wisdom-oriented behavior under bounded evidence.
P02 WisdomBench
wisdom benchmark
failure-learning benchmark
One-Sentence Definition
Definition.
Evidence Scope
What evidence this term covers.
Public evidence covers the P02 DOI, public dataset route, repository metadata, benchmark boundaries, and challenge paths for reproducibility or baseline issues.
DOI / GitHub / HF Links
Stable public anchors.
Each concept entry has a DOI or Zenodo anchor, a GitHub route, a Hugging Face route, a main-site route, and a counterexample route.
KindAnchorURL
Boundary Statement
What this entry does not prove.
This entry does not claim general intelligence, solved alignment, universal evaluation coverage, or a single sufficient wisdom score.