Concept Entry

WisdomBench

Keywords: P02, WisdomBench, wisdom benchmark, failure-learning benchmark

One-Sentence Definition

Definition.

WisdomBench is a public benchmark line for evaluating longitudinal failure learning, recovery, grader drift, and wisdom-oriented behavior under bounded evidence.

Evidence Scope

What evidence this term covers.

Public evidence covers the P02 DOI, public dataset route, repository metadata, benchmark boundaries, and challenge paths for reproducibility or baseline issues.

DOI / GitHub / HF Links

Stable public anchors.

Each concept entry has a DOI or Zenodo anchor, a GitHub route, a Hugging Face route, a main-site route, and a counterexample route.

Kind                  Anchor                          URL
DOI / Zenodo          P02 Zenodo record               https://zenodo.org/records/19793098
GitHub                WisdomBench GitHub              https://github.com/mmjbds/wisdombench
Hugging Face          Technical mirror or dataset     https://huggingface.co/datasets/MMJBDS/wisdombench
Main site             Canonical public concept page   https://mianzhang.org/concepts/wisdombench.html
Counterexample route  Public challenge entry          https://mianzhang.org/counterexamples/index.html
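
As an illustration of the five-anchor rule, the following is a minimal Python sketch that checks whether a concept entry lists every required kind of anchor. The names REQUIRED_KINDS, WISDOMBENCH_ANCHORS, and missing_anchor_kinds are hypothetical and are not part of any published WisdomBench tooling; the URLs are copied from the anchors listed above. The check is purely syntactic (an https scheme and a host) and does not confirm that any URL resolves.

```python
from urllib.parse import urlparse

# Hypothetical constant: the five anchor kinds named in this entry.
REQUIRED_KINDS = (
    "DOI / Zenodo",
    "GitHub",
    "Hugging Face",
    "Main site",
    "Counterexample route",
)

# Anchors for this entry, copied from the list above.
WISDOMBENCH_ANCHORS = {
    "DOI / Zenodo": "https://zenodo.org/records/19793098",
    "GitHub": "https://github.com/mmjbds/wisdombench",
    "Hugging Face": "https://huggingface.co/datasets/MMJBDS/wisdombench",
    "Main site": "https://mianzhang.org/concepts/wisdombench.html",
    "Counterexample route": "https://mianzhang.org/counterexamples/index.html",
}


def missing_anchor_kinds(anchors):
    """Return the required kinds that lack a well-formed https URL."""
    missing = []
    for kind in REQUIRED_KINDS:
        parsed = urlparse(anchors.get(kind, ""))
        # Syntactic check only: scheme must be https and a host must be present.
        if parsed.scheme != "https" or not parsed.netloc:
            missing.append(kind)
    return missing


if __name__ == "__main__":
    print(missing_anchor_kinds(WISDOMBENCH_ANCHORS))
```

An empty result means all five kinds are present; an entry that listed only a GitHub route would report the other four kinds as missing.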

Boundary Statement

What this entry does not prove.

This entry does not claim general intelligence, solved alignment, universal evaluation coverage, or a single sufficient wisdom score.