Concept Entry

WisdomBench

One-Sentence Definition

Definition.

WisdomBench is a public benchmark line for evaluating longitudinal failure learning, recovery, grader drift, and wisdom-oriented behavior under bounded evidence.

Evidence Scope

What evidence this term covers.

Public evidence covers the P02 DOI, the public dataset route, repository metadata, benchmark boundaries, and the challenge paths for raising reproducibility or baseline issues.

DOI / GitHub / HF Links

Stable public anchors.

Each concept entry has a DOI or Zenodo anchor, a GitHub route, a Hugging Face route, a main-site route, and a counterexample route.

Kind | Anchor | URL
DOI / Zenodo | P02 Zenodo record | https://zenodo.org/records/19793098
GitHub | WisdomBench GitHub | https://github.com/mmjbds/wisdombench
Hugging Face | Technical mirror or dataset | https://huggingface.co/datasets/MMJBDS/wisdombench
Main site | Canonical public concept page | https://mianzhang.org/concepts/wisdombench.html
Counterexample route | Public challenge entry | https://mianzhang.org/counterexamples/index.html
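
The five-anchor rule stated above can be checked mechanically. The following is a minimal sketch, not part of any WisdomBench tooling: the key names (`doi_or_zenodo`, `github`, and so on) and the function `anchor_problems` are hypothetical and introduced only for illustration, while the URLs are copied from the table. It checks only that each required kind is present and is an absolute https URL; it does not fetch anything or verify that the routes resolve.

```python
from urllib.parse import urlparse

# Hypothetical key names for illustration; the five kinds mirror the anchor table.
REQUIRED_KINDS = (
    "doi_or_zenodo",
    "github",
    "hugging_face",
    "main_site",
    "counterexample",
)

# URLs copied from the table for the WisdomBench entry.
WISDOMBENCH_ANCHORS = {
    "doi_or_zenodo": "https://zenodo.org/records/19793098",
    "github": "https://github.com/mmjbds/wisdombench",
    "hugging_face": "https://huggingface.co/datasets/MMJBDS/wisdombench",
    "main_site": "https://mianzhang.org/concepts/wisdombench.html",
    "counterexample": "https://mianzhang.org/counterexamples/index.html",
}


def anchor_problems(anchors):
    """Return a list of problems; an empty list means all five kinds are present."""
    problems = []
    for kind in REQUIRED_KINDS:
        url = anchors.get(kind)
        if not url:
            problems.append(f"missing anchor: {kind}")
            continue
        parsed = urlparse(url)
        # Syntactic check only: no network request is made.
        if parsed.scheme != "https" or not parsed.netloc:
            problems.append(f"not an absolute https URL: {kind}")
    return problems


if __name__ == "__main__":
    print(anchor_problems(WISDOMBENCH_ANCHORS))
```

An entry that lacks one of the five kinds, or lists a bare host without a scheme, would come back with one message per problem, which makes the check easy to run over several concept entries at once.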

Boundary Statement

What this entry does not prove.

This entry does not claim general intelligence, solved alignment, universal evaluation coverage, or a single sufficient wisdom score.