Methodology

How The Governance Compass works

Three ways of asking

The Governance Compass uses three distinct question formats, each designed to capture something the others can't.

Forced-choice tradeoffs form the backbone of the assessment. Each of the 36 forced-choice items presents two statements representing opposite poles of a single axis. There is no neutral option, no “both,” and no scale. The respondent chooses which statement comes closer to their view.

Nuanced scales complement the forced-choice items by capturing intensity. The 24 scaled items each pose a question with five response options specific to that question — not generic “agree/disagree” anchors. The response options themselves communicate the spectrum, making the gradient visible at a glance.

The Chancellor's Budget is a single interactive exercise that captures something no questionnaire can: revealed preferences under scarcity. The respondent allocates 50 points across 7 government ministries — and there isn't enough to fund everything well. The resulting allocation often diverges from questionnaire responses, and that divergence is captured in scoring.

How scoring works

Each of the 12 axes produces a final score from -1.0 to +1.0, with 0.0 representing genuine ambivalence or a balanced position. The score is a weighted composite of the three formats: forced-choice responses receive the highest weight, scaled responses capture intensity and nuance, and the budget allocation reveals operational priorities. The exact weights vary by axis depending on how directly the budget maps to each dimension.

Some axes receive strong budget signal from two opposing ministries. Others receive partial signal from a single ministry. And some axes have no direct budget mapping at all — those are scored entirely from the questionnaire formats.

One of the instrument's most distinctive features is its contradiction detection system. After scoring, the algorithm compares each axis's stated preference (from forced-choice and scaled items) against its revealed preference (from budget allocation). When these diverge significantly, the system flags the discrepancy as a “tension.” Tensions are reported as informative features, not errors. The tension between what you believe in principle and what you prioritize in practice is often where the most interesting self-knowledge lives.

Finally, the 12-axis profile is matched against a set of governance archetype prototypes — idealized profiles representing coherent governance philosophies. The respondent is assigned to the closest archetype and shown their degree of match, their second-closest archetype, and a description of each archetype's internal logic and characteristic tensions.

Archetypes in the Governance Compass fall into three categories, indicated by a small tag on each archetype: theoretically derived (hand-crafted from comparative political philosophy), refined (hand-crafted, then adjusted using empirical data), and empirical(identified through a synthetic population study and named after the fact). The April 2026 synthetic population study — in which roughly 1,000 biographical personas, generated by a separate language model, took the full Governance Compass instrument — surfaced one archetype that was not in the original hand-crafted set and informed refinements to five others. The synthetic study is informative but is not a substitute for empirical validation with real respondents; LLM-generated personas may also under-represent ideologically sharp profiles in favor of narratively conflicted ones.

Each archetype's entry on the archetypes page includes a short section describing governance traditions and movements that express the orientation, with links to further reading. These are illustrations, not defining examples — individual movements rarely align perfectly with any single archetype, and many traditions sit across two or more.

Designing for cross-cultural portability

No question in the instrument references a specific country, political leader, party, policy by name, constitutional provision, or institutional structure. Concepts that carry specific cultural freight — “liberal,” “conservative,” “left,” “right” — never appear in question text. Each is decomposed into its underlying governance tension.

For culturally sensitive items, the instrument uses the portrait method, adapted from Schwartz's Portrait Values Questionnaire. Rather than asking “Do you believe X?” the question describes a hypothetical person, giving respondents permission to identify with positions that might feel uncomfortable to endorse directly.

Most critically, the instrument includes an axis on legitimacy basis. Including this axis allows the instrument to treat non-democratic governance as a coherent position rather than a failure state.

Limitations and what comes next

The Governance Compass is a v1 instrument. Its dimensional structure is grounded in political science theory and comparative governance research, but it has not undergone formal psychometric validation. The 12 axes have not been confirmed through exploratory factor analysis on response data. Test-retest reliability has not been measured. Cross-cultural measurement invariance has not been tested.

This matters. It means the instrument should be used as a tool for structured self-reflection — a way to think about your governance philosophy across more dimensions than you've probably considered — not as a scientific diagnostic or a validated psychometric measure.

The 12 governance archetypes are interpretive anchors, not statistical categories. Most are theoretically derived from comparative political philosophy; a subset have been refined toward — or in one case identified directly from — clusters surfaced in a synthetic population study. As response data from real respondents accumulates, empirically grounded clustering may further refine or replace them.

The roadmap toward greater rigor is concrete: internal consistency testing, exploratory factor analysis, test-retest reliability assessment, and ultimately multi-group measurement invariance testing with respondents from diverse cultural contexts.

A note on how this was built: the research synthesis, question drafting, scoring design, and methodology content on this site were developed collaboratively with AI tools, with human editorial direction and review throughout. The empirical claims and citations in this document have been editorially reviewed but have not been independently verified by a credentialed psychometrician. This transparency is consistent with the project's broader commitment to showing its work.

The April 2026 synthetic population study had roughly 1,000 biographical personas (generated by Gemini) take the full instrument, with responses collected from Claude and Gemini administering it independently. The resulting clusters informed refinements to five hand-crafted archetypes, surfaced one the original set had missed — the Popular Egalitarian — and indicated that two earlier archetypes were better captured as a single type. What this does not do is validate the instrument against actual human responses. Language-model personas reflect their generating model's priors and tend to under-produce ideologically sharp profiles, which is part of why several archetypes (Free Marketeer, Libertarian Individualist, Authoritarian Traditionalist) did not surface empirically despite being well-attested in real political life. The study is useful for checking internal coherence — whether the archetype set spans what the instrument measures and whether archetypes overlap in practice — but does not confirm that any particular archetype is common or correctly drawn in the real world.