Historical Analogy Method

When the First Sentence of a Strategy Memo Decides the Rest

An experienced policy reader develops a specific alarm. It triggers when a strategy memo’s opening sentence invokes a historical parallel — Munich, the Cuban Missile Crisis, the Cold War, the nineteenth-century scramble for colonies. The reader knows what comes next. The memo will treat the analogy as established, derive prescriptions from it as if they were theorems, and defend them by waving back at the historical case whenever an objection is raised. The analogy, which should have been a tool for disciplined thinking, has become the conclusion dressed as a premise. Somewhere inside the reasoning, a structural difference between the historical case and the current one is doing enormous work that no one has acknowledged.

This is not a failure of history. It is a failure of analogical reasoning. Historical analogy is indispensable in strategy — much of what analysts know about how great-power competition, commons governance, and arms control behave under pressure comes from specific historical cases, not from general theory. The question is whether the analogy is being used or being waved. The Historical Analogy Method is the discipline for using them properly. Its premise is that structured analogical reasoning — explicit case selection, symmetric similarity-and-difference mapping, conditional lesson extraction, flagged failure points — is fundamentally different from rhetorical invocation. What follows is an account of where the method comes from, what distinguishes it, and where its outputs should be trusted.

Neustadt, May, Khong, Jervis — The Literature That Made It a Method

The intellectual lineage runs through a small number of sharply focused works. Richard Neustadt and Ernest May’s Thinking in Time (1986) was the field-defining text. Neustadt, a political scientist and former White House adviser, and May, a historian, taught a generation of decision-makers and analysts how historical reasoning actually fails in practice — how decision-makers grab vivid analogies and apply them without testing structural fit, how briefers select historical cases to support conclusions already reached, how the press and political discourse weaponize analogies in ways that distort the decisions they are supposed to inform. Their method — the “Goldberg Rule” and its siblings, asking what questions the analogy does and does not answer, what the decision-makers know and do not know, what the story’s own dissimilarities suggest — became the standard discipline for disciplined analogical reasoning.

Yuen Foong Khong’s Analogies at War (1992) extended the analysis to a specific case, the American decision-making around Vietnam, and showed how the Munich and Korea analogies shaped Washington’s reasoning — and how their selection, rather than their content, determined the policy outcomes. Khong’s work made two points that remain foundational. Analogies are selected, not discovered; different decision-makers invoke different analogies, and the choice of which to invoke is itself a consequential act. And analogies work through a small number of specific cognitive functions — they diagnose the problem, suggest the stakes, predict the consequences, evaluate the alternatives — each of which can be assessed separately and each of which can fail.

Robert Jervis’s Perception and Misperception in International Politics (1976) is older but indispensable. Jervis catalogued the cognitive patterns through which decision-makers misread international situations, and historical analogy was prominent among them. His contribution to the method was the diagnosis of how analogies go wrong: overweighting the vivid, the recent, and the personally experienced; ignoring disconfirming differences; applying the lessons of the most salient case regardless of structural fit. Jervis’s cautions remain the critical counterweight to the method’s constructive side.

The space sector has absorbed these disciplines unevenly. The strongest uptake has been in commons-governance discussions, where analogies from the Antarctic Treaty System, the deep-seabed regime, and the law of the high seas recur. The weakest uptake has been in military space, where Cold War analogies are invoked reflexively and rarely tested for structural fit.

What the Method Actually Sees

The characteristic analytical move is the symmetric mapping of similarity and difference. Casual analogy catalogs similarities and treats the listing as persuasion. The method treats the similarity list as half the work. The second half — enumerating the structural differences with equal rigor — is what distinguishes the discipline from the rhetoric. An analogy whose similarities are spelled out in detail and whose differences are waved away is not an analysis; it is advocacy wearing an analytical costume.

Structural, not narrative

The method reads cases through a small set of dimensions that strategic interactions share: actor configuration (number, type, power distribution), incentive structure (payoffs, time pressures, credible commitments), domain characteristics (commons, territorial, technological, institutional), institutional context, information environment, temporal dynamics, and technology gap. A case compared on these dimensions produces a different kind of finding than a case compared on narrative surface.

Conditional lessons

Lessons are framed as conditional propositions: *if* the structural parallel holds on dimension X, *then* the historical experience suggests Y. Unconditional lessons — "Vietnam teaches us that X" — are the method's characteristic failure mode. Conditionality forces the analyst to specify which dimensions the lesson depends on, which allows the lesson to be checked when the current situation diverges from the historical one.

Explicit failure-point list

Every analogy breaks down somewhere, and the method requires the break points to be named. What features of the current situation have no historical parallel? Where might the analogy create a false sense of predictability? What did historical decision-makers get wrong by relying on faulty analogies of their own? The failure-point list is the analytical counterweight to the similarity list, and its absence is a tell.

Analogy-quality rating

At the end, the method assesses the overall utility of the comparison as Strong, Moderate, or Weak. Strong analogies show multiple structural parallels with few critical differences. Moderate analogies illuminate specific dimensions while diverging on others. Weak analogies are superficially appealing but structurally divergent, and the honest rating is indispensable: a Weak rating on a vivid case is the finding, not a failure of the method.

Reading Two Cases Against Lunar Resource Governance

Consider a generic case: a policy question about how to govern lunar resource extraction as the surface operator population grows. Two historical cases present themselves.

The first is the Antarctic Treaty System of 1959. A small number of claimant states, under Cold War strategic constraints, negotiated a regime that froze sovereignty claims in exchange for scientific cooperation and demilitarization. The treaty held for decades, was strengthened by subsequent protocols, and became the archetype of commons governance for a resource-rich but then-inaccessible domain. Actor configuration: small group of capable states, publicly aligned on peaceful use, with minor commercial interests. Incentive structure: science-driven, low commercial payoff, strong Cold War constraint against the alternative of militarization. Domain characteristics: commons with locational specificity and uncertain resource profile. Institutional context: bilateral diplomatic infrastructure plus new treaty mechanisms. Information environment: relatively transparent, with national scientific activity widely publicized.

The second is the deep-seabed mining regime under Part XI of the UN Convention on the Law of the Sea, negotiated through the 1970s and substantially modified in 1994. A much larger group of states — including emerging nations demanding a redistributive framework — negotiated a regime for a commons whose commercial exploitation depended on technology yet to be proven. When commercial interests proved unwilling to operate under the regime’s original terms, the framework had to be renegotiated more than a decade after its drafting. Actor configuration: large group of states with sharply divergent positions. Incentive structure: contested commercial exploitation, redistributive demands. Domain characteristics: commons without locational specificity, resource profile uncertain and technology-gated. Institutional context: UN framework, with precedents from the law of the high seas. Information environment: commercial opacity around extraction technology and economics.

Dimension	Antarctic Treaty System (1959)	Deep-seabed regime (UNCLOS Part XI)
Actor configuration	Small group of capable states, publicly aligned	Large group with sharply divergent positions
Incentive structure	Science-driven, low commercial payoff	Contested commercial exploitation, redistributive demands
Domain characteristics	Commons with locational specificity	Commons without locational specificity
Institutional context	Bilateral diplomacy plus new treaty mechanisms	UN framework, high-seas precedents
Information environment	Relatively transparent scientific activity	Commercial opacity around extraction
Longevity	Held for decades, strengthened by protocols	Required renegotiation within a decade

Mapping both cases against lunar resource governance produces a finding neither case would produce alone. On actor configuration, lunar competition resembles Antarctica: few capable states, high barriers to entry, early dominance by state actors. On incentive structure, lunar extraction resembles the seabed: commercial payoff is the driver, not scientific cooperation, and the technology is advancing faster than the regulatory framework. On domain characteristics, lunar resources have locational specificity (polar water ice is not uniformly distributed) more like Antarctica than the seabed, but the commercial logic — extraction for sale into terrestrial or in-space markets — resembles the seabed.

The non-obvious lesson is conditional. If the lunar-operator population remains small and dominated by state actors for the first decade, Antarctic-style governance could hold, because the Antarctic analogy’s actor configuration matches. If the population diversifies and commercial incentives dominate within that decade, Antarctic-style governance is unlikely to survive, because the seabed analogy’s renegotiation trajectory becomes more relevant. A regime designed under Antarctic assumptions but exposed to seabed dynamics will either be renegotiated or be ignored by the actors whose interests it fails to accommodate. The policy implication is not that one analogy is right; it is that the choice of regime model should be explicitly tied to the expected actor configuration and incentive structure over a defined period.

The failure-point list is equally important. Neither case involves a domain with orbital access constraints, with real-time operational contact between extraction and terrestrial markets, or with dual-use technology sharing between civil and military uses of the same hardware. These are features of lunar operations that have no historical precedent in either case, and any lesson that assumes the historical institutional inventory is sufficient — no new mechanisms required — is importing a dimensional assumption the analogies cannot support.

The method’s deliverable is this comparative reading, with its conditionality and its failure points flagged. The analyst leaves not with a prediction about lunar governance but with a structural vocabulary for arguing about it — which is what historical reasoning at its best produces.

Where It Shines, Where It Limps

The Historical Analogy Method excels when a current situation has genuine structural parallels in earlier cases and when decision-makers are already invoking analogies that need disciplining. It is the right instrument for commons-governance questions, for new domains where theory is underdeveloped, and for situations where historical depth is needed to counterbalance forward-looking speculation. When the question is what patterns have we seen before that might be working here?, no other method in the library is as direct.

Its limits are substantial. Superficial resemblance frequently masks deep structural difference; the symmetric mapping discipline is the only corrective, and it is easily abandoned under time pressure. Analysts over-rely on vivid cases — Munich, Vietnam, the Cold War — that may not be the most structurally relevant; deliberate consideration of less salient but structurally closer precedents is the counterweight. The method illuminates possibilities and patterns, not certainties; it cannot predict outcomes, and framing it as predictive invites misuse. Every historical situation is unique in some respects, and the analytical question is whether the similarities outweigh the differences on the dimensions that matter, not whether similarities exist at all.

Confirmation bias is the method’s most persistent failure mode. Analysts select cases that support a pre-existing conclusion and apply the method’s apparatus around the selection. The mitigation is requiring explicit selection justification: why this case and not the three others that could have been chosen? A selection that cannot survive the question has not earned its place in the analysis.

Space-domain topics often have limited precedent. Terrestrial commons analogies are useful but inherently imperfect, because no terrestrial case shares the full set of features — orbital mechanics, global interdependence on services from the domain, rapid dual-use technology evolution. The method’s honest output in space-domain work is frequently a Moderate or Weak analogy rating on cases that superficially look Strong, and the discipline is to publish the honest rating rather than the vivid one.

The method works best combined with theoretical frameworks that explain why the historical pattern occurred, not just that it occurred. Realist theory, institutional theory, and constructivist theory each supply explanatory mechanisms that allow the analogy’s transferability to be tested structurally. An analogy offered without theoretical grounding is weaker than one that names the mechanism and asks whether it is present in the current case.

Within the library, the method pairs with scenario planning to supply historical patterns as plausible trajectories, with risk assessments as confidence modifiers on forecasts, with policy cycle analysis to provide temporal benchmarks for treaty negotiations, and with theoretical methods (realist, liberal-institutionalist, constructivist) to supply the explanatory mechanisms that make the analogy structurally defensible.

A Note for the Practitioner

Reach for this method when a historical case offers genuine structural parallels and someone is about to invoke it rhetorically, when a new domain needs historical scaffolding to complement underdeveloped theory, or when a current trajectory resembles a known pattern and the analyst wants to test whether the resemblance survives structural scrutiny. Select at least two cases rather than one; multiple cases force comparative rigor and resist the single-case overreach that is the method’s most common failure mode. Complete the difference mapping with the same energy as the similarity mapping; if time runs short, cut the analysis rather than the difference list. Frame every lesson as conditional. Pair with theoretical frameworks to supply explanatory mechanisms. The operational version of the method — the full step-by-step procedure — sits alongside this article for teams that want to apply it systematically; this piece is the case for picking it up.