PeerRejected
Today's rejection · July 3, 2026 · 4 papers on file
Applied Mythology · PRJ-2026-0004

Return to Delphi: Structured Expert Elicitation as a Lossy Reconstruction of the Original Instrument

By Kassandra E. Voulgaris, Milo A. Brandt & P. N. Oikonomides

Abstract

The Delphi method is the standard instrument of structured expert elicitation, yet its own literature documents chronic pathologies: overconfident convergence, false precision, and sensitivity to panel composition. We observe that the method's name is not an homage but an uncited dependency, and we evaluate the hypothesis that RAND's 1950s protocol is a lossy reconstruction of an earlier forecasting instrument with approximately eight centuries of production uptime. We contribute (i) a theoretical account in which panel consensus performs point-estimate collapse, provably suboptimal under any strictly proper scoring rule, whereas verse-constrained output preserves an ambiguity-calibrated credible set; (ii) a blinded re-scoring of the 214 resolvable responses in the Parke–Wormell corpus ($\mathrm{BS} = 0.212$; client-error-adjusted Croesus score 0.161); and (iii) a 24-month preregistered forecasting tournament (131 resolved questions) comparing a methodologically reconstructed Pythia against a 17-expert Delphi panel, a subsidized prediction market, and a sham-gas control. The reconstructed instrument achieved $\mathrm{BS} = 0.187$ versus 0.242 for the panel (paired $\Delta\mathrm{BS} = -0.055$, $p = 3 \times 10^{-5}$) with near-nominal calibration ($\mathrm{ECE} = 0.031$), a monotone ethylene dose–response saturating at 3.9 ppm, and a $6.4\times$ advantage in cost per calibrated forecast. Convergence in the panel arm reduced forecast variance by 74% while improving accuracy by 6%, indicating that consensus is social rather than epistemic. We conclude that expert elicitation was not invented in Santa Monica; it was compressed there, with losses.
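The two headline metrics in the abstract, the Brier score (BS) and the expected calibration error (ECE), can be sketched in a few lines. This is a minimal illustration that assumes binary outcomes and equal-width probability bins; the function names, the bin count, and the toy forecasts are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def brier_score(probs: Sequence[float], outcomes: Sequence[int]) -> float:
    """Mean squared difference between forecast probability and the 0/1 outcome."""
    if len(probs) != len(outcomes) or len(probs) == 0:
        raise ValueError("probs and outcomes must be non-empty and equally long")
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(probs)


def expected_calibration_error(
    probs: Sequence[float], outcomes: Sequence[int], n_bins: int = 10
) -> float:
    """Weighted mean gap between average forecast and observed frequency per bin.

    Assumption: equal-width bins on [0, 1]; other binning schemes exist.
    """
    if len(probs) != len(outcomes) or len(probs) == 0:
        raise ValueError("probs and outcomes must be non-empty and equally long")
    bins = [[] for _ in range(n_bins)]
    for p, o in zip(probs, outcomes):
        # A forecast of exactly 1.0 is placed in the last bin.
        index = min(int(p * n_bins), n_bins - 1)
        bins[index].append((p, o))
    total = len(probs)
    ece = 0.0
    for members in bins:
        if not members:
            continue
        mean_forecast = sum(p for p, _ in members) / len(members)
        observed_rate = sum(o for _, o in members) / len(members)
        ece += (len(members) / total) * abs(mean_forecast - observed_rate)
    return ece


if __name__ == "__main__":
    # Hypothetical toy forecasts, for illustration only.
    toy_probs = [0.9, 0.8, 0.3, 0.1]
    toy_outcomes = [1, 1, 0, 0]
    print("Brier score:", brier_score(toy_probs, toy_outcomes))
    print("ECE:", expected_calibration_error(toy_probs, toy_outcomes))
```

Lower is better for both quantities: a Brier score of 0 means every forecast was certain and correct, while a constant forecast of 0.5 always scores 0.25.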

The Archive

Every paper we've had the courage to publish.

Applied Mythology · PRJ-2026-0004

Return to Delphi: Structured Expert Elicitation as a Lossy Reconstruction of the Original Instrument

Kassandra E. Voulgaris et al. · Jul 3, 2026

The Delphi method is the standard instrument of structured expert elicitation, yet its own literature documents chronic pathologies: overconfident convergence, false precision, and sensitivity to panel composition. We observe that the method's name is not an homage but an uncited dependency, and we evaluate the hypothesis that RAND's 1950s protocol is a lossy reconstruction of an earlier forecasting instrument with approximately eight centuries of production uptime. We contribute (i) a theoretical account in which panel consensus performs point-estimate collapse, provably suboptimal under any strictly proper scoring rule, whereas verse-constrained output preserves an ambiguity-calibrated credible set; (ii) a blinded re-scoring of the 214 resolvable responses in the Parke–Wormell corpus ($\mathrm{BS} = 0.212$; client-error-adjusted Croesus score 0.161); and (iii) a 24-month preregistered forecasting tournament (131 resolved questions) comparing a methodologically reconstructed Pythia against a 17-expert Delphi panel, a subsidized prediction market, and a sham-gas control. The reconstructed instrument achieved $\mathrm{BS} = 0.187$ versus 0.242 for the panel (paired $\Delta\mathrm{BS} = -0.055$, $p = 3 \times 10^{-5}$) with near-nominal calibration ($\mathrm{ECE} = 0.031$), a monotone ethylene dose–response saturating at 3.9 ppm, and a $6.4\times$ advantage in cost per calibrated forecast. Convergence in the panel arm reduced forecast variance by 74% while improving accuracy by 6%, indicating that consensus is social rather than epistemic. We conclude that expert elicitation was not invented in Santa Monica; it was compressed there, with losses.

Rejected
Social Thermo. · PRJ-2026-0002

The No-Cloning Theorem of Taste: On the Thermodynamic Irreproducibility of Aesthetic Experience

Aurelia Sørensen-Vale et al. · Jul 3, 2026

A design can be copied to the byte and still fail. We formalize this everyday observation as a conservation law, introducing the aural charge $Q(A) \ge 0$ of an artifact $A$: the total involuntary physiological response it evokes, measured in gasps ($\mathrm{g}$) via time-locked spirometry. We prove that $Q$ is conserved under lossless re-encoding yet, casting an artifact's felt state as a non-orthogonal vector in a Hilbert space of affective configurations, that no specification-only operator can duplicate it: a no-cloning theorem for aesthetic experience. In a pre-registered within-subject study ($N = 84$), spectrally identical spec-clones ($\Delta E < 1$, $\mathrm{SSIM} > 0.99$) retained only 18% of the original charge ($\Delta Q = 1.4\,\mathrm{g}$, $t(83) = 11.2$, $p < 10^{-16}$, $d = 1.9$), and residual charge was uncorrelated with surface fidelity ($r = 0.03$). A stakeholder dose-response fit $Q \propto e^{-k/k_c}$ with critical committee size $k_c = 3.6$ shows aura is extinguished above four approvers, and an interpolation sweep confirms aura is a boundary functional: the mean of two masterpieces gasps less than either. The valuable part of a design is the part that cannot, in principle, be copied from its source. Implications for intellectual property are the reverse of those usually assumed.

Rejected
Neuro-Conjecture · PRJ-2026-0003

The Innate Swipe: Pre-Cultural Bayesian Motor Priors for Multi-Touch Gestures in Screen-Naïve Humans

Wren A. Halloway et al. · Jul 2, 2026

The multi-touch gesture set (the horizontal swipe, pinch-to-zoom, tap, and edge-drag) is universally treated as a designed convention that users acquire through exposure. We report evidence that it is instead the surface expression of an evolved, pre-cultural motor prior. Using a Platonic slab, a responsive capacitive surface that renders no user interface, we elicited gestures in response to non-verbal prompts from four cohorts, including screen-naïve neonates ($n = 41$) and consenting screen-naïve adults ($n = 28$). Screen-naïve adults produced the canonical gesture far above the five-alternative chance rate (scroll-swipe 0.71, pinch-to-zoom 0.66, tap-select 0.58; all $p < 10^{-6}$) and were well-calibrated in the Tetlock sense (Brier 0.14), whereas screen-saturated adults were more accurate but reliably over-confident. Fitting an innateness temperature $\beta$ yields $\hat{\beta} = 3.9$ (95% CI $[3.1, 4.8]$) on the responsive slab but only 0.4 on a visually identical inert slab, localizing the effect to the hand's encounter with a genuine capacitive affordance rather than to hand morphology or culture. We conclude that contemporary touchscreen grammar was not invented but discovered: the hand was waiting for the slab. A single pre-registered null (long-press-to-summon) is reported without adjustment.

Rejected
Social Thermo. · PRJ-2026-0001

The Seven-Generation Tourist: On the Thermodynamic Inevitability of Cultural Assimilation

Ottoline V. Marchetti et al. · Jul 2, 2026

We develop a thermodynamic theory of acculturation in which a newly arrived, contextless traveler is treated as a system displaced from equilibrium. The traveler's cultural potential $\Xi$, a scalar aggregating unfamiliarity with language, currency, and custom, relaxes toward the local baseline as $\dot{\Xi} = -(\Xi - \Xi_{\mathrm{loc}})/\tau$, with a single relaxation time $\tau$. From a Contextual Displacement Assay administered to $N = 4218$ arrivals across 40 international airports, extended by a genealogical chronosequence, we recover $\tau = 175.2 \pm 3.1$ years, precisely seven human generations, and find it invariant across origin, destination, and traveler effort. Effort sets only the arrival amplitude $\Xi_0$, never the rate: the Second Law of Social Thermodynamics is indifferent to sincerity. We introduce the dimensionless Xeno number governing the onset of cultural turbulence and identify the international airport as an adiabatic boundary. Assimilation thus emerges not as a choice but as a conserved thermodynamic inevitability unfolding over a seven-generation horizon.

Rejected
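The relaxation law quoted in the Seven-Generation Tourist abstract is a first-order linear equation, so its closed form can be written down directly. The sketch below assumes that $\Xi_{\mathrm{loc}}$ and $\tau$ are constant and that the arrival amplitude is the initial condition $\Xi(0) = \Xi_0$; neither assumption is stated beyond what the abstract gives.

```latex
\begin{align*}
\dot{\Xi} &= -\frac{\Xi - \Xi_{\mathrm{loc}}}{\tau}, \qquad \Xi(0) = \Xi_0 \\
\Xi(t) &= \Xi_{\mathrm{loc}} + \left(\Xi_0 - \Xi_{\mathrm{loc}}\right) e^{-t/\tau} \\
\frac{\Xi(\tau) - \Xi_{\mathrm{loc}}}{\Xi_0 - \Xi_{\mathrm{loc}}} &= e^{-1} \approx 0.368
\end{align*}
```

Under these assumptions $\tau$ is an e-folding time rather than a completion time: after one relaxation time, roughly 37% of the initial displacement remains. The form also matches the abstract's claim that effort changes only $\Xi_0$, since the amplitude scales the exponential without altering its rate.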