New articles on Economics


[1] 2606.00587

Hashprice modulates the electricity demand response of Bitcoin miners

Large, fast-controllable loads such as Bitcoin mining facilities are increasingly viewed as potential sources of flexibility in modern power systems, yet the conditions under which this flexibility is realized remain incompletely understood. Using the Texas power market as an empirical setting, we examine how Bitcoin-mining load responds to two distinct electricity-sector cost channels: contemporaneous wholesale electricity prices and incentives created by coincident-peak-based transmission charges. We find that mining load responds to both cost channels in a manner consistent with miners operating around a breakeven point. At the aggregate level, we observe that mining load decreases as electricity-sector costs rise, but the strength of this response depends on hashprice, a measure of expected mining revenue from the crypto-financial sector. When hashprice is higher, aggregate load responsiveness is weaker. This mechanism is especially evident in the wholesale-price response. Mining load remains largely online at low prices and begins to decline only when electricity costs become large relative to expected mining revenue, with higher hashprice shifting the implied curtailment threshold toward higher wholesale prices. These findings indicate that Bitcoin-mining demand response to electricity-sector costs is economically state-dependent and shaped by revenue conditions in the crypto-financial sector. Treating such loads as stable demand-response resources may therefore overstate available grid flexibility, with implications for power-system planning, market design, and reliability assessment.


[2] 2606.00614

Mitigation of spatial economic impact propagation of highway disruptions by redundant networks

The damage to transportation infrastructure caused by disasters can indirectly lead to economic damage through economic interdependence, even in areas that are not directly affected. However, even when transportation routes are interrupted by a disaster, the damage can be mitigated if alternative routes are secured. Rural areas with low-density transportation networks are more vulnerable to traffic disruptions in a disaster. This study develops a method for evaluating the effectiveness of redundant transportation networks in mitigating economic vulnerability in the event of a disaster. Our methodology combines inter-regional road network connectivity with a spatial computable general equilibrium (SCGE) model. We apply the method to road disruption scenarios in the Chugoku region of Japan, which has a system of parallel highways. The affected areas are in close geographical proximity to many rural areas and have strong economic interdependencies with them. Several counterfactual simulations depicted the situation without the alternative road and the disaster. We evaluate the transportation impacts, measured by changes in travel time, and the economic impacts, measured by negative benefits, respectively. The results suggest that the economic vulnerability reduction effect is more far-reaching than the transportation impacts.


[3] 2606.00811

Certificates without Electrons? Theory and Evidence on Impacts from AI-Driven Power Demand

Data centers now account for 4.4% of United States electricity demand, yet the grid-level effectiveness of the renewable energy certificates (RECs) and power purchase agreements (PPAs) hyperscalers use to claim carbon neutrality remains unclear. We develop a game-theoretic model in which a data center operator chooses among RECs, PPAs, and behind-the-meter colocation while generators make entry decisions under endogenous financing costs. The model identifies a timing wedge -- the mismatch between consumption and credited renewable generation -- as a central mechanism through which AI demand degrades reliability, raises prices, and increases emissions even when RECs cover 100% of annual consumption. Colocation with storage addresses this wedge directly and induces the greatest renewable entry by eliminating generator revenue risk. We test these predictions by exploiting the staggered release of large language models as a natural experiment, using difference-in-differences on a novel dataset linking AI activity to local grid outcomes. AI demand significantly increases fossil generation, wholesale prices (up to 25% in treated PJM zones), and outage frequency (0.5--1 additional outages per year) near data centers, with impacts scaling in model size. Data centers with on-site generation exhibit a sign reversal in power-quality effects, consistent with the model's prediction that behind-the-meter capacity absorbs demand spikes. Counterfactual analyses show that edge inference, spatial reallocation, and colocated storage each substantially mitigate grid impacts, while REC-only strategies do not. Together, our results demonstrate that the externalities of AI to the grid are tightly coupled to procurement design and the spatial organization of data center infrastructure.


[4] 2606.00948

Recession Detection in Japan using Labor Market Data

Recession indicators are often viewed as U.S. specific, raising the question of whether labor market-based rules such as the Sahm Rule and the Michez Rule can reliably detect recessions in other countries. To answer this, we evaluate whether such rules can be adapted to Japan by calibrating thresholds and smoothing parameters to Japanese labor market data. We construct a large set of 95,832 recession indicators combining unemployment and vacancy data. The selected classifiers are statistically perfect as they identify all 11 historical recessions in the 1970-2021 training period without generating any false positives. Among these, 193 classifiers lie on the anticipation-precision frontier. Restricting attention to the high-precision segment yields six classifiers with a standard deviation of detection errors below 3 months. The selected classifier ensemble signals recessions, on average, 0.06 months after their true onset. Overall, these findings suggest that slack-based labor market rules provide a general framework for improving real-time recession detection across countries.


[5] 2606.00972

Designing entry-monotone risk-sharing pools

While risk pooling lowers the total cost of risk, efficiency alone does not make a pool viable. Participants need terms that ensure their participation, that are immune to subgroups breaking away, and that allow new members to join. Under cash-additive risk measures, the minimum cost of a coalition's risk determines the value created by that coalition, and deterministic side payments redistribute that value among participants. Institutional risk sharing is thus a transferable-utility cooperative game. We prove that the game is totally balanced whenever the risk measures are convex (agents are risk averse), so every coalition has a nonempty core and stable allocations always exist. We then analyze entry monotonicity through Population-Monotonic Allocation Schemes (Sprumont, 1990), a strong requirement that is notoriously difficult to construct and has received limited attention in risk sharing. We find several structural conditions that ensure that either the Arrow--Debreu pricing surplus allocation rule or the proportional-cost surplus allocation rule satisfies this entry-monotonicity property, the latter being a novel cooperative notion we propose. These verifiable structural conditions naturally arise in pooled (re)insurance and credit portfolios, providing pool designers with a practical toolkit for building risk pools that remain stable and attractive as they expand.


[6] 2606.00989

Recession Detection Using Real Time GDP Data

This paper examines whether real-time GDP announcements can reliably identify business-cycle turning points. Using U.S. real-time GDP vintages from 1947 to 2021, we construct 4,356 recession indicators based on alternative smoothing methods and scaling variations. We then combine these indicators with alternative thresholds to generate 137,457 perfect recession classifiers. The selected classifiers identify all 12 historical recessions without generating false positives or false negatives. Restricting attention to the high-precision segment yields two classifiers with a standard deviation of detection errors below three months, while the selected ensemble signals recessions, on average, 3.04 months after their official onset. The framework accurately identifies recession episodes across vintages, suggesting that discrepancies in prior work may reflect limitations of traditional dating methods in addition to data revisions. Overall, the results indicate that real-time GDP announcements provide a practical proxy for NBER-style recession dating.


[7] 2606.01018

Self-Duality and Transfer in Voting Games

This paper studies the role of self-duality in the axiomatization of the Shapley-Shubik power index. We show that, when self-duality is imposed, the transfer axiom can be weakened to a restricted version requiring transfer only among voting games with no veto player. This result shows that self-duality can partially substitute for the transfer axiom.


[8] 2606.01137

Digital Maturity and Technical Efficiency in NHS Acute Trusts: Cross-Sectional Evidence from England

Whether investment in digital health technology is associated with differences in hospital productivity is a question of substantial policy relevance, yet interpretation is constrained by challenges in causal identification and prior evidence is mixed. Technical efficiency in NHS acute hospital trusts in England is estimated using Bayesian stochastic frontier analysis. A four-input Cobb--Douglas production function incorporating clinical full-time equivalents, administrative full-time equivalents, non-labour expenditure, and physical capital derived from audited NHS accounts is fitted to 111 acute non-specialist trusts in 2024/25. Digital maturity, measured by the NHS Digital Maturity Assessment, is included in a trust-specific inefficiency equation alongside population deprivation, teaching status, and financial position controls. The composite digital maturity score is estimated to be negatively associated with technical inefficiency (\(\hat{\gamma} = -0.612\), 95\% credible interval \([-1.289, +0.005]\), \(P(\gamma < 0) = 0.974\)). Trusts in the highest digital maturity quartile are estimated to operate at 98.0\% of their production frontier compared with 93.2\% for the lowest quartile. This gap corresponds to approximately £20 million of additional cost-weighted activity per trust at mean output levels, or £1.1 billion in aggregate. Estimates are robust to functional form but are sensitive to the most conservative prior specification. Pillar-level analysis suggests that population health management and care pathway optimisation domains exhibit stronger associations with efficiency than other domains. Catchment deprivation is not estimated to have an independent association with efficiency after controlling for digital maturity.


[9] 2606.01234

Differing Roles of Leisure and Productivity in GDP - A Machine Learning based comparative analysis of Germany and USA

The GDP of a country is modelled as the relative interaction between two agents - working hours, reflecting the social choice of a population, and Total Factor Productivity, reflecting the collective investment in productivity enhancers. It is shown that a Random Forest model can accu- rately predict the GDP from these two factors. The differences in the choices made by Germany and USA are analysed though Gini importance, SHAP plots and partial dependency. It is shown that the differences in the social structure of the countries are reflected in the relative contribution of working hours and productivity to the GDP.


[10] 2606.01250

Cheap Talk in Bilateral Trade

A single seller offers one or more goods to a single buyer. The buyer's values and the seller's costs are private information. Each player has a commonly known prior over the other player's value or cost, supported on a finite set. What is the optimal selling mechanism? We argue that, despite this question's importance and apparent simplicity, prior work offers no satisfactory answer. If the seller simply chooses an optimal menu given her realized costs, she fails to exploit her informational advantage. At the other extreme, the optimal trade mechanism that satisfies IC/IR constraints for both parties fails in practice, as it conditions prices on the seller's unknown costs in an unenforceable way. The seller's realistic capabilities lie somewhere in between: she may leverage private information but lacks unlimited commitment power. To bridge this gap, we consider a solution concept built on the realistic assumption that the seller can commit to prices but nothing more. Similar -- albeit technically distinct -- solution concepts have been studied in the context of auctions with multiple buyers. Our concept proves surprisingly rich even with a single buyer. In our model, the buyer and seller engage in multiple rounds of cheap talk before the seller posts a menu of priced bundles. The buyer then purchases. We measure value as profit for the seller and consumer surplus for the buyer. We prove that with a single good cheap talk cannot help either party, but show that it creates value in any extension of this canonical setting: multiple goods, multiple units, interdependent values, or repeated play. We also show that multiple rounds of communication can yield strictly higher expected profit than a single round. Finally, we discuss how realistic factors beyond our stripped-down model combine with cheap talk to enhance this value even further.


[11] 2606.01307

Tracking the Economy through Firm Creation:Evidence from Real-Time Administrative Data

We introduce a novel real-time dataset, Companies House Real-Time (CHRT), that captures daily firm creation and dissolution activity for the full population of UK-registered companies. CHRT provides a timely measure of business formation, becoming available months before official business demography statistics. We show that incorporation activity leads taxable business births and contains forward-looking information about employment and output growth. Consistent with this, a structural vector autoregression (SVAR) indicates that positive shocks to firm entry generate persistent increases in employment and output.


[12] 2606.01390

Limit Continuous Poker: A Variant of Continuous Poker with Limited Bet Sizes

We introduce and analyze Limit Continuous Poker, a variant of Von Neumann's Continuous Poker with variable but limited bet sizes. This simplified variant of poker captures aspects of information asymmetry, bluffing, balancing, and the impact of bet size limits while still being simple enough to solve analytically. We derive the Nash equilibrium strategy profile for this game, showing how the bettor's and caller's strategies depend on the bet size limits. We demonstrate that as the bet size limits approach extreme values, the strategy profile converges to those of other continuous poker variants. Finally, we connect these results to strategic implications of limited bet sizing in real-world poker.


[13] 2606.01424

Technology Speed Limits

We study optimal technology regulation when private learning occurs both through doing (scaling up the technology) and through waiting (as time passes). We show that an adaptive speed limit -- a cap on the rate at which the technology can increase per-unit time -- delivers optimal worst-case guarantees over all learning processes and/or preferences, and is the only time-consistent mechanism that does so.


[14] 2606.01659

Data-Automated Policy Learning for Nonlinear Welfare

This paper explores policy learning from observational data, focusing on a nonlinear welfare criterion in a binary treatment setting. The nonlinear criterion is inspired by scenarios where policymakers prioritize specific population segments. We model this criterion using a utility function that encompasses potential outcomes and intermediate parameters, with the latter capturing higher moments of the outcome distributions. When formulated in the context of observational data, both the intermediate parameters and the welfare criterion depend on the propensity score, which we estimate using machine-learning techniques. To address bias in machine learning estimates, we introduce a novel reweighting-based debiasing approach that offers a promising alternative to traditional orthogonality-based methods. To tackle the complexities of infinite-dimensional policy spaces, we employ sieve approximations and $K$-fold cross-validation for model selection, thereby fully automating the policy-learning process. Despite these complexities, we demonstrate that both the welfare regret and the average welfare regret of our proposed policy learning method satisfy an oracle inequality, thereby providing theoretical guarantees on the performance of the estimated policy relative to the best possible policy. This finding extends the existing results from linear to nonlinear welfare criteria, from finite-dimensional to infinite-dimensional policy spaces, and from a known propensity score to a machine-learned one.


[15] 2606.01687

Information and voting: Evidence from Peru's 2026 presidential election

We study how election-night flash estimates shape voting in Peru's fragmented 2026 presidential election. We exploit a natural experiment: on April 12, 2026, 187 polling tables across 13 voting centers failed to install, and the \emph{Jurado Nacional de Elecciones} (JNE) extended voting for the affected $\approx\!55 000$ electors to Monday, April 13. These voters cast ballots after observing the Ipsos and Datum flash estimates; otherwise comparable Sunday voters did not. A Bayesian-updating model of multi-candidate plurality voting frames the analysis, yielding predictions about vote reallocation toward the three candidates the estimates rendered viable -- López Aliaga, Sánchez, and Nieto. We estimate treatment effects on candidate vote shares at both the \emph{acta} level and the acta-weighted polling-station level, comparing treated and control \emph{locales de votación} matched on pre-treatment covariates. How flash estimates reshape voting is of first-order importance for Peru, given its institutional instability and high political volatility over the past decade.


[16] 2606.01706

Higher-Order Debiased Estimators for General Treatment Models

It is now well known that estimators based on influence functions can be sub-optimal in terms of convergence rates in various settings. To address this issue, higher-order influence functions (HOIF) are developed, generalizing the classical semiparametric theory. However, most existing results in this regard focus on treatment effect parameters defined in explicit forms, such as average treatment effects (ATE). In applications, economists are often confronted with tasks of inferring more complex parameters, such as quantile treatment effects (QTE) or effects of complicated treatment regimes/policy. These more complex parameters can often only be implicitly defined as the solution to nonlinear estimating equations, which correspond to M/Z-estimation problems. Our current understanding of these problems is mainly limited to the classical semiparametric theory. Given the foundational role of HOIF for estimating explicit parameters such as ATE, a modest step toward enriching the statistical foundation of econometrics and causal inference is to develop the corresponding higher-order estimators for those more complex parameters. To this end, we consider parameters of a class of non-separable structural models in the econometrics literature and develop a class of higher-order estimators for the target parameters. Statistical properties of these higher-order estimators are derived using recent advances in U-processes theory. Our proposed higher-order estimators relax complexity-reducing assumptions, quantified by Holder smoothness, imposed on the nuisance parameters compared to existing alternative estimators for many important parameters in this class, including QTE and quantile dose-response functions, among others.


[17] 2606.02200

Random Set Quantile Estimation of Partially Identified Discrete Response Models

Semiparametric discrete choice models are widely applied in economics, yet a fundamental tension arises when covariates are discrete as regression coefficients that are point identified under continuous regressors may become only partially identified. We show that this is not merely an identification problem but creates serious estimation pathologies. Classical estimators, including the maximum score estimator of Manski (1975), not only have population maximizers that are outer regions of the identified set (Komarova (2013)) but also converge to a random set drawn from a finite collection of deterministic regions that partition that outer region. To resolve this failure, we introduce the Random Set Quantile (RSQ) estimator which extracts the $\tau$-quantile of the classical estimator for $\tau \in (1/2,1)$. We prove this result for a class of widely used models, which includes binary/multinomial choice and discrete outcome panel data models. This construction is consistent and locally robust across the full parameter space, including precisely those configurations where classical estimators break down. A feasible implementation based on the $m$-out-of-$n$ bootstrap inherits both properties. We apply the methodology to the 2019 UK General Election, where the discrete support of Brexit-related covariates generates the partial identification our theory analyzes.


[18] 2606.02213

A New Method for Finding the Schulze Winner Set

We propose a new voting algorithm based on the pairwise majority-comparison matrix derived from voters' preference profiles. We show that this algorithm induces exactly the winner set of the Schulze rule (Schulze, 1997). Our algorithm successively eliminates weaker candidates in terms of all-pairs comparisons, thereby reflecting a dual spirit to Condorcet's original idea of splitting preference cycles (de Condorcet, 1785). We further show that the direct sum of the survival sets obtained at each elimination round coincides with the Schwartz set (Schwartz, 1972). These two equivalence results provide a formal mathematical foundation for the ``folklore'' relationship between the Schulze winner set and the Schwartz set, as well as a new Condorcetian interpretation of the Schulze winner set.


[19] 2606.02234

When Do Treatment Changes Identify Causal Effects?

This paper clarifies the identifying assumptions underlying causal inference based on treatment changes rather than treatment levels, and their relationship to conventional identification strategies. We characterize two distinct structural models, with non-nested identifying assumptions, under which treatment-change identification is valid conditional on observed covariates. We demonstrate that the identifying assumptions relying on treatment changes are generally not nested with those of methods relying on treatment levels, such as selection-on-observables strategies that control for past outcomes, treatments, and covariates, or difference-in-differences approaches that difference outcomes rather than treatments over time. We show, however, that under a random-walk restriction on the treatment process, conditioning on treatment changes is equivalent to conditioning on treatment levels given lagged treatment. This and other equivalence results motivate overidentification tests by jointly considering methods based on treatment levels and changes. Beyond these tests, the non-nesting results carry a structural double robustness implication: an estimator that differences both the outcome and the treatment over time, such as two-way fixed effects regression, remains consistent if either the treatment-change assumption or the parallel-trends assumption holds, without requiring both simultaneously. We characterize the causal models consistent with each method, investigate finite-sample behavior in a simulation study, and present an empirical application to cigarette demand.


[20] 2606.02306

Delusions of Grandeur and Their Benefits (and Hazards)

We study a population-wide tournament in which agents, who care both about their absolute and relative wealth, experiment by searching over correlated objects. We explore the role of the agents' beliefs about the environment; namely, the stochastic processes corresponding to their experimentation. We find that although optimism leads to higher output, it also produces greater inequality. We connect these observations with empirical evidence suggesting a positive relationship between inequality and entrepreneurship.


[21] 2606.02348

Privacy-preserving Information Sharing in Oligopoly Competitions

Information sharing among competing suppliers can improve decision-making under uncertainty, yet strategic concerns regarding rival exploitation often deter voluntary disclosure. We study information-sharing mechanisms in a Cournot oligopoly with uncertain demand, where a platform aggregates suppliers' signals through privacy-preserving channels and may also possess an exogenous external signal. The central challenge is to balance strategic safety with informational utility: privacy noise reduces the exposure of individual signals, but also lowers the value of the shared information pool. We first characterize a baseline setting in which access to aggregated information is contingent on participation. In a two-firm market without an external signal, firms refuse to share regardless of the privacy level. In an \(n\)-firm market, sharing may arise even without privacy safeguards because non-participating firms lose access to the aggregated signal. Building on this baseline, we show that privacy protection alone is insufficient to incentivize disclosure; it must be combined with a sufficiently informative external signal. We further show that firms with more accurate private signals require stronger privacy protection. Overall, our results characterize the sharing-feasible region and highlight the complementarity between privacy design and the external information environment.


[22] 2606.02362

Endogenous Fertility Waves and the Dynamics of Utility in an Overlapping Generations Model

This paper investigates the conditions under which the Easterlin hypothesis holds within a neoclassical overlapping generations model with endogenous capital accumulation, wages, interest rates, and fertility. We develop a tractable analytical framework that maps economic transitions into utility space via a continuously differentiable first-order difference equation for cohort lifetime utilities. This reformulation allows for a transparent normative evaluation of non-steady-state paths without requiring explicit solutions to the underlying nonlinear system. Within this framework, we show that when fertility cycles emerge and children are normal goods, the utility of small cohorts strictly exceeds that of large cohorts. Crucially, this cohort-welfare asymmetry is driven by fertility preferences and is independent of the economy's position relative to the golden rule.


[23] 2606.02503

Pay Beliefs and the Amenity-Pay Tradeoff

This paper studies how workers' beliefs about pay shape the tradeoffs between pay and workplace amenities. We design a multi-stage incentivized survey experiment that combines hypothetical choice experiments with elicited beliefs about starting salaries in real jobs and randomly varies the provision of explicit pay information. Although stated preferences imply sizable willingness to pay for amenities consistent with prior literature, baseline beliefs about salaries in real jobs are systematically biased along two margins: respondents under-predict starting salaries by 18% and expect higher-amenity jobs to pay more, substantially over-predicting the amenity-pay gradient. Exposure to pay information raises mean pay beliefs for similar jobs by 4% and reduces belief dispersion by 15%, but does not alter the strong positive association between perceived pay and advertised amenities, leaving the amenity-pay tradeoffs in stated choices essentially unchanged. While workers have strong preferences for workplace amenities, the tradeoffs they perceive deviate sharply from those present under full information.


[24] 2606.00071

Bitcoin Price Prediction: Peer-Reviewed Evidence and Social Media Discourse

Bitcoin price prediction has attracted hundreds of academic papers and continuous social media debate, yet the field lacks consensus on even basic questions: can any model beat a naive "today's price" baseline at horizons of one to six months? We survey the peer-reviewed landscape, categorize papers by evaluation methodology, and contrast academic findings with informal but substantive discourse on X/Twitter. The picture that emerges is sobering. At short-to-medium horizons, no peer-reviewed study has shown robust superiority over the naive baseline across multiple market regimes. Daily predictability is real but does not extend to hourly or monthly horizons, and may not survive transaction costs. The stock-to-flow model has failed formal out-of-sample testing, and Metcalfe's Law valuations have been challenged as spurious. The Bitcoin price power law, while empirically compelling, has not been subjected to formal distributional tests. Meanwhile, social media practitioners raise valid statistical critiques -- ordinary least squares (OLS) violations, backtest overfitting, spurious regressions -- that the academic literature has not formalized. We identify open research directions and propose concrete methodological standards for future work -- walk-forward evaluation, multi-regime holdout windows, naive baseline comparison, inclusion of zero in hyperparameter grids, and Diebold-Mariano significance testing -- arguing that the field's primary need is not more models but better evaluation.


[25] 2606.00970

Prospect-Theory Behavior from Bellman Optimality in MDPs with Catastrophic States

We study risk-neutral control in Markov decision processes with an absorbing catastrophic state. Even though rewards are linear and the agent has no utility curvature, probability weighting, or framing dependence, standard Bellman optimality produces three prospect-theory-like signatures: an S-shaped value-function profile (convex near catastrophe, concave in the far field), an endogenous loss-sensitivity coefficient $\lambda^*(S) > 1$, and a reflection-effect policy reversal. Across 495 configurations, the optimal policy plays safe near catastrophe in positive-drift (growth) regimes despite the risky action's higher immediate expected value, and plays risky near catastrophe in negative-drift (decline) regimes despite the safe action's lower immediate expected loss. We derive a closed-form expression for the asymptotic loss-aversion plateau $\bar{\lambda}$ that depends only on win probability $p$, payoff asymmetry $r = |\Delta_\ell/\Delta_w|$, and discount factor $\beta$, and matches numerical solutions to $R^2 = 0.999$. The mechanism does not require asymmetric payoffs. Across a sweep of $(p,\beta)$ at three asymmetry levels, the asymmetry share of $\bar{\lambda}$ above unity has median 4.6% at $r = 1.25$ and rises to 13.9% at $r = 2$, with the boundary contribution exceeding the asymmetry contribution in every cell tested. The phenomena persist under tabular Q-learning (a model-free agent reproduces $V^*$ at correlation 0.98 in growth and 1.00 in decline) and under stochastic transitions with Gaussian, heavy-tailed Student-$t_3$, and asymmetric skew-normal noise up to 50% of the step size, where the asymptotic plateau tracks the closed-form prediction within 0.41% for safe-channel noise and within 9.6% for risky-channel or both-channel noise. These results identify absorbing failure states as a sufficient structural mechanism for prospect-theory-like behavior under optimal control.


[26] 2606.01553

Structural Change Detection in High-Dimensional Transformed Factor Models via Canonical Correlation Analysis

This paper develops a canonical-correlation-based method for detecting structural changes in high-dimensional transformed factor models. The proposed approach exploits the low-rank canonical-correlation structure induced by dynamically dependent common factors, while serially uncorrelated idiosyncratic components correspond to a noise subspace with zero canonical correlations. We construct an eigenvalue-ratio criterion that measures residual dynamic dependence in the estimated noise subspace and identifies the true change point under sufficient separation of the regime-specific loading spaces or dynamic canonical correlation structures. Since the change-point location and the regime-specific factor numbers are both unknown, we further propose an alternating iterative estimation procedure that updates them sequentially until convergence. Under suitable mixing and moment conditions, we establish asymptotic properties of the proposed estimators, with convergence rates depending explicitly on factor strength, cross-sectional dimension, and sample size. Monte Carlo experiments and empirical applications to intraday stock returns and U.S. temperature series demonstrate the finite-sample


[27] 2606.02095

Testing Decision Makers without Counterfactuals

A decision-maker (DM) repeatedly makes choices under uncertainty in a bandit environment, where only the realization of the chosen arm is observed. Another competing agent, the adviser (AD), repeatedly provides recommendations, but the realizations of these recommendations are unobserved unless they coincide with the DM's choice. Both agents possess partial information about the arms' realizations. The central question we focus on is whether, in the long run, an outside observer can identify which agent is more informed based solely on the observed decisions, recommendations, and arm realizations. A test selects one of the agents based on the observed data. We focus primarily on the class of scoring tests, which assign a numerical score to each observation and select the agent according to the average score. We study strategic agents whose objective is to be selected by the test. For simultaneous arm choices, we show that there exists a scoring test that successfully identifies the more-informed agent. For sequential arm choices, however, no such scoring test exists. Finally, we explore the tension between identifying the more-informed agent and maximizing welfare. A DM whose objective is to pass the test may not necessarily make welfare-maximizing decisions. In a binary-arm environment, we show that no scoring test can simultaneously identify the more informed agent and achieve more than half of the welfare attained by welfare-maximizing decisions.


[28] 2308.01596

Individual Shrinkage for Random Effects

This paper develops an approach to random effects estimation and individual-level forecasting in micropanels that targets individual accuracy rather than aggregate performance. The conventional shrinkage methods used in the literature, such as the James-Stein estimator and Empirical Bayes, target aggregate performance and can lead to inaccurate decisions at the individual level. We propose a class of shrinkage estimators with individual weights (IW) that leverage an individual's own history, instead of the cross-sectional dimension. This approach can help overcome the "tyranny of the majority" inherent in existing methods, while relying on weaker assumptions. A key contribution is addressing the challenge of obtaining feasible weights from short time-series data under parameter heterogeneity. We discuss the theoretical optimality of IW and recommend using feasible weights determined through a Minimax Regret analysis in practice.


[29] 2402.14189

Optimal transmission expansion modestly reduces decarbonization costs of U.S. electricity

Major government studies and policy reports project that substantial expansion of interregional transmission will be needed to integrate clean energy and ensure reliability in decarbonized power systems. Using the open-source Switch capacity expansion model with detailed representation of existing U.S. generation and transmission infrastructure, solar, wind, and storage resources, and hourly operations, we evaluate the role of interregional transmission across least-cost, carbon-priced, and zero-emissions scenarios for 2050. An optimal nationwide plan would more than triple interregional transmission capacity, yet this reduces the cost of a zero emissions system by only 7% relative to relying on existing interregional transmission, as storage, solar and wind siting, and nuclear generation serve as close substitutes. Regional cost and rent effects vary, with transmission generally favoring wind and hydrogen resources over solar and batteries. Sensitivity analysis shows diminishing returns: one-fifth of the benefits of full expansion can be achieved with one-twelfth of the added capacity, while cost reductions for batteries and hydrogen provide comparable or greater system savings than interregional transmission. Upgrading existing interregional corridors with advanced conductors roughly doubling capacity per link at half the cost of new builds reduces system costs by only 1.6%, suggesting that reconductoring benefits are modest and that realizing their full potential likely requires pairing with new connections on key corridors or complementary reductions in battery costs. These results suggest that while substantial transmission expansion is economically justified, a diverse set of flexibility resources can substitute for large-scale grid build out, and the relative value of transmission is highly contingent on technological and cost developments.


[30] 2406.13122

Testing for Underpowered Literatures

How many experimental studies would have come to different conclusions had they been run on larger samples? I show how to estimate the expected number of statistically significant results that a set of experiments would have reported had their sample sizes all been counterfactually increased. The proposed deconvolution estimator is asymptotically normal and adjusts for publication bias. Unlike related methods, this approach requires no assumptions of any kind about the distribution of true intervention treatment effects and allows for point masses. Simulations find good coverage even when the t-score is only approximately normal. An application to randomized trials (RCTs) published in economics journals finds that doubling every sample would increase the power of t-tests by 7.2 percentage points on average. This effect is smaller than for non-RCTs and comparable to systematic replications in laboratory psychology where previous studies enabled more accurate power calculations. This suggests that RCTs are on average relatively insensitive to sample size increases. Research funders who wish to raise power should generally consider sponsoring better-measured and higher quality experiments -- rather than only larger ones.


[31] 2410.17105

General Seemingly Unrelated Local Projections

We develop a flexible framework for Bayesian estimation of impulse responses using Local Projections (LPs) with instrumental variables. It accommodates multiple shocks and instruments, accounts for autocorrelation in multi-step forecasts by jointly modeling all LPs as a seemingly unrelated system of equations, defines a flexible yet parsimonious joint prior for impulse responses based on a Gaussian Process, and allows for joint inference about the entire vector of impulse responses. We show via Monte Carlo simulations that our approach delivers more accurate point and uncertainty estimates than standard methods. To address misspecification, we propose an optional robustification step based on power posteriors.


[32] 2504.03766

Renewable Natural Resources, Regime Shift and Hysteresis

Many of the world's renewable resources are in decline. Optimal harvests with smooth recruitment is well studied but in recent years, ecologists have concluded that tipping points in recruitment are common. Recruitment with a tipping point has low-fecundity below the tipping point and high-fecundity above. When the discounted value of high-fecundity is sufficiently high, there is a high-fecundity steady-state. This steady-state is stable but in some cases, small perturbations may result in large, temporary reductions in recruitment and harvests. Below the tipping point, a low-fecundity steady-state need not exist. When a low-fecundity steady-state does exist, there is an endogenous tipping (Skiba) point: below, harvests converge to the low-fecundity steady-state and above, an austere harvest policy transitions the renewable resource to high-fecundity recruitment. If there is hysteresis in recruitment, the high steady-state may not be stable. Moreover, if the high-/low-fecundity differential is large then following a downward perturbation, fecundity optimally remains low.


[33] 2505.17648

Simulating Macroeconomic Expectations in Survey Experiments with LLM-based Economic Agents

We introduce a framework for simulating macroeconomic expectations in survey experiments using LLM-based economic agents (LLM Agents). We construct LLM Agents equipped with several functional modules that retrieve personal characteristics, prior expectations, and dynamic external information. We validate our framework by recapitulating three representative survey designs covering various expectations across different types of respondents. Our results show that LLM Agents generate expectation distributions highly similar to human data and capture human-aligned qualitative patterns in open-ended responses. Evaluation reveals that priors are crucial for matching distributions, whereas personal and external information drive human-like thought processes. Our findings offer guidance for narrowing the belief gap between generative AI and humans at the aggregate level while delineating the boundaries of the framework.


[34] 2510.12049

Generative AI and Sales Productivity: Field Experiments in Online Retail

We quantify the short-term impact of Generative Artificial Intelligence (GenAI) on sales performance through a series of large-scale randomized field experiments involving millions of users and products at a leading cross-border online retail platform. Over 2023-2024, the platform integrated GenAI into seven consumer-facing business workflows spanning customer service, consumer-product matching, advertising, and seller services. We find that GenAI adoption increases sales in most workflows, with effects ranging from no detectable impact to $16.3\%$, depending on GenAI's marginal contribution relative to baseline firm practices. Across the four GenAI applications with positive sales effects, the implied annual incremental value is roughly $\$5-$an economically meaningful impact given the retailer's scale and the early stage of GenAI adoption. The gains operate primarily through higher conversion rates rather than larger cart values, consistent with GenAI improving the shopping experience by reducing search, information, communication, and personalization frictions. Importantly, these effects are not associated with worse post-purchase outcomes, as product return rates and customer ratings do not deteriorate. Finally, we document substantial demand-side heterogeneity, with larger gains for less experienced consumers. Our findings provide novel, large-scale causal evidence on how GenAI shapes sales productivity in online retail, highlighting both its immediate value and broader potential.


[35] 2511.00478

Existence of Equilibria in Large Competitive Markets with Bads, Production and Comprehensive Externalities

This paper establishes existence of equilibrium in a measure-theoretic general equilibrium (MGE) model with production, bads, and comprehensive externalities. These features are jointly essential for modeling perfect competition in which emissions of production byproducts impose harm on agents. We show that, when bads and externalities are modeled in an economically natural way, equilibrium exists. This yields the first existence theorem with bads for MGE models, the benchmark for perfect competition, overcoming Hara (2005)'s nonexistence example. The proof uses nonstandard analysis, which provides a systematic technique to extend results for finite to infinite models.


[36] 2511.19627

Total Factor Productivity and its determinants: an analysis of the relationship at firm level through unsupervised learning techniques

The paper is related to the identification of firm's features which serve as determinants for firm's total factor productivity through unsupervised learning techniques (principal component analysis, self organizing maps, clustering). This bottom-up approach can effectively manage the problem of the heterogeneity of the firms and provides new ways to look at firms' standard classifications. Using the large sample provided by the ORBIS database, the analyses covers the years before the outbreak of Covid-19 (2015-2019) and the immediate post-Covid period (year 2020). It has been shown that in both periods, the main determinants of productivity growth are related to profitability, credit/debts measures, cost and capital efficiency, and effort/efficiency of the R&D activity conducted by the firms. Finally, a linear relationship between determinants and productivity growth has been found.


[37] 2512.10112

Power and Freedom in Mechanisms

In a strategy-proof mechanism, the influence of an agent may be measured as the set of outcomes an agent can bring about by varying her (reported) type. More specifically, we refer to an agent's influence on her own relevant outcomes as her freedom, and to the influence on outcomes relevant for other agents as her power over others. The framework generalises both the notion of opportunity set from the freedom of choice literature, and established power indices for binary voting. It identifies constrained efficient mechanisms as those that maximise agents' freedom. Applying our framework to the analysis of assignment rules, we provide novel characterisations of the top trading cycles rule and bipolar serial dictatorships in terms of their freedom and power properties.


[38] 2602.09967

Incentive Pareto Efficiency in Monopoly Insurance Markets with Adverse Selection

We study a monopolistic insurance market with hidden information, where the agent's type $\theta$ is private information that is unobservable to the insurer, and it is drawn from a continuum of types. The hidden type affects both the loss distribution and the risk attitude of the agent. Within this framework, we show that a menu of contracts is incentive efficient if it maximizes social welfare function, subject to incentive compatibility and individual rationality constraints. This holds for general utility functionals. In the special case of Yaari Dual Utility, we provide two partial converse statements to this result, and we give a semi-explicit characterization of optimal solutions to the social welfare maximization problem. We do this under two different settings: (i) the first assumes that types are ordered in a way such that larger values of $\theta$ correspond to more risk-averse types who face stochastically larger losses; whereas (ii) the second assumes that larger values of $\theta$ correspond to less risk-averse types who face stochastically larger losses. In both settings, the structure of optimal menus of contracts depends on the level of the social welfare weight, and we examine several properties thereof.


[39] 2602.16733

Scaling Reproducibility: An AI-Assisted Workflow for Large-Scale Replication and Reanalysis

Computational reproducibility is central to scientific credibility, yet verifying published results at scale remains costly. We develop an AI-assisted workflow for automated full-paper replication -- retrieving materials, reconstructing environments, executing code, and matching outputs to point estimates reported in regression tables. We define a universe of all empirical and quantitative papers from the three top political science journals (2010--2025) and measure stated data availability using automated extraction. For a stratified sample of 384 studies, we apply the workflow to conduct full-paper replication, totaling 3,523 empirical models. We find that journal verification requirements, combined with data archiving mandates, drive reproducibility: the share of fully or largely reproducible papers rises from 20.8% before DA-RT adoption to 82.5% after, and conditional on accessible replication packages, 92.1% of papers are fully or largely reproducible (234/254). As a secondary application, we apply standardized IV diagnostics to 84 studies (597 IV specifications among 1,910 replicated models), illustrating how automated execution enables systematic reanalysis across heterogeneous empirical settings.


[40] 2605.28026

Information Acquisition with $α$-Divergence Costs

Building on the $f$-information model of Bloedel et al. (2025), this paper introduces a one-parameter family of information acquisition models and characterizes optimal information acquisition. This family extends the mutual information model (Matějka and McKay, 2015) while preserving its analytical tractability. The information cost is derived from the $\alpha$-divergence, which nests the KL-divergence ($\alpha=-1$), the reverse KL-divergence ($\alpha=1$), and the squared Hellinger distance ($\alpha=0$), and is represented in closed form via the $\alpha$-integration of Amari (2007). The optimal choice probabilities belong to the $q$-exponential family, which appears in nonextensive statistical mechanics (Tsallis, 1988) and in the $q$-logit model of traffic route choice (Nakayama, 2013). This family reduces to the modified logit in the mutual information case (Matějka and McKay, 2015). We also show that $\alpha$ determines how payoff levels affect the set of actions chosen with positive probability in each state.


[41] 2605.30435

Global Science Sustains U.S. Innovation

Like physical products, new technologies are developed using globally sourced inputs. Yet while the supply chains behind physical goods are well understood, we know far less about the international supply chain of scientific knowledge that powers U.S. innovation, or how vulnerable it may be to disruption. Here, I uncover this supply chain by tracing multi-generational citation paths connecting NSF-funded research to downstream patents, and stress-test it by simulating barriers to scientific knowledge flows across the U.S. border. The U.S. knowledge supply chain extends globally, and frictions impeding the movement of ideas across the U.S. border reduce its connectivity, extend its length, and lower innovation productivity. These impacts extend to technology areas deemed critical to national priorities by U.S. Congress, including Semiconductors, Quantum Science, and AI.


[42] 1904.09888

Penney's Game Odds From No-Arbitrage

Penney's game is a two-player zero-sum game in which each player chooses a three-flip pattern of heads and tails, and the winner is the player whose pattern occurs first in repeated tosses of a fair coin. Because the players choose sequentially, the second mover has the advantage. In fact, for any three-flip pattern, there is another three-flip pattern that is strictly more likely to occur first. This paper provides a novel no-arbitrage argument that generates the winning odds corresponding to any pair of distinct patterns. The resulting formula is equivalent to that generated by Conway's ``leading number'' algorithm. The accompanying betting-odds intuition adds insight into why Conway's algorithm works. The proof is simple and easy to generalize to games involving more than two outcomes, unequal probabilities, and competing patterns of various lengths. Additional results on the expected duration of Penney's game are presented.


[43] 2512.07526

Strategic Preemption Under Shared Catastrophic Risk: The Suicide Region and the Race to Artificial General Intelligence

We analyze a continuous-time preemption game with shared catastrophic externalities. When the cost of catastrophe is embedded in both players' payoffs, the risk term cancels out in the equilibrium indifference condition. This creates a "suicide region" where competitive pressures force rational agents to deploy despite negative risk-adjusted net present values. We apply this framework to the race for artificial general intelligence (AGI). We show that this suicide region widens as the cost of systemic ruin grows: higher catastrophic risk does not deter the race but instead enlarges the set of conditions under which rational actors deploy despite negative social value. We characterize the resulting welfare distortion against a social planner's benchmark and demonstrate how two complementary mechanisms - private liability and prize-sharing - can close the suicide region. Private liability raises the cost of unsafe deployment while prize-sharing reduces the strategic imperative to deploy first. "Warning shots" (sub-existential disasters) will fail to deter AGI acceleration, as the winner-takes-all nature of the race remains intact.