Quantitative Finance - Academus scientific article reader

From viral jokes to a billion-dollar phenomenon, meme coins have become one of the most popular segments in cryptocurrency markets. Unlike utility-focused crypto assets like Bitcoin or Ethereum, meme coins derive value primarily from community sentiment, making them vulnerable to manipulation. This study presents a cross-chain analysis of the meme coin ecosystem, examining 34,988 tokens across Ethereum, BNB Smart Chain, Solana, and Base. We characterize the tokenomics of meme coins and track their growth in a three-month longitudinal analysis. We discover that among high-return tokens (>100%), an alarming 82.6% show evidence of extensive use of artificial growth strategies designed to create a misleading appearance of market interest. These include wash trading and a form of manipulation we define as Liquidity Pool-Based Price Inflation (LPI), where small strategic purchases trigger dramatic price increases. We also find evidence of schemes designed to profit at the expense of investors, such as pump and dumps and rug pulls. In particular, most of the tokens involved had previously experienced wash trading or LPI, indicating how initial manipulations often set the stage for later exploitation. These findings reveal that manipulations are widespread among high-performing meme coins and suggest that their dramatic gains are often likely driven by coordinated efforts rather than natural market dynamics.

[2] 2507.01964

Forecasting Nigerian Equity Stock Returns Using Long Short-Term Memory Technique

Investors and stock market analysts face major challenges in predicting stock returns and making wise investment decisions. The predictability of equity stock returns can boost investor confidence, but it remains a difficult task. To address this issue, a study was conducted using a Long Short-term Memory (LSTM) model to predict future stock market movements. The study used a historical dataset from the Nigerian Stock Exchange (NSE), which was cleaned and normalized to design the LSTM model. The model was evaluated using performance metrics and compared with other deep learning models like Artificial and Convolutional Neural Networks (CNN). The experimental results showed that the LSTM model can predict future stock market prices and returns with over 90% accuracy when trained with a reliable dataset. The study concludes that LSTM models can be useful in predicting financial time-series-related problems if well-trained. Future studies should explore combining LSTM models with other deep learning techniques like CNN to create hybrid models that mitigate the risks associated with relying on a single model for future equity stock predictions.

[3] 2507.01968

Optimising task allocation to balance business goals and worker well-being for financial service workforces

Purpose: Financial service companies manage huge volumes of data which requires timely error identification and resolution. The associated tasks to resolve these errors frequently put financial analyst workforces under significant pressure leading to resourcing challenges and increased business risk. To address this challenge, we introduce a formal task allocation model which considers both business orientated goals and analyst well-being. Methodology: We use a Genetic Algorithm (GA) to optimise our formal model to allocate and schedule tasks to analysts. The proposed solution is able to allocate tasks to analysts with appropriate skills and experience, while taking into account staff well-being objectives. Findings: We demonstrate our GA model outperforms baseline heuristics, current working practice, and is applicable to a range of single and multi-objective real-world scenarios. We discuss the potential for metaheuristics (such as GAs) to efficiently find sufficiently good allocations which can provide recommendations for financial service managers in-the-loop. Originality: A key gap in existing allocation and scheduling models, is fully considering worker well-being. This paper presents an allocation model which explicitly optimises for well-being while still improving on current working practice for efficiency.

[4] 2507.01970

News Sentiment Embeddings for Stock Price Forecasting

This paper will discuss how headline data can be used to predict stock prices. The stock price in question is the SPDR S&P 500 ETF Trust, also known as SPY that tracks the performance of the largest 500 publicly traded corporations in the United States. A key focus is to use news headlines from the Wall Street Journal (WSJ) to predict the movement of stock prices on a daily timescale with OpenAI-based text embedding models used to create vector encodings of each headline with principal component analysis (PCA) to exact the key features. The challenge of this work is to capture the time-dependent and time-independent, nuanced impacts of news on stock prices while handling potential lag effects and market noise. Financial and economic data were collected to improve model performance; such sources include the U.S. Dollar Index (DXY) and Treasury Interest Yields. Over 390 machine-learning inference models were trained. The preliminary results show that headline data embeddings greatly benefit stock price prediction by at least 40% compared to training and optimizing a machine learning system without headline data embeddings.

[5] 2507.01971

DeepSupp: Attention-Driven Correlation Pattern Analysis for Dynamic Time Series Support and Resistance Levels Identification

Support and resistance (SR) levels are central to technical analysis, guiding traders in entry, exit, and risk management. Despite widespread use, traditional SR identification methods often fail to adapt to the complexities of modern, volatile markets. Recent research has introduced machine learning techniques to address the following challenges, yet most focus on price prediction rather than structural level identification. This paper presents DeepSupp, a new deep learning approach for detecting financial support levels using multi-head attention mechanisms to analyze spatial correlations and market microstructure relationships. DeepSupp integrates advanced feature engineering, constructing dynamic correlation matrices that capture evolving market relationships, and employs an attention-based autoencoder for robust representation learning. The final support levels are extracted through unsupervised clustering, leveraging DBSCAN to identify significant price thresholds. Comprehensive evaluations on S&P 500 tickers demonstrate that DeepSupp outperforms six baseline methods, achieving state-of-the-art performance across six financial metrics, including essential support accuracy and market regime sensitivity. With consistent results across diverse market conditions, DeepSupp addresses critical gaps in SR level detection, offering a scalable and reliable solution for modern financial analysis. Our approach highlights the potential of attention-based architectures to uncover nuanced market patterns and improve technical trading strategies.

[6] 2507.01972

Accelerated Portfolio Optimization and Option Pricing with Reinforcement Learning

We present a reinforcement learning (RL)-driven framework for optimizing block-preconditioner sizes in iterative solvers used in portfolio optimization and option pricing. The covariance matrix in portfolio optimization or the discretization of differential operators in option pricing models lead to large linear systems of the form $\mathbf{A}\textbf{x}=\textbf{b}$. Direct inversion of high-dimensional portfolio or fine-grid option pricing incurs a significant computational cost. Therefore, iterative methods are usually used for portfolios in real-world situations. Ill-conditioned systems, however, suffer from slow convergence. Traditional preconditioning techniques often require problem-specific parameter tuning. To overcome this limitation, we rely on RL to dynamically adjust the block-preconditioner sizes and accelerate iterative solver convergence. Evaluations on a suite of real-world portfolio optimization matrices demonstrate that our RL framework can be used to adjust preconditioning and significantly accelerate convergence and reduce computational cost. The proposed accelerated solver supports faster decision-making in dynamic portfolio allocation and real-time option pricing.

[7] 2507.01973

Integration of Wavelet Transform Convolution and Channel Attention with LSTM for Stock Price Prediction based Portfolio Allocation

Portfolio allocation via stock price prediction is inherently difficult due to the notoriously low signal-to-noise ratio of stock time series. This paper proposes a method by integrating wavelet transform convolution and channel attention with LSTM to implement stock price prediction based portfolio allocation. Stock time series data first are processed by wavelet transform convolution to reduce the noise. Processed features are then reconstructed by channel attention. LSTM is utilized to predict the stock price using the final processed features. We construct a portfolio consists of four stocks with trading signals predicted by model. Experiments are conducted by evaluating the return, Sharpe ratio and max drawdown performance. The results indicate that our method achieves robust performance even during period of post-pandemic downward market.

[8] 2507.01979

Forecasting Labor Markets with LSTNet: A Multi-Scale Deep Learning Approach

We present a deep learning approach for forecasting short-term employment changes and assessing long-term industry health using labor market data from the U.S. Bureau of Labor Statistics. Our system leverages a Long- and Short-Term Time-series Network (LSTNet) to process multivariate time series data, including employment levels, wages, turnover rates, and job openings. The model outputs both 7-day employment forecasts and an interpretable Industry Employment Health Index (IEHI). Our approach outperforms baseline models across most sectors, particularly in stable industries, and demonstrates strong alignment between IEHI rankings and actual employment volatility. We discuss error patterns, sector-specific performance, and future directions for improving interpretability and generalization.

[9] 2507.01980

Detecting Fraud in Financial Networks: A Semi-Supervised GNN Approach with Granger-Causal Explanations

Fraudulent activity in the financial industry costs billions annually. Detecting fraud, therefore, is an essential yet technically challenging task that requires carefully analyzing large volumes of data. While machine learning (ML) approaches seem like a viable solution, applying them successfully is not so easy due to two main challenges: (1) the sparsely labeled data, which makes the training of such approaches challenging (with inherent labeling costs), and (2) lack of explainability for the flagged items posed by the opacity of ML models, that is often required by business regulations. This article proposes SAGE-FIN, a semi-supervised graph neural network (GNN) based approach with Granger causal explanations for Financial Interaction Networks. SAGE-FIN learns to flag fraudulent items based on weakly labeled (or unlabelled) data points. To adhere to regulatory requirements, the flagged items are explained by highlighting related items in the network using Granger causality. We empirically validate the favorable performance of SAGE-FIN on a real-world dataset, Bipartite Edge-And-Node Attributed financial network (Elliptic++), with Granger-causal explanations for the identified fraudulent items without any prior assumption on the network structure.

[10] 2507.01983

Comparing Bitcoin and Ethereum tail behavior via Q-Q analysis of cryptocurrency returns

The cryptocurrency market presents both significant investment opportunities and higher risks relative to traditional financial assets. This study examines the tail behavior of daily returns for two leading cryptocurrencies, Bitcoin and Ethereum, using seven-parameter estimates from prior research, which applied the Generalized Tempered Stable (GTS) distribution. Quantile-quantile (Q-Q) plots against the Normal distribution reveal that both assets exhibit heavy-tailed return distributions. However, Ethereum consistently shows a greater frequency of extreme values than would be expected under its Bitcoin-modeled counterpart, indicating more pronounced tail risk.

[11] 2507.01985

A unified model of horizontal differentiation with general spaces and irrational consumers

We introduce a new microeconomic model of horizontal differentiation that unifies and extends previous developments inspired by the seminal work of Hotelling (1929). Our framework incorporates boundedly rational consumers, an unlimited number of firms, and arbitrary differentiation spaces with Riemannian manifolds. We argue that Riemannian geometry provides a natural and powerful tool for analyzing such models, offering fresh insights into firm behavior and market structure with complex products.

[12] 2507.01987

Predicting and Explaining Customer Data Sharing in the Open Banking

The emergence of Open Banking represents a significant shift in financial data management, influencing financial institutions' market dynamics and marketing strategies. This increased competition creates opportunities and challenges, as institutions manage data inflow to improve products and services while mitigating data outflow that could aid competitors. This study introduces a framework to predict customers' propensity to share data via Open Banking and interprets this behavior through Explanatory Model Analysis (EMA). Using data from a large Brazilian financial institution with approximately 3.2 million customers, a hybrid data balancing strategy incorporating ADASYN and NEARMISS techniques was employed to address the infrequency of data sharing and enhance the training of XGBoost models. These models accurately predicted customer data sharing, achieving 91.39% accuracy for inflow and 91.53% for outflow. The EMA phase combined the Shapley Additive Explanations (SHAP) method with the Classification and Regression Tree (CART) technique, revealing the most influential features on customer decisions. Key features included the number of transactions and purchases in mobile channels, interactions within these channels, and credit-related features, particularly credit card usage across the national banking system. These results highlight the critical role of mobile engagement and credit in driving customer data-sharing behaviors, providing financial institutions with strategic insights to enhance competitiveness and innovation in the Open Banking environment.

[13] 2507.01989

Currents Beneath Stability: A Stochastic Framework for Exchange Rate Instability Using Kramers Moyal Expansion

Understanding the stochastic behavior of currency exchange rates is critical for assessing financial stability and anticipating market transitions. In this study, we investigate the empirical dynamics of the USD exchange rate in three economies, including Iran, Turkey, and Sri Lanka, through the lens of the Kramers-Moyal expansion and Fokker-Planck formalism. Using log-return data, we confirm the Markovian nature of the exchange rate fluctuations, enabling us to model the system with a second-order Fokker-Planck equation. The inferred Langevin coefficients reveal a stabilizing linear drift and a nonlinear, return-dependent diffusion term, reflecting both regulatory effects and underlying volatility. A rolling-window estimation of these coefficients, paired with structural breakpoint detection, uncovers regime shifts that align with major political and economic events, offering insight into the hidden dynamics of currency instability. This framework provides a robust foundation for detecting latent transitions and modeling risk in complex financial systems.

[14] 2507.01990

Integrating Large Language Models in Financial Investments and Market Analysis: A Survey

Large Language Models (LLMs) have been employed in financial decision making, enhancing analytical capabilities for investment strategies. Traditional investment strategies often utilize quantitative models, fundamental analysis, and technical indicators. However, LLMs have introduced new capabilities to process and analyze large volumes of structured and unstructured data, extract meaningful insights, and enhance decision-making in real-time. This survey provides a structured overview of recent research on LLMs within the financial domain, categorizing research contributions into four main frameworks: LLM-based Frameworks and Pipelines, Hybrid Integration Methods, Fine-Tuning and Adaptation Approaches, and Agent-Based Architectures. This study provides a structured review of recent LLMs research on applications in stock selection, risk assessment, sentiment analysis, trading, and financial forecasting. By reviewing the existing literature, this study highlights the capabilities, challenges, and potential directions of LLMs in financial markets.

[15] 2507.01991

FinAI-BERT: A Transformer-Based Model for Sentence-Level Detection of AI Disclosures in Financial Reports

The proliferation of artificial intelligence (AI) in financial services has prompted growing demand for tools that can systematically detect AI-related disclosures in corporate filings. While prior approaches often rely on keyword expansion or document-level classification, they fall short in granularity, interpretability, and robustness. This study introduces FinAI-BERT, a domain-adapted transformer-based language model designed to classify AI-related content at the sentence level within financial texts. The model was fine-tuned on a manually curated and balanced dataset of 1,586 sentences drawn from 669 annual reports of U.S. banks (2015 to 2023). FinAI-BERT achieved near-perfect classification performance (accuracy of 99.37 percent, F1 score of 0.993), outperforming traditional baselines such as Logistic Regression, Naive Bayes, Random Forest, and XGBoost. Interpretability was ensured through SHAP-based token attribution, while bias analysis and robustness checks confirmed the model's stability across sentence lengths, adversarial inputs, and temporal samples. Theoretically, the study advances financial NLP by operationalizing fine-grained, theme-specific classification using transformer architectures. Practically, it offers a scalable, transparent solution for analysts, regulators, and scholars seeking to monitor the diffusion and framing of AI across financial institutions.

[16] 2507.01995

Fair sharing ratios of Profit and Loss sharing contracts

We consider islamic Profit and Loss (PL) sharing contract, possibly combined with an agency contract, and introduce the notion of {\em $c$-fair} profit sharing ratios ($c = (c_1, \ldots,c_d) \in (\mathbb R^{\star})^d$, where $d$ is the number of partners) which aims to determining both the profit sharing ratios and the induced expected maturity payoffs of each partner $\ell$ according to its contribution, determined by the rate component $c_{\ell}$ of the vector $c$, to the global success of the project. We show several new results that elucidate the relation between these profit sharing ratios and various important economic factors as the investment risk, the labor and the capital, giving accordingly a way of choosing them in connection with the real economy. The design of our approach allows the use of all the range of econometrics models or more general stochastic diffusion models to compute or approximate the quantities of interest.

[17] 2507.02011

Machine Learning Based Stress Testing Framework for Indian Financial Market Portfolios

This paper presents a machine learning driven framework for sectoral stress testing in the Indian financial market, focusing on financial services, information technology, energy, consumer goods, and pharmaceuticals. Initially, we address the limitations observed in conventional stress testing through dimensionality reduction and latent factor modeling via Principal Component Analysis and Autoencoders. Building on this, we extend the methodology using Variational Autoencoders, which introduces a probabilistic structure to the latent space. This enables Monte Carlo-based scenario generation, allowing for more nuanced, distribution-aware simulation of stressed market conditions. The proposed framework captures complex non-linear dependencies and supports risk estimation through Value-at-Risk and Expected Shortfall. Together, these pipelines demonstrate the potential of Machine Learning approaches to improve the flexibility, robustness, and realism of financial stress testing.

[18] 2507.02018

NGAT: A Node-level Graph Attention Network for Long-term Stock Prediction

Graph representation learning methods have been widely adopted in financial applications to enhance company representations by leveraging inter-firm relationships. However, current approaches face three key challenges: (1) The advantages of relational information are obscured by limitations in downstream task designs; (2) Existing graph models specifically designed for stock prediction often suffer from excessive complexity and poor generalization; (3) Experience-based construction of corporate relationship graphs lacks effective comparison of different graph structures. To address these limitations, we propose a long-term stock prediction task and develop a Node-level Graph Attention Network (NGAT) specifically tailored for corporate relationship graphs. Furthermore, we experimentally demonstrate the limitations of existing graph comparison methods based on model downstream task performance. Experimental results across two datasets consistently demonstrate the effectiveness of our proposed task and model. The project is publicly available on GitHub to encourage reproducibility and future research.

[19] 2507.02027

Arbitrage with bounded Liquidity

The arbitrage gains or, equivalently, Loss Versus Rebalacing (LVR) for arbitrage between two imperfectly liquid markets is derived. To derive the LVR, I assume a quadratic trading cost to model the cost of trading on the more liquid exchange and discuss to which situations my model arguably applies well (long tail CEX-DEX arbitrage, DEX-DEX arbitrage) and to which not so well (CEX-DEX arbitrage for major pairs). I discuss extension to other cost functions and directions for future research.

[20] 2507.02287

Seeing Through Green: Text-Based Classification and the Firm's Returns from Green Patents

This paper introduces Natural Language Processing for identifying ``true'' green patents from official supporting documents. We start our training on about 12.4 million patents that had been classified as green from previous literature. Thus, we train a simple neural network to enlarge a baseline dictionary through vector representations of expressions related to environmental technologies. After testing, we find that ``true'' green patents represent about 20\% of the total of patents classified as green from previous literature. We show heterogeneity by technological classes, and then check that `true' green patents are about 1\% less cited by following inventions. In the second part of the paper, we test the relationship between patenting and a dashboard of firm-level financial accounts in the European Union. After controlling for reverse causality, we show that holding at least one ``true'' green patent raises sales, market shares, and productivity. If we restrict the analysis to high-novelty ``true'' green patents, we find that they also yield higher profits. Our findings underscore the importance of using text analyses to gauge finer-grained patent classifications that are useful for policymaking in different domains.

[21] 2507.02412

Green Ammonia: A Techno-Economic Supply Chain Optimization

Green ammonia is emerging as a strategic intermediary within green energy supply chains, serving effectively as both an industrial commodity and hydrogen carrier. This study provides a techno-economic analysis of green ammonia supply chains, comparing cost-effective pathways from global production to European consumers, and evaluates ammonia alongside alternative hydrogen carriers. Gaseous hydrogen consistently remains the most economical import option for Europe, though ammonia holds a narrowing cost advantage over liquid hydrogen (from 16 % in 2030 to 10 % by 2040). Competitive ammonia suppliers, notably Morocco, the United States, and the United Arab Emirates, benefit from low renewable energy costs, with significant price reductions expected by 2040, driven by falling costs for electricity, electrolysers, and conversion technologies. Optimal transport modes vary by consumer demand and distance: trucks are ideal for low demands at all distances, rail for medium ranges, and pipelines for high-demand scenarios. By 2040, ammonia will primarily serve direct-use applications, as hydrogen consumers increasingly shift to direct hydrogen supplies. Policymakers should prioritize pipeline infrastructure for hydrogen distribution, cautiously invest in ammonia's short- to medium-term infrastructure advantages, and limit long-term reliance on ammonia as a hydrogen carrier to mitigate stranded asset risks.

[22] 2507.02439

Introducing a New Brexit-Related Uncertainty Index: Its Evolution and Economic Consequences

Important game-changer economic events and transformations cause uncertainties that may affect investment decisions, capital flows, international trade, and macroeconomic variables. One such major transformation is Brexit, which refers to the multiyear process through which the UK withdrew from the EU. This study develops and uses a new Brexit-Related Uncertainty Index (BRUI). In creating this index, we apply Text Mining, Context Window, Natural Language Processing (NLP), and Large Language Models (LLMs) from Deep Learning techniques to analyse the monthly country reports of the Economist Intelligence Unit from May 2012 to January 2025. Additionally, we employ a standard vector autoregression (VAR) analysis to examine the model-implied responses of various macroeconomic variables to BRUI shocks. While developing the BRUI, we also create a complementary COVID-19 Related Uncertainty Index (CRUI) to distinguish the uncertainties stemming from these distinct events. Empirical findings and comparisons of BRUI with other earlier-developed uncertainty indexes demonstrate the robustness of the new index. This new index can assist British policymakers in measuring and understanding the impacts of Brexit-related uncertainties, enabling more effective policy formulation.

[23] 2507.02511

Identity and Cooperation in Multicultural Societies: An Experimental Investigation

Immigration has shaped many nations, posing the challenge of integrating immigrants into society. While economists often focus on immigrants' economic outcomes compared to natives (such as education, labor market success, and health) social interactions between immigrants and natives are equally crucial. These interactions, from everyday exchanges to teamwork, often lack enforceable contracts and require cooperation to avoid conflicts and achieve efficient outcomes. However, socioeconomic, ethnic, and cultural differences can hinder cooperation. Thus, evaluating integration should also consider its impact on fostering cooperation across diverse groups. This paper studies how priming different identity dimensions affects cooperation between immigrant and native youth. Immigrant identity includes both ethnic ties to their country of origin and connections to the host country. We test whether cooperation improves by making salient a specific identity: Common identity (shared society), Multicultural identity (ethnic group within society), or Neutral identity. In a lab in the field experiment with over 390 adolescents, participants were randomly assigned to one of these priming conditions and played a Public Good Game. Results show that immigrants are 13 percent more cooperative than natives at baseline. Natives increase cooperation by about 3 percentage points when their multicultural identity is primed, closing the initial gap with immigrant peers.

[24] 2507.02560

Tertiary Education Completion and Financial Aid Assistance: Evidence from an Information Experiment

Understanding the role of information among disadvantaged students is crucial in explaining their investment decisions in higher education. Indeed, information barriers on the returns and the gains from completing college may explain a substantial share of variation in students' degree completion. We conduct a field experiment with 7,806 university students in Italy who benefit from financial aid assistance, by providing information, either on the labor market returns of completing college or on the education returns of meeting the academic requirements attached to the financial aid. Our results suggest that only the latter information treatment has a positive effect on academic performance, increasing the number of credits obtained by around 3, and by decreasing the probability of dropout by around 4 percentage points. We also find that the results are mediated by an aspiration lift generated by our treatment.

[25] 2507.01993

Finding good bets in the lottery, and why you shouldn't take them

We give a criterion under which the expected return on a ticket for certain large lotteries is positive. In this circumstance, we use elementary portfolio analysis to show that an optimal investment strategy includes a very small allocation for such tickets.

[26] 2507.02464

Resolving CAP Through Automata-Theoretic Economic Design: A Unified Mathematical Framework for Real-Time Partition-Tolerant Systems

The CAP theorem asserts a trilemma between consistency, availability, and partition tolerance. This paper introduces a rigorous automata-theoretic and economically grounded framework that reframes the CAP trade-off as a constraint optimization problem. We model distributed systems as partition-aware state machines and embed economic incentive layers to stabilize consensus behavior across adversarially partitioned networks. By incorporating game-theoretic mechanisms into the global transition semantics, we define provable bounds on convergence, liveness, and correctness. Our results demonstrate that availability and consistency can be simultaneously preserved within bounded epsilon margins, effectively extending the classical CAP limits through formal economic control.

[27] 2307.07015

Advertiser Learning in Direct Advertising Markets

Direct buy advertisers procure advertising inventory at fixed rates from publishers and ad networks. Such advertisers face the complex task of choosing ads amongst myriad new publisher sites. We offer evidence that advertisers do not excel at making these choices. Instead, they try many sites before settling on a favored set, consistent with advertiser learning. We subsequently model advertiser demand for publisher inventory wherein advertisers learn about advertising efficacy across publishers' sites. Results suggest that advertisers spend considerable resources advertising on sites they eventually abandon -- in part because their prior beliefs about advertising efficacy on those sites are too optimistic. The median advertiser's expected CTR at a new site is 0.177\%, four times higher than the true median CTR of 0.045\%. We consider how an ad network's pooling of advertiser information remediates this problem. As ads with similar visual elements garner similar CTRs, the network's pooling of information enables advertisers to better predict ad performance at new sites. Counterfactual analyses indicate that gains from pooling advertiser information are substantial: over six months, we estimate a median advertiser welfare gain of \$3,621 (an 18.3\% increase) and a median revenue gain of \$13,558 (a 77.7\% increase) among the 20 largest publishers.

[28] 2310.02867

Learning the Probability Distributions of Day-Ahead Electricity Prices

We propose a novel machine learning approach for probabilistic forecasting of hourly day-ahead electricity prices. In contrast with the recent advances in data-rich probabilistic forecasting, which approximates distributions with few features (such as moments), our method is nonparametric and selects the distribution from all possible empirical distributions learned from the input data without the need for limiting assumptions. The model that we propose is a multioutput neural network that accounts for the temporal dynamics of the probabilities and controls for monotonicity using a penalty. Such a distributional neural network can precisely learn complex patterns from many relevant variables that affect electricity prices. We illustrate the capacity of the developed method on German hourly day-ahead electricity prices and predict their probability distribution via many variables, doing so more accurately than the state-of-the-art benchmarks can, thus revealing new valuable information in the data.

[29] 2505.14588

Generative AI at the Crossroads: Light Bulb, Dynamo, or Microscope?

With the advent of generative AI (genAI), the potential scope of artificial intelligence has increased dramatically, but the future effect of genAI on productivity remains uncertain. The effect of the technology on the innovation process is a crucial open question. Some inventions, such as the light bulb, temporarily raise productivity growth as adoption spreads, but the effect fades when the market is saturated; that is, the level of output per hour is permanently higher but the growth rate is not. In contrast, two types of technologies stand out as having longer-lived effects on productivity growth. First, there are technologies known as general-purpose technologies (GPTs). GPTs (1) are widely adopted, (2) spur abundant knock-on innovations (new goods and services, process efficiencies, and business reorganization), and (3) show continual improvement, refreshing this innovation cycle; the electric dynamo is an example. Second, there are inventions of methods of invention (IMIs). IMIs increase the efficiency of the research and development process via improvements to observation, analysis, communication, or organization; the compound microscope is an example. We show that GenAI has the characteristics of both a GPT and an IMI -- an encouraging sign that genAI will raise the \textit{level} of productivity. Even so, genAI's contribution to productivity \textit{growth} will depend on the speed with which that level is attained and, historically, integrating revolutionary technologies into the economy is a protracted process.

[30] 2506.23767

Explainable AI for Comprehensive Risk Assessment for Financial Reports: A Lightweight Hierarchical Transformer Network Approach

Every publicly traded U.S. company files an annual 10-K report containing critical insights into financial health and risk. We propose Tiny eXplainable Risk Assessor (TinyXRA), a lightweight and explainable transformer-based model that automatically assesses company risk from these reports. Unlike prior work that relies solely on the standard deviation of excess returns (adjusted for the Fama-French model), which indiscriminately penalizes both upside and downside risk, TinyXRA incorporates skewness, kurtosis, and the Sortino ratio for more comprehensive risk assessment. We leverage TinyBERT as our encoder to efficiently process lengthy financial documents, coupled with a novel dynamic, attention-based word cloud mechanism that provides intuitive risk visualization while filtering irrelevant terms. This lightweight design ensures scalable deployment across diverse computing environments with real-time processing capabilities for thousands of financial documents which is essential for production systems with constrained computational resources. We employ triplet loss for risk quartile classification, improving over pairwise loss approaches in existing literature by capturing both the direction and magnitude of risk differences. Our TinyXRA achieves state-of-the-art predictive accuracy across seven test years on a dataset spanning 2013-2024, while providing transparent and interpretable risk assessments. We conduct comprehensive ablation studies to evaluate our contributions and assess model explanations both quantitatively by systematically removing highly attended words and sentences, and qualitatively by examining explanation coherence. The paper concludes with findings, practical implications, limitations, and future research directions. Our code is available at this https URL.

New articles on Quantitative Finance

[1] 2507.01963