Abstract

By combining AI and fluid physics, we discover a closed-form closure for 2D turbulence from small direct numerical simulation (DNS) data. Large-eddy simulation (LES) with this closure is accurate and stable, reproducing DNS statistics including those of extremes. We also show that the new closure could be derived from a 4th-order truncated Taylor expansion. Prior analytical and AI-based work only found the 2nd-order expansion, which led to unstable LES. The additional terms emerge only when inter-scale energy transfer is considered alongside standard reconstruction criterion in the sparse-equation discovery.

Subgrid-scale (SGS) closures, or parameterizations, are essential for simulating real-world systems involving turbulent flows [1], [2]. A prominent example is climate modeling, where closures are required for simulating the geophysical turbulence in the atmosphere and ocean, along with various other nonlinear, multi-scale processes [3]–[5]. Despite over a century of research, a general framework for deriving turbulence closures from first principles has remained elusive, resulting in the widespread reliance on semi-empirical and ad-hoc models [2], [6], [7]. In fact, the shortcomings of these closures are the main source of epistemic uncertainty in climate change projections and weather forecasts, especially for extreme events [8]–[11]. This has led to extensive efforts in recent decades to pursue innovative physical, mathematical, and, recently, artificial intelligence (AI) tools to develop SGS closures for geophysical flows [11]–[14].

An ideal closure 1) should accurately capture the detailed structure of the SGS fluxes, 2) should accurately reproduce the interactions between the SGS processes and the resolved, large-scale dynamics, e.g., the inter-scale energy and enstrophy transfers, and 3) when coupled to the numerical solver of the resolved scales, e.g., in large-eddy simulation (LES), the simulated flow should have key characteristics such as energy and enstrophy spectra matching those of the direct numerical simulation (DNS). In the context of weather and climate prediction, of particular interest is also capturing the statistics of rare (extreme) events. However, currently, a framework for developing closures satisfying (1)-(3) is lacking [1], [2], [6].

The “structural modeling” approach [2], e.g., based on a second-order truncated Taylor-series expansion of the SGS flux, yields closures such as the nonlinear gradient model (NGM2), which satisfies (1) with above 0.9 pattern correlation with filtered DNS [15]–[17]. However, these closures fail (2)-(3), e.g., producing no inter-scale energy transfer in 2D turbulence and leading to unstable LES [18]–[20]. The “functional modeling” approach [21]–[24] leads to eddy-viscosity closures such as Smagorinsky [25] and Leith [24], which produce stable LES and some aspects of (2) though they are often overly dissipative and miss backscattering (the transfer of energy from the SGS processes to the resolved scales), an important process in atmospheric and oceanic flows [4], [26]–[35]. Excessive dissipation also degrades the representation of extreme events in LES with these closures [36], [37]. Furthermore, these closures fail (1), e.g., lower than 0.5 pattern correlation [36], [38], [39].

Recently, AI approaches for developing data-driven closures have received significant attention. Most efforts have focused on supervised learning with filtered DNS data, which is an approach akin to structural modeling [7], [11], [40], [41]. While some studies have demonstrated that these data-driven closures can satisfy conditions (1)–(3), they suffer from three major limitations: (i) the need for large amounts of DNS snapshots for training, (ii) lack of interpretability, and (iii) poor generalization to out-of-distribution regimes, e.g., extrapolation to much higher Reynolds numbers [36], [42]–[45]. Self-supervised learning approaches, akin to functional modeling, only use DNS statistics and have also shown success; however, they still suffer from shortcomings (ii) and (iii) [46]–[48].

A number of studies have pursued an alternative approach: “equation discovery”, a class of AI techniques that aim to find a closed-form mathematical representation of the data [49]–[56]. The major appeals of this approach are data efficiency, interpretability, and generalizability, thus addressing the shortcomings of deep learning. In their pioneering work, Zanna and Bolton [19] used Bayesian sparse regression to find a closed-form equation for SGS momentum stress in geophysical turbulence. They robustly discovered a closure that resembled NGM2 but found that while this closure accurately captured the detailed structure of the SGS fluxes, the LES was unstable (unless the predicted SGS flux was attenuated). Using the same methodology, [20] discovered closures of the same structure for SGS momentum and heat fluxes across various setups of 2D turbulence and Rayleigh-Bénard convection and confirmed that they all exactly match the outcome of second-order truncated Taylor-series expansion; i.e., the discovered closure is NGM2. They also demonstrated that the LES instability results from NGM2’s inability to capture any interacscale energy transfer (neither diffusion nor backscattering). However, it remained unclear how a better equation discovery approach can be devised, though the need for a physics-informed loss function, evolving library (e.g., through genetic programming [32], [49]) were speculated.

In this Letter, we inform the criterion for “discovery” of a sparse representation with the fundamental understanding of turbulence physics. We robustly find, for diverse setups of 2D turbulent flows and with an expansive library, a closure that is NGM2 plus an additional term. We further show that this additional term is, in fact, the 4th-order term in the Taylor-series expansion of the SGS flux. We demonstrate that this new closure (NGM4, hereafter), accurately captures the structure of the SGS flux (pattern correlation \(\approx 0.99\)), accurately reproduce the inter-scale energy and enstrophy transfers, and leads to stable LES, that among other things, accurately reproduces the statistics of the rare, extreme events in diverse test cases that cover a broad range of dynamics mimicking atmospheric and oceanic turbulence.

The dimensionless continuity and momentum equations for LES of 2D turbulence in spatial dimensions (\(x,y\)) are given by: \[\begin{align} \frac{\partial \overline{u}_i}{\partial x_i} & = & 0, \tag{1}\\ \frac{\partial \overline{u}_i}{\partial t} +\frac{\partial\overline{u}_i\overline{u}_j}{\partial x_j} & = & -\frac{\partial \overline{p}}{\partial x_i} + \frac{1}{Re}\frac{\partial^2 \overline{u}_i}{\partial x_j x_j} -\frac{\partial \tau_{ij}}{\partial x_j} + \overline{\mathcal{Q}}_i. \tag{2} \end{align}\] Here, \(\overline{(.)}\) represents a low-pass filtering operation, \(u_i\) denotes velocity, \(p\) is pressure, \(Re\) is the Reynolds number, \(\overline{\mathcal{Q}}_i\) represents a time-invariant external forcing, Rayleigh drag, and the Coriolis force. The SGS stress tensor, \(\tau_{ij}=\overline{u_iu}_j-\overline{u}_i\overline{u}_j\), requires a closure model that expresses \(\tau_{11}, \tau_{12}=\tau_{21},\) and \(\tau_{22},\) as a function of the resolved flow variables (\(\overline{u}_i, \overline{p}\)).

We investigate four cases of 2D turbulence (see Figure 4), generating a diverse flow dynamics that vary in dominant length scales, energy (\(E=\frac{1}{2}u_iu_i\)), and enstrophy (\(Z=\frac{1}{2}\omega^2\)) cascade regimes (\(\omega\) is the vorticity). Here, we regard direct numerical simulation (DNS) data as the ground truth and utilize filtered DNS (FDNS) data to learn closure for \(\tau_{ij}\). The FDNS data are obtained by first filtering the DNS data using a Gaussian filter and then coarse-graining the results to the \(4\) to \(64\times\) coarser LES grid following [20].

The equation discovery method used in this study is a sparsity-promoting Bayesian linear regression approach [19], [52], [57] based on the relevance vector machine (RVM) [58], employed to derive closed-form closures for each element of \(\tau_{ij}\) using FDNS data. This method operates on a library of basis functions, \(\boldsymbol{\Phi}\), comprising linear or nonlinear combinations of relevant variables, such as velocity or its derivatives: \[\begin{align} \left[ \frac{\partial^{\left(q_1+q_2\right)} A}{\partial x^{q_1} \partial y^{q_2}}\right]^{p_1} \left[ \frac{\partial^{\left(q_4 + q_5\right)} B}{\partial x^{q_4} \partial y^{q_5}}\right]^{p_2} \left[ \frac{D C}{D t}\right]^{p_3},\label{eq10} \end{align}\tag{3}\] where \(A, B = \overline{u}\) or \(\overline{v}\), and \(C = \overline{\omega}\) or an element of \(\overline{S}_{ij} = \frac{1}{2}\left(\frac{\partial \overline{u}_i}{\partial x_j} + \frac{\partial \overline{u}_j}{\partial y_i} \right)\), the strain rate. Material derivative \(\frac{D}{Dt} = \frac{\partial}{\partial t} + u_j\frac{\partial}{\partial x_j}\) is included to account for non-Markovian (memory) effects, motivated by Mori-Zwanzig formalism [59], [60].

This library is extensive, with integers \(0 \le q \le 8\) and \(0 \le p \le 2\), although the total derivative order is constrained to 8th (yielding a total of 930 terms in the library). To accurately compute higher-order spatial derivatives required for this study, we used arbitrary-precision methods [61]. The construction of this library is motivated by the Galilean-invariant and symmetry properties of the SGS terms; e.g., this library includes the Pope’s tensors [32], [62]–[68].

The library must be comprehensive enough to express \(\boldsymbol{{s}}\), a vectorized element of \(\tau_{ij}\), as \(\boldsymbol{s}^{\text{RVM}} = \boldsymbol{\Phi}\boldsymbol{c}\). The regression weights, \(\boldsymbol{c}\), are optimized by minimizing the error \(\text{MSE} = \lVert \mathit{S}^{\text{RVM}} - \mathit{S}^{\text{FDNS}} \rVert^2_2\), where \(\mathit{S}\) consists of \(n\) stacked samples of \(\boldsymbol{s}\). The RVM enforces sparsity by iteratively pruning basis functions with weights exhibiting uncertainties above a predefined threshold, \(\alpha\), and recalculating the MSE until all remaining functions have uncertainties below \(\alpha\). A higher \(\alpha\) increases model complexity but reduces the MSE.

Figure 1: Representative examples of the effects of increasing the sparsity-level hyperparameter, \alpha, on the CC in the discovered closure. (a)-(b) The CC-\alpha relationship. In (a), the shading represents the max-min spread of all three elements of \tau_{ij}. (a) uses the common metric, CC of \tau_{ij}, while (b) uses a physics-informed metric that accounts for the CC of total (P_{\tau}), diffusion (P_{\tau} > 0), and backscattering (P_{\tau} < 0) inter-scale energy transfers as well. — Figure 1: Representative examples of the effects of increasing the sparsity-level hyperparameter, \(\alpha\), on the CC in the discovered closure. (a)-(b) The CC-\(\alpha\) relationship. In (a), the shading represents the max-min spread of all three elements of \(\tau_{ij}\). (a) uses the common metric, CC of \(\tau_{ij}\), while (b) uses a physics-informed metric that accounts for the CC of total (\(P_{\tau}\)), diffusion (\(P_{\tau} > 0\)), and backscattering (\(P_{\tau} < 0\)) inter-scale energy transfers as well.

For each of the four cases in Fig. 4, we separately discover closures for three elements of the SGS stress tensor. We also examine several LES grid sizes, and the effect of varying \(\alpha\), which as mentioned earlier, is a key hyper‐parameter in equation discovery. We use \(n\) = 100 FDNS samples from a training set and 20 FDNS samples from an independent testing set. As an illustrative example, Fig. 1 (a) displays the mean CC for all elements of \(\tau_{ij}\) as \(\alpha\) increases. Note that this structural modeling approach is what all past equation-discovery studies followed [19], [20]. For small \(\alpha\) values (\(\alpha < 1\)), no closure is discovered (CC = 0, zero terms). As \(\alpha\) increases, the number of discovered terms increases, and the CC value rises and eventually plateaus, forming an “L-Curve.” The elbow of this curve is commonly used to identify the \(\alpha\) that balances accuracy and model size in equation discovery [19], [52], [57]. In Fig. 1 (a), the CC-\(\alpha\) relationship for each element of \(\tau_{ij}\) converges to approximately 1 and robustly discovers the same model at the elbow. Recent studies [19], [20] found that the elbow of this curve (red circles) corresponds to the analytically derivable NGM2 [15], [16]. However, as these studies and earlier work found, LES with NGM2 leads to unstable a posteriori (online) simulations. [20] suggested that this instability is due to the inability of NGM2 in capturing any inter-scale energy transfers; i.e., \(P_{\tau}^{\text{NGM2}}(x,y) = 0\) (NGM2 captures neither diffusion nor backscattering).

Here, we introduce a physics-informed discovery criterion that requires not only accurate reconstruction of \(\tau_{ij}\) but also capturing the inter-scale energy transfer (both diffusion and backscattering), thus combining structural and functional modeling. Figure 1 (b) demonstrates that the L-curve in this new approach does form an elbow arond NGM2 (red circules), but rather, discovers a new closure (black circles) that has CC of \(\tau_{ij}>0.99\) (Fig. 1 (a)) and non-zero inter-scale energy transfer (Fig. 1 (b)). We have found that the closure discovered at the new elbow is NGM2 plus the second term from the Taylor-series expansion involving the higher orders (\(\mathcal{O}(\Delta^4\))), where \(\Delta\) is the LES grid spacing (see Eq. 4 ). We refer to this new closure as NGM4. As shown below, NGM4 outperforms existing physics-based closures in a priori and a posteriori tests.

It is noteworthy that the L-curves in Fig. 1 are consistently observed for all cases with any other \(N_{\text{LES}}\). Note that increasing \(\alpha\) leads to another closure with more terms but negligible improvements in CC; we have found this closure to be NGM6; see Eq. 4 (a generalized Taylor-series expansion of \(\tau_{ij}\) is provided in the supplementary information). The equation discovery with the expansive 930-term library confirms that Taylor-series expansion offers the best representation of both \(\tau_{ij}\) and the inter-scale energy transfer. Note that the memory terms never emerged during the discovery.

\[\tau_{ij}^{\text{NGM6}} = \underbrace{\underbrace{\frac{1}{1!}\frac{\Delta^2}{12} \left( \frac{\partial \overline{u}_i}{\partial x_k} \frac{\partial \overline{u}_j}{\partial x_k} \right)}_{\tau_{ij}^{\text{NGM2}}} + \frac{1}{2!}\frac{\Delta^4}{12^2} \left(\frac{\partial^2 \overline{u}_i}{\partial x_k \partial x_l}\frac{\partial^2 \overline{u}_j}{\partial x_k \partial x_l} \right)}_{\tau_{ij}^{\text{NGM4}}} + \frac{1}{3!}\frac{\Delta^6}{12^3}\left(\frac{\partial^3 \overline{u}_i}{\partial x_k \partial x_l \partial x_m }\frac{\partial^3 \overline{u}_j}{\partial x_k \partial x_l \partial x_m} \right) + \mathcal{O}\left(\Delta^8\right). \label{eq:NGM}\tag{4}\]

Figure 2: Comparison of the a priori (offline) performance of different closures. The table shows the ratio of domain-averaged P_{\tau} and P_Z to that of FDNS. The bar plots present the CC for \tau_{12}, P_{\tau}, and P_Z. All values are the mean and standard deviations calculated for N_{\text{LES}}=128 (Cases 1-2) and N_{\text{LES}}=512 (Cases 3-4); Table S1 shows the values for each case. — Figure 2: Comparison of the *a priori* (offline) performance of different closures. The table shows the ratio of domain-averaged \(P_{\tau}\) and \(P_Z\) to that of FDNS. The bar plots present the CC for \(\tau_{12}\), \(P_{\tau}\), and \(P_Z\). All values are the mean and standard deviations calculated for \(N_{\text{LES}}=128\) (Cases 1-2) and \(N_{\text{LES}}=512\) (Cases 3-4); Table S1 shows the values for each case.

We first compare the performance of NGMs (NGM2-6) with common baseline closures, the eddy-viscosity Smagorinsky (Smag) [25], its dynamic version (DSmag) [22], and the dynamic Leith (DLeith) [24] models (see Appendix B). Figure 2 shows that NGM4 and 6 perfectly capture the structure of the SGS stress and inter-scale enstrophy transfers, and very well capture the interscale energy transfers (both diffusion and backscattering), significantly outperforming NGM2 and the physics-based closures. An important feature of NGM4 and 6 is that they capture the energy backscatter that is seen in the FDNS. While the overall representations of inter-scale energy transfer of NGM4 and 6 are not perfect (around \(62\%\) and \(78\%\) of FDNS’, respectively), they are substantially better than the energy transfers of NGM2 (\(=0\)) and the eddy-viscosity closures, which are purely and excessively diffusive (as indicated by the negative CC values). The advantages of NGM4 and 6, especially over the eddy-viscosity closures, can also be seen in the inter-scale energy and enstrophy transfers’ spectra (Fig. S1). The a priori tests show that NGM4 and 6 are failry comprable, though NGM6 has a noticeably better \(P_\tau\) (see Table S1).

Figure 3: Comparison of the a posteriori performance of different subgrid-scale (SGS) closures for N_{\text{LES}}=512 (Cases 3-4). (a)-(b) The probability distribution function (PDF) of vorticity, \overline{\omega}. NGM4/6 can capture both the bulk and the tails of the PDF. The upper-right insets of (a)-(b) show the correlation coefficient (CC) between the vorticity of FDNS, \overline{\omega}_{\text{FDNS}}, and the SGS closures, \overline{\omega}_{\text{LES}}. The x-axis represents the eddy-turnover time (t_{\eta} = 1/\langle\overline{\omega}^2\rangle). NGM4/6 maintains a trajectory closer to FDNS than the eddy-viscosity models. (c)-(d) Comparison of the enstrophy spectra, \hat{Z}\left(k\right), of SGS closures. NGM4/6 captures the bulk of the enstrophy, while eddy-viscosity models struggle to do so. A similar figure for N_{\text{LES}}=128 (Cases 1-2) is shown in SI. — Figure 3: Comparison of the *a posteriori* performance of different subgrid-scale (SGS) closures for \(N_{\text{LES}}=512\) (Cases 3-4). (a)-(b) The probability distribution function (PDF) of vorticity, \(\overline{\omega}\). NGM4/6 can capture both the bulk and the tails of the PDF. The upper-right insets of (a)-(b) show the correlation coefficient (CC) between the vorticity of FDNS, \(\overline{\omega}_{\text{FDNS}}\), and the SGS closures, \(\overline{\omega}_{\text{LES}}\). The x-axis represents the eddy-turnover time (\(t_{\eta} = 1/\langle\overline{\omega}^2\rangle\)). NGM4/6 maintains a trajectory closer to FDNS than the eddy-viscosity models. (c)-(d) Comparison of the enstrophy spectra, \(\hat{Z}\left(k\right)\), of SGS closures. NGM4/6 captures the bulk of the enstrophy, while eddy-viscosity models struggle to do so. A similar figure for \(N_{\text{LES}}=128\) (Cases 1-2) is shown in SI.

NGM4 and 6’s strong a priori performance is reflected in their superior a posteriori performance. LES [69] with NGM 4 and 6 remain long-term stable when the LES resolution is high enough to capture over \(80\%\) of DNS enstrophy (see Fig. S2). This is reminiscent of findings in 3D turbulence, where LES typically needs a resolution high enough to cover at least \(80\%\) of DNS energy [1].

As shown in Figs. 3 and S3, when stable, LES with NGM4 and 6 capture both the bulk and tails of the probability distribution function (PDF) of vorticity (the tails of this PDF represent “extreme weather” in this prototype of atmospheric circulation). LES with eddy-viscosity closures fail to capture the extremes, especially for Cases 3 and 4, which have high-wavenumber forcings. Note that NGM2 for any case and any LES resolution leads to instabilities.

In addition to better capturing extreme events, LES with NGM4 and 6 outperforms eddy-viscosity models in short-term forecasting of vorticity (see insets of panels (a)-(b) in Figs. 3 and S3). further demonstrate the correlation coefficient (CC) between the vorticity of FDNS, \(\overline{\omega}_{\text{FDNS}}\), and the SGS closures, \(\overline{\omega}_{\text{LES}}\). All models start from the same initial condition, with NGM4/6 maintaining a trajectory closer to FDNS than the eddy-viscosity models.

LES with NGM4 and 6 also outperform LES with other closures in capturing the domain-averaged kinetic energy and enstrophy (Table S2). Figures 3 and S3 (c)-(d) show the enstrophy spectra of FDNS and LES. The insets magnifying the tails of the spectra do not clearly show the superiority of LES with any closure. However, the insets magnifying the spectra at lower wavenumbers reveal that LES with eddy-viscosity closures struggle to capture the bulk of the enstrophy. This observation is also consistent with the energy spectra (Fig. S4). Note that in a posteriori tests, LES with NGM4 and NGM4 show fairly similar performances, suggesting that in practice, using NGM4 might be enough.

In conclusion, a physics-informed data-driven discovery identified an analytically derivable, yet previously ignored, representation of SGS closure that satisfies both structural and functional modeling requirements. This closure (NGM4) is “interpretable” as it is derivable from the Taylor-series expansion and it represents the Leonard and cross stresses (see supporting information). This closure is “generalizable”, as its coefficients are entirely determined by LES resolution (\(\Delta\)). NGM4 is accurate: it is the first closure to simultaneously excel in both a priori and a posteriori tests. LES with NGM4 is stable only when the numerical resolution is high enough to capture \(80\%\) of DNS enstrophy; however, this is consistent with general limitations of LES [1].

A major difference between NGM4 and NGM6 is that the latter represents the Reynolds stress too (see supporting information). However, while a priori tests show that NGM6 better represents the inter-scale energy transfer, a posteriori tests do not demonstrate any advantage over NGM4 in terms of stability or accuracy. This is encouraging as implementing models with high-order derivatives in numerical solvers can be challenging, but these findings suggest that NGM4 might suffice in practice, e.g., in ocean modeling.

We note that NGM4 and 6 violate the Boussinesq hypothesis as they depend on second-order velocity gradients, highlighting the need to go beyond classical turbulence tensors to represent backscatter and anisotropic effects [62].

While here we only focused on the SGS stress tensor in 2D turbulence, these findings are relevant to multi-scale modeling in other dynamical systems too, as coarse-graining any quadratic nonlinearity can lead to an NGM-like representation. This work also presents the importance of combining physical and mathematical insight to better harness he power of AI methods in accelerating scientific discovery.

Acknowledgment↩︎

This work is supported by NSF grant AGS-2046309 and Schmidt Sciences. Computational resources were provided by NSF ACCESS (allocation ATM170020) and NCAR’s CISL (allocation URIC0009).

Data availability↩︎

The code for the numerical solver “py2d” is available at [69]. The codes for equation discovery and data analysis in this work can be found at [70].

Appendix↩︎

Appendix A: Figure 4 shows the cases considered in this study and their physical and numerical parameters.

Figure 4: The four cases considered in this study. These cases represent a broad range of dynamics in 2D turbulence and mimic the diversity of flow regimes (jets and vortices) in the atmosphere and ocean. (k_x,k_y) are the wavenumbers of the time-invariant forcing in the x and y directions and \beta is the gradient of the Coriolis force (see [20] for details). N_\mathrm{DNS} is the resolution of the pseudo-spectral solver (in each direction) used for DNS ([69]). The first row presents snapshots of the DNS vorticity, \omega. The second row shows snapshots of the FDNS vorticity, \overline{\omega}. The third row displays a snapshot of an element of the subgrid-scale (SGS) term, \tau_{12}. The last row depicts the energy (red line) and enstrophy (blue line) spectra for DNS (dashed line) and FDNS (solid line). FDNS is at N_{\text{LES}}=128 for Cases 1-2 and N_{\text{LES}}=512 for Cases 3-4. — Figure 4: The four cases considered in this study. These cases represent a broad range of dynamics in 2D turbulence and mimic the diversity of flow regimes (jets and vortices) in the atmosphere and ocean. \((k_x,k_y)\) are the wavenumbers of the time-invariant forcing in the \(x\) and \(y\) directions and \(\beta\) is the gradient of the Coriolis force (see [20] for details). \(N_\mathrm{DNS}\) is the resolution of the pseudo-spectral solver (in each direction) used for DNS ([69]). The first row presents snapshots of the DNS vorticity, \(\omega\). The second row shows snapshots of the FDNS vorticity, \(\overline{\omega}\). The third row displays a snapshot of an element of the subgrid-scale (SGS) term, \(\tau_{12}\). The last row depicts the energy (red line) and enstrophy (blue line) spectra for DNS (dashed line) and FDNS (solid line). FDNS is at \(N_{\text{LES}}=128\) for Cases 1-2 and \(N_{\text{LES}}=512\) for Cases 3-4.

Appendix B: The baseline physics-based closures used here are Smag, DSmag, and DLeith. These eddy viscosity closures introduce dissipation and do not account for backscattering: \(\tau_{ij} = -2\nu_{e}\overline{S}_{ij}\), with \(\overline{S}_{ij}\) representing the filtered rate of strain and \(\nu_{e}\) denoting the eddy viscosity [1]. \(\nu_e = \left(C_s\Delta\right)^2\sqrt{2\overline{S}_{ij}\overline{S}_{ij}}\) for Smag and \(\nu_{e} = \left(C_l \Delta\right)^3 |\nabla \overline{\omega}|\) for Leith. We use \(C_s = 0.17\) for Smag, as proposed in a previous work [21]; \(C_s\) and \(C_l\) are estimated dynamically based on the local flow structure for DSmag and DLeith, respectively. This procedure can yield \(\nu_{e} \leq 0\), potentially resulting in numerical instabilities; therefore, the commonly used “positive clipping" is applied to enforce diffusion (\(\nu_e \geq 0\)) [71].

References↩︎

[1]

S. Pope, Turbulent flows(Cambridge university press, 2000).

[2]

P. Sagaut, Large eddy simulation for incompressible flows: an introduction(Springer Science & Business Media, 2006).

[3]

D. Randall, M. Khairoutdinov, A. Arakawa, and W. Grabowski, Breaking the cloud parameterization deadlock, Bulletin of the American Meteorological Society 84, 1547 (2003).

[4]

H. T. Hewitt, M. Roberts, P. Mathiot, A. Biastoch, E. Blockley, E. P. Chassignet, B. Fox-Kemper, P. Hyder, D. P. Marshall, E. Popova, et al., Resolving and parameterising the ocean mesoscale in earth system models, Current Climate Change Reports 6, 137 (2020).

[5]

B. Fox-Kemper, A. Adcroft, C. W. Böning, E. P. Chassignet, E. Curchitser, G. Danabasoglu, C. Eden, M. H. England, R. Gerdes, R. J. Greatbatch, et al., Challenges and prospects in ocean circulation models, Frontiers in Marine Science 6, 65 (2019).

[6]

C. Meneveau and J. Katz, Scale-invariance and turbulence models for large-eddy simulation, Annual Review of Fluid Mechanics 32, 1 (2000).

[7]

K. Duraisamy, G. Iaccarino, and H. Xiao, Turbulence modeling in the age of data, Annual review of fluid mechanics 51, 357 (2019).

[8]

T. Palmer, Climate forecasting: Build high-resolution global climate models, Nature 515, 338 (2014).

[9]

J. Slingo and T. Palmer, Uncertainty in weather and climate prediction, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 369, 4751 (2011).

[10]

C.-Y. Lai, P. Hassanzadeh, A. Sheshadri, M. Sonnewald, R. Ferrari, and V. Balaji, Machine learning for climate physics and simulations, arXiv preprint arXiv:2404.13227 (2024).

[11]

A. Bracco, J. Brajard, H. A. Dijkstra, P. Hassanzadeh, C. Lessig, and C. Monteleoni, Machine learning for the physics of climate, Nature Reviews Physics 7, 6 (2025).

[12]

J. Berner, U. Achatz, L. Batte, L. Bengtsson, A. De La Camara, H. M. Christensen, M. Colangeli, D. R. Coleman, D. Crommelin, S. I. Dolaptchiev, et al., Stochastic parameterization: Toward a new view of weather and climate models, Bulletin of the American Meteorological Society 98, 565 (2017).

[13]

P. Gentine, M. Pritchard, S. Rasp, G. Reinaudi, and G. Yacalis, Could machine learning break the convection parameterization deadlock?, Geophysical Research Letters 45, 5742 (2018).

[14]

J. Marston and S. Tobias, Recent developments in theories of inhomogeneous and anisotropic turbulence, Annual Review of Fluid Mechanics 55, 351 (2023).

[15]

A. Leonard, Energy cascade in large-eddy simulations of turbulent fluid flows, in Advances in geophysics, Vol. 18(Elsevier, 1975) pp. 237–248.

[16]

R. A. Clark, J. H. Ferziger, and W. C. Reynolds, Evaluation of subgrid-scale models using an accurately simulated turbulent flow, Journal of fluid mechanics 91, 1 (1979).

[17]

A. Leonard, Large-eddy simulation of chaotic convection and beyond, in 35th Aerospace Sciences Meeting and Exhibit(1997) p. 204.

[18]

D. Carati, S. Ghosal, and P. Moin, On the representation of backscatter in dynamic localization models, Physics of Fluids 7, 606 (1995).

[19]

L. Zanna and T. Bolton, Data-driven equation discovery of ocean mesoscale closures, Geophysical Research Letters 47, e2020GL088376 (2020).

[20]

K. Jakhar, Y. Guan, R. Mojgani, A. Chattopadhyay, and P. Hassanzadeh, Learning closed-form equations for subgrid-scale closures from high-fidelity data: Promises and challenges, Journal of Advances in Modeling Earth Systems 16, e2023MS003874 (2024).

[21]

D. K. Lilly, The representation of small-scale turbulence in numerical simulation experiments, in Proc. IBM Sci. Comput. Symp. on Environmental Science(1967) pp. 195–210.

[22]

M. Germano, U. Piomelli, P. Moin, and W. H. Cabot, A dynamic subgrid-scale eddy viscosity model, Physics of Fluids A: Fluid Dynamics 3, 1760 (1991).

[23]

D. K. Lilly, A proposed modification of the Germano subgrid-scale closure method, Physics of Fluids A: Fluid Dynamics 4, 633 (1992).

[24]

C. Leith, Stochastic models of chaotic systems, Physica D: Nonlinear Phenomena 98, 481 (1996).

[25]

J. Smagorinsky, General circulation experiments with the primitive equations: I. the basic experiment, Monthly weather review 91, 99 (1963).

[26]

G. Shutts, A kinetic energy backscatter algorithm for use in ensemble prediction systems, Quarterly Journal of the Royal Meteorological Society: A journal of the atmospheric sciences, applied meteorology and physical oceanography 131, 3079 (2005).

[27]

S. Chen, R. E. Ecke, G. L. Eyink, M. Rivera, M. Wan, and Z. Xiao, Physical mechanism of the two-dimensional inverse energy cascade, Physical review letters 96, 084502 (2006).

[28]

M. F. Jansen and I. M. Held, Parameterizing subgrid-scale eddy effects using energetically consistent backscatter, Ocean Modelling 80, 36 (2014).

[29]

M. F. Jansen, I. M. Held, A. Adcroft, and R. Hallberg, Energy budget-based backscatter in an eddy permitting primitive equation model, Ocean Modelling 94, 15 (2015).

[30]

I. Grooms, Y. Lee, and A. J. Majda, Numerical schemes for stochastic backscatter in the inverse cascade of quasigeostrophic turbulence, Multiscale Modeling & Simulation 13, 1001 (2015).

[31]

S. Khani and M. L. Waite, Backscatter in stratified turbulence, European Journal of Mechanics-B/Fluids 60, 1 (2016).

[32]

A. Ross, Z. Li, P. Perezhogin, C. Fernandez-Granda, and L. Zanna, Benchmarking of machine learning ocean subgrid parameterizations in an idealized model, Journal of Advances in Modeling Earth Systems 15, e2022MS003258 (2023).

[33]

S. Khani and C. N. Dawson, A gradient based subgrid-scale parameterization for ocean mesoscale eddies, Journal of Advances in Modeling Earth Systems 15, e2022MS003356 (2023).

[34]

M. Kang, Y. Jeon, and D. You, Neural-network-based mixed subgrid-scale model for turbulent flow, Journal of Fluid Mechanics 962, A38 (2023).

[35]

P. Perezhogin, C. Zhang, A. Adcroft, C. Fernandez-Granda, and L. Zanna, Implementation of a data-driven equation-discovery mesoscale parameterization into an ocean model, arXiv preprint arXiv:2311.02517 (2023).

[36]

Y. Guan, A. Chattopadhyay, A. Subel, and P. Hassanzadeh, Stable a posteriori LES of 2D turbulence using convolutional neural networks: Backscattering analysis and generalization to higher Re via transfer learning, Journal of Computational Physics 458, 111090 (2022).

[37]

Y. Guan, P. Hassanzadeh, T. Schneider, O. Dunbar, D. Z. Huang, J. Wu, and I. Lopez-Gomez, Online learning of eddy-viscosity and backscattering closures for geophysical turbulence using ensemble kalman inversion, arXiv preprint arXiv:2409.04985 (2024).

[38]

D. Carati, G. S. Winckelmans, and H. Jeanmart, On the modelling of the subgrid-scale and filtered-scale stress tensors in large-eddy simulation, Journal of Fluid Mechanics 441, 119 (2001).

[39]

R. D. Moser, S. W. Haering, and G. R. Yalla, Statistical properties of subgrid-scale turbulence models, Annual Review of Fluid Mechanics 53, 255 (2021).

[40]

S. L. Brunton, B. R. Noack, and P. Koumoutsakos, Machine learning for fluid mechanics, Annual review of fluid mechanics 52, 477 (2020).

[41]

V. Eyring, W. D. Collins, P. Gentine, E. A. Barnes, M. Barreiro, T. Beucler, M. Bocquet, C. S. Bretherton, H. M. Christensen, K. Dagon, et al., Pushing the frontiers in climate modelling and analysis with machine learning, Nature Climate Change 14, 916 (2024).

[42]

R. Maulik, O. San, A. Rasheed, and P. Vedula, Data-driven deconvolution for large eddy simulations of Kraichnan turbulence, Physics of Fluids 30, 125109 (2018).

[43]

A. Beck, D. Flad, and C.-D. Munz, Deep neural networks for data-driven LES closure models, Journal of Computational Physics 398, 108910 (2019).

[44]

A. Subel, Y. Guan, A. Chattopadhyay, and P. Hassanzadeh, Explaining the physics of transfer learning in data-driven turbulence modeling, PNAS Nexus , pgad015 (2023).

[45]

K. Srinivasan, M. D. Chekroun, and J. C. McWilliams, Turbulence closure with small, local neural networks: Forced two-dimensional and \(\beta\)-plane flows, Journal of Advances in Modeling Earth Systems 16, e2023MS003795 (2024).

[46]

G. Novati, H. L. de Laroussilhe, and P. Koumoutsakos, Automating turbulence modelling by multi-agent reinforcement learning, Nature Machine Intelligence 3, 87 (2021).

[47]

H. J. Bae and P. Koumoutsakos, Scientific multi-agent reinforcement learning for wall-models of turbulent flows, Nature Communications 13, 1443 (2022).

[48]

M. Kurz, P. Offenhäuser, and A. Beck, Deep reinforcement learning for turbulence modeling in large eddy simulations, International journal of heat and fluid flow 99, 109094 (2023).

[49]

M. Schmidt and H. Lipson, Distilling free-form natural laws from experimental data, science 324, 81 (2009).

[50]

S.-M. Udrescu and M. Tegmark, Ai feynman: A physics-inspired method for symbolic regression, Science advances 6, eaay2631 (2020).

[51]

S. L. Brunton, J. L. Proctor, and J. N. Kutz, Discovering governing equations from data by sparse identification of nonlinear dynamical systems, Proceedings of the national academy of sciences 113, 3932 (2016).

[52]

S. Zhang and G. Lin, Robust data-driven discovery of governing physical laws with error bars, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences 474, 20180305 (2018).

[53]

Y. Chen, Y. Luo, Q. Liu, H. Xu, and D. Zhang, Symbolic genetic algorithm for discovering open-form partial differential equations (SGA-PDE), Physical Review Research 4, 023174 (2022).

[54]

A. Grundner, T. Beucler, P. Gentine, and V. Eyring, Data-driven equation discovery of a cloud cover parameterization, Journal of Advances in Modeling Earth Systems 16, e2023MS003763 (2024).

[55]

J. Newey, J. P. Whitehead, and E. Carlson, Model discovery on the fly using continuous data assimilation, Journal of Computational Physics , 114121 (2025).

[56]

M. Cranmer, Interpretable machine learning for science with PySR and SymbolicRegression. jl, arXiv preprint arXiv:2305.01582 (2023).

[57]

R. Mojgani, A. Chattopadhyay, and P. Hassanzadeh, Discovery of interpretable structural model errors by combining Bayesian sparse regression and data assimilation: A chaotic Kuramoto-Sivashinsky test case, Chaos: An Interdisciplinary Journal of Nonlinear Science 32, 061105 (2022).

[58]

M. E. Tipping, Sparse Bayesian learning and the relevance vector machine, Journal of machine learning research 1, 211 (2001).

[59]

J. Wouters and V. Lucarini, Multi-level dynamical systems: Connecting the ruelle response theory and the mori-zwanzig approach, Journal of Statistical Physics 151, 850 (2013).

[60]

E. J. Parish and K. Duraisamy, Non-markovian closure models for large eddy simulations using the mori-zwanzig formalism, Physical Review Fluids 2, 014604 (2017).

[61]

K. Jakhar, https://github.com/jakharkaran/ArbFFT(2024).

[62]

S. Pope, A more general effective-viscosity hypothesis, Journal of Fluid Mechanics 72, 331 (1975).

[63]

J. A. Anstey and L. Zanna, A deformation-based parametrization of ocean mesoscale eddy Reynolds stresses, Ocean Modelling 112, 99 (2017).

[64]

T. B. Gatski and C. G. Speziale, On explicit algebraic stress models for complex turbulent flows, Journal of fluid Mechanics 254, 59 (1993).

[65]

T. Jongen and T. Gatski, General explicit algebraic stress relations and best approximation for three-dimensional flows, International Journal of Engineering Science 36, 739 (1998).

[66]

T. S. Lund and E. Novikov, Parameterization of subgrid-scale stress by the velocity gradient tensor, Annual Research Briefs, 1992 (1993).

[67]

H. Li, Y. Zhao, J. Wang, and R. D. Sandberg, Data-driven model development for large-eddy simulation of turbulence using gene-expression programing, Physics of Fluids 33, 125127 (2021).

[68]

M. Reissmann, J. Hasslberger, R. D. Sandberg, and M. Klein, Application of gene expression programming to a-posteriori LES modeling of a Taylor Green vortex, Journal of Computational Physics 424, 109859 (2021).

[69]

K. Jakhar, R. Mojgani, M. Darman, Y. Guan, and P. Hassanzadeh, https://github.com/envfluids/py2d(2024).

[70]

K. Jakhar, https://github.com/jakharkaran/EqsDiscovery_2D-FHIT_RBC(2023).

[71]

R. Maulik, O. San, A. Rasheed, and P. Vedula, Subgrid modelling for two-dimensional turbulence using neural networks, Journal of Fluid Mechanics 858, 122 (2019).

An Analytical and AI-discovered Stable, Accurate, and Generalizable Subgrid-scale Closure for Geophysical Turbulence

Abstract

Acknowledgment↩︎

Data availability↩︎

Appendix↩︎

References↩︎

Subjects

Updated on Academus