September 25, 2025
The immense computational cost of simulating turbulence has motivated the use of machine learning approaches for super-resolving turbulent flows. A central challenge is ensuring that learned models respect physical symmetries, such as rotational equivariance. We show that standard convolutional neural networks (CNNs) can partially acquire this symmetry without explicit augmentation or specialized architectures, as turbulence itself provides implicit rotational augmentation in both time and space. Using 3D channel-flow subdomains with differing anisotropy, we find that models trained on more isotropic mid-plane data achieve lower equivariance error than those trained on boundary-layer data, and that greater temporal or spatial sampling further reduces this error. We further identify a distinct scale dependence of the equivariance error that holds regardless of dataset anisotropy and is consistent with Kolmogorov’s local isotropy hypothesis. These results clarify when rotational symmetry must be explicitly incorporated into learning algorithms and when it can be obtained directly from turbulence, enabling more efficient and symmetry-aware super-resolution.
Super-resolution (SR) with machine learning has become a promising method to augment numerical simulations of turbulence by boosting the effective resolution of expensive calculations [1]. In this setting, convolutional neural networks (CNNs) have been widely applied to reconstruct velocity and vorticity fields from coarse inputs [2]–[5]. Complementary approaches span models that incorporate temporal coherence and dynamics-aware training objectives [6], [7], as well as generative methods such as generative adversarial networks (GANs) [8] and diffusion models that reproduce realistic spectra and scaling laws [9]–[11].
An important dimension of physical consistency is the treatment of symmetries. In turbulent flows, the velocity field \(\mathbf{U}(\mathbf{x}, t)\) is a random vector field whose statistics may exhibit distributional symmetries, i.e., invariance of the probability law under a group action, \(p(\mathbf{U}) = p(g \cdot \mathbf{U})\) for transformations \(g\) such as translations (homogeneity) or rotations/reflections (isotropy). Kolmogorov’s local isotropy hypothesis further asserts that when turbulence is strong enough for small eddies to form without being damped by viscosity, small-scale motions approach isotropy even when large scales are anisotropic [12], [13]. A formal treatment of statistical isotropy is provided in Appendix 5.
Equivariance is often an important inductive bias in machine learning for physics, as it ensures that models respect the symmetries of the underlying system [14]. In short, an equivariant model \(f\) satisfies \(f(g \cdot \mathbf{U}) = g \cdot f(\mathbf{U})\), so that outputs transform consistently with the inputs. Several strategies have been proposed to incorporate equivariance into SR. Architectural approaches impose symmetry exactly through group-equivariant convolutions or neural operators [15], [16], while loss-based approaches regularize models to transform consistently without altering their backbone [17], [18]. Relaxed group convolutions have been used to probe isotropy-breaking in turbulence by permitting departures from exact equivariance, highlighting its scale dependence [19]. The authors of [20] showed that CNNs learn rotation equivariance only when the coarsening operator commutes with rotations, sharpening the question of when equivariance can emerge implicitly from data.
These considerations motivate the central question of this study: to what extent does turbulence itself provide the rotational augmentation required for learning equivariance? We investigate this question by analyzing the role of statistical isotropy in determining whether rotational equivariance can be learned implicitly from data or must be imposed through explicit transformations. To this end, we compare models trained on channel-flow subdomains with differing anisotropy, contrasting boundary-layer regions against the more isotropic mid-plane. Our results demonstrate that:
Imposing explicit rotational augmentation during training improves generalization to unseen test data, indicating that rotational symmetry is a useful inductive bias for turbulence SR.
Increasing the temporal and spatial domain spanned by the training data increases its statistical isotropy, enabling models to acquire more rotational equivariance without explicit augmentation.
Equivariance error exhibits a distinct scale dependence, consistent with the stronger small-scale isotropy predicted by Kolmogorov’s hypothesis.
Collectively, these findings establish turbulence as a setting where understanding the interplay between data symmetries and model design is essential for physically grounded learning.
Our study aims to test whether CNNs can acquire rotational equivariance implicitly from being trained to perform SR on turbulence data. The inputs to the CNNs are turbulent velocity fields discretized on a 3D Cartesian grid, with each grid cell storing the velocity components (\(u\), \(v\), \(w\)) along the input channel dimension. To assess whether models preserve rotational symmetries, we evaluate their equivariance error. Given an input velocity field \(\overline{\mathbf{U}}\) and a model \(f\), for any rotation \(g \in G\), the absolute equivariance error is defined pointwise as \(\mathcal{E}(\overline{\mathbf{U}}; g) \;=\; \big\lVert f(g \cdot \overline{\mathbf{U}}) - g \cdot f(\overline{\mathbf{U}}) \big\rVert\), where \(g \cdot \overline{\mathbf{U}}\) denotes the rotated input and \(g \cdot f(\overline{\mathbf{U}})\) the rotated model output. Averaging over all group elements and \(N\) samples yields the overall equivariance error \[\overline{\mathcal{E}} \;=\; \frac{1}{|G|N} \sum_{g \in G} \sum_{n=1}^N \mathcal{E}(\overline{\mathbf{U}}_n; g).\] We evaluate equivariance error over the discrete octahedral group \(O\) (rotations without inversions). Further discussion is provided in Appendix 7.
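To make this evaluation concrete, the following minimal PyTorch sketch computes the error for a single group element, a 90° rotation about the \(z\)-axis; the helper name, the batch handling, and the convention that dims 1 and 2 of a \((3, D, H, W)\) tensor index the \(x\) and \(y\) directions are illustrative assumptions rather than our exact implementation. Averaging over all 24 elements of \(O\) uses the group enumeration sketched later in the text.

```python
import torch

def rotate_z90(U):
    """Rotate a velocity field U of shape (3, D, H, W) by 90 degrees about
    the z-axis: rotate the grid in the x-y plane, then mix the vector
    components as (u, v, w) -> (-v, u, w). The rot90 direction must match
    the component rotation under the chosen index-to-coordinate convention.
    """
    U = torch.rot90(U, k=1, dims=(1, 2))    # spatial part: grid rotation
    u, v, w = U[0], U[1], U[2]
    return torch.stack((-v, u, w), dim=0)   # component part: R @ (u, v, w)

def equivariance_error(model, U, rotate=rotate_z90):
    """Domain-averaged pointwise norm of f(g . U) - g . f(U) for one g."""
    with torch.no_grad():
        f_of_gU = model(rotate(U).unsqueeze(0))[0]   # f(g . U)
        g_of_fU = rotate(model(U.unsqueeze(0))[0])   # g . f(U)
    return (f_of_gU - g_of_fU).norm(dim=0).mean()
```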
We employ a compact multi-scale convolutional super-resolution network that upsamples a low-resolution velocity field volume \(\overline{\mathbf{U}} \in \mathbb{R}^{3\times D \times H \times W}\) to the target high-resolution \(\mathbf{U} \in \mathbb{R}^{3\times sD \times sH \times sW}\). Upsampling by factor \(s\) is implemented as a sequence of resize-then-refine stages, one for each factor of 2. At each stage, the input is upsampled through trilinear interpolation and passed through two convolutional layers. A final convolution projects the features to 3 output channels, yielding the super-resolved prediction. The SR model is trained to minimize the mean absolute error (MAE) loss between the ground-truth and predicted high-resolution fields, which has been shown to better preserve perceptual quality and reduce oversmoothing compared to mean squared error (MSE) loss in image restoration tasks [21]. See Appendix 6 for details on training hyperparameters.
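A minimal sketch of this resize-then-refine backbone follows, using the hyperparameters reported in Appendix 6 (two \(\times 2\) stages for \(s=4\), kernels of size 3 with reflection padding, 128 hidden channels); the class and helper names are our own illustrative choices.

```python
import torch.nn as nn

def conv3(c_in, c_out):
    # kernel size 3 with reflection padding of one voxel on each side
    return nn.Sequential(nn.ReflectionPad3d(1),
                         nn.Conv3d(c_in, c_out, kernel_size=3))

class SRNet(nn.Module):
    """Resize-then-refine SR network: one stage per factor of 2, each
    stage = trilinear upsampling followed by two convolutions."""
    def __init__(self, hidden=128, n_stages=2):
        super().__init__()
        stages, c = [], 3
        for _ in range(n_stages):
            stages.append(nn.Sequential(
                nn.Upsample(scale_factor=2, mode="trilinear",
                            align_corners=False),
                conv3(c, hidden), nn.ReLU(),
                conv3(hidden, hidden), nn.ReLU()))
            c = hidden
        self.stages = nn.ModuleList(stages)
        self.head = conv3(hidden, 3)    # project features back to (u, v, w)

    def forward(self, x):               # x: (B, 3, D, H, W)
        for stage in self.stages:
            x = stage(x)
        return self.head(x)             # (B, 3, sD, sH, sW) with s = 4
```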
We evaluate models on 3D channel-flow subdomains drawn from the direct numerical simulation in the Johns Hopkins Turbulence Database [22]. We subsample the available data into 150 evenly spaced timesteps for the experiments in our study. To probe the effect of anisotropy, we compare (i) boundary-layer regions near the channel wall, where turbulence is strongly anisotropic, against (ii) mid-plane subdomains, where it is closer to isotropic (see Figure 1 for the component-wise energy spectra). To test temporal data augmentation, we sample from a time series of a single subdomain at a fixed \(y\)-coordinate (see Figure 1). The training set is sampled from the first 100 timesteps, while the validation and test sets consist of the subsequent 30 and 20 timesteps, respectively. To test spatial data augmentation, we add randomly sampled, non-overlapping subdomains at a fixed \(y\)-coordinate and timestep.
Subdomain sizes were carefully selected to test the implicit data augmentation hypothesis: the subdomains must contain inertial-range length scales, where turbulence begins to cascade toward isotropy. As shown in Figure 1, a significant portion of the inertial range (where \(E(k) \sim k^{-5/3}\)) of the energy cascade is captured within each subdomain. The low-resolution inputs are obtained by applying a box filter and downsampling the high-resolution fields by a scaling factor of \(s=4\), thereby truncating the available length scales in the input vector fields. All code and data used in our experiments are available at https://github.com/atomicarchitects/turbulence-implicit-augmentation.
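Because a box filter followed by subsampling at stride \(s\) is equivalent to non-overlapping average pooling with window \(s\), the coarsening operator reduces to a one-liner; this sketch assumes the field dimensions are divisible by \(s\).

```python
import torch.nn.functional as F

def coarsen(U, s=4):
    """Box filter + downsample a high-resolution field by factor s.
    U: (3, D, H, W) -> (3, D // s, H // s, W // s)."""
    return F.avg_pool3d(U.unsqueeze(0), kernel_size=s, stride=s).squeeze(0)
```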
Implicit augmentation over time. We first assess how the number of timesteps used for training affects super-resolution accuracy and rotational equivariance. Training sets consist of the first \(T \in \{1, 5, 10, 20, 50, 100\}\) timesteps of each simulation. To test for implicit rotational coverage, we compare models trained on raw data against those trained with explicit octahedral augmentation, in which each training snapshot is randomly rotated at every epoch. Figure 2 shows results for the anisotropic boundary layer and the more isotropic mid-plane. Test MAE decreases steadily with \(T\), while equivariance error drops quickly and saturates after only a few snapshots. Explicit augmentation reduces both MAE and equivariance error, with the strongest benefit in low-data regimes and in anisotropic boundary data. By contrast, mid-plane models benefit less, consistent with stronger implicit augmentation from isotropy. We further characterize these effects by computing the Fourier power spectrum of equivariance error fields (Figure 3). High-wavenumber modes exhibit consistently lower error across all training conditions, confirming that small scales act as a natural source of rotational consistency. Enlarging the training set in time or applying explicit augmentation primarily reduces error at intermediate scales, while all models converge in the dissipative range.
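The explicit octahedral augmentation can be implemented by enumerating the 24 proper rotations of the cube as signed permutation matrices and applying a randomly drawn one to each input-target pair every epoch. A sketch, assuming subdomains for which grid flips and axis permutations are exact lattice symmetries:

```python
import itertools
import torch

def octahedral_group():
    """The 24 proper rotations of the cube as signed permutation matrices."""
    mats = []
    for perm in itertools.permutations(range(3)):
        for signs in itertools.product((1.0, -1.0), repeat=3):
            R = torch.zeros(3, 3)
            for i in range(3):
                R[i, perm[i]] = signs[i]
            if torch.det(R) > 0:        # discard the 24 improper elements
                mats.append(R)
    return mats

def apply_rotation(U, R):
    """Apply (rho(g) U)(x) = R U(R^{-1} x) to a field U: (3, D, H, W)."""
    perm = R.abs().argmax(dim=1)        # old axis feeding each new axis
    signs = R.sum(dim=1)                # sign carried by each new axis
    V = U.permute(0, *(int(p) + 1 for p in perm))      # permute grid axes
    flips = [i + 1 for i in range(3) if signs[i] < 0]  # flip negated axes
    if flips:
        V = V.flip(flips)
    return torch.einsum("ij,j...->i...", R, V)         # mix (u, v, w)

# example: augment one (low-res, high-res) training pair
lo = torch.randn(3, 16, 16, 16)        # placeholder low-resolution input
hi = torch.randn(3, 64, 64, 64)        # placeholder high-resolution target
G = octahedral_group()
R = G[torch.randint(len(G), (1,)).item()]
lo_aug, hi_aug = apply_rotation(lo, R), apply_rotation(hi, R)
```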
Implicit augmentation over space. Kolmogorov’s hypothesis suggests that increasing the spatial domain size should capture more isotropic small-scale motions. To test this, we fix the training set to a single temporal snapshot \((T=1)\) and increase the number of boxes extracted from the channel flow at a fixed \(y\)-coordinate. In particular, for the original boundary and mid-plane boxes, we randomly sample two additional boxes in the same respective \(xz\)-plane so that the expected degree of anisotropy in each box is maintained (by the \(xz\)-plane homogeneity of this flow). As shown in Figures 2 and 3, the larger spatial domain consistently yields lower equivariance error, particularly at intermediate and large scales. Notably, training on three boxes from a single snapshot achieves an equivariance error comparable to training on 100 sequential snapshots in the mid-plane case. This reflects the fact that temporally adjacent snapshots are strongly correlated, whereas spatially distinct boxes supply more diverse and less redundant samples. Future work will investigate how temporally correlated snapshots affect a model’s ability to learn equivariance [23]. Additionally, our results extend the single-snapshot SR results of Fukami et al. [4] by showing that rotational equivariance can also be partially learned from a single snapshot of turbulence.
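For reference, the scale decomposition of the error field can be computed as a radially averaged 3D power spectrum of the pointwise error magnitude \(\lVert f(g \cdot \overline{\mathbf{U}}) - g \cdot f(\overline{\mathbf{U}}) \rVert\); the integer-wavenumber shell binning in the sketch below is an illustrative simplification.

```python
import torch

def radial_power_spectrum(field):
    """Radially averaged power spectrum of a scalar field (D, H, W),
    binned into integer-wavenumber shells."""
    D, H, W = field.shape
    power = torch.fft.fftn(field).abs() ** 2
    kx = torch.fft.fftfreq(D) * D       # integer wavenumbers per axis
    ky = torch.fft.fftfreq(H) * H
    kz = torch.fft.fftfreq(W) * W
    K = (kx[:, None, None] ** 2 + ky[None, :, None] ** 2
         + kz[None, None, :] ** 2).sqrt()
    shell = K.round().long().flatten()  # nearest-integer shell index
    E = torch.bincount(shell, weights=power.flatten())
    n = torch.bincount(shell).clamp(min=1)
    return torch.arange(len(E)), E / n  # (wavenumber, mean shell power)
```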
We have shown that statistical isotropy in turbulence acts as an implicit form of data augmentation, enabling convolutional models to acquire rotational equivariance without explicit enforcement. By analyzing equivariance error across both temporal and spatial sampling, we demonstrated that turbulence provides scale-dependent distributional symmetry: small scales consistently exhibit near-isotropy, while larger scales inherit anisotropy from boundary conditions. This characterization highlights turbulence as a natural test bed for developing methods that address multiscale symmetries.
While we focus here on super-resolution, implicit augmentation from isotropy is a general property of turbulent statistics. Symmetry-aware methods have already been applied to other fluids tasks, such as wall-shear estimation and turbulence closures, where they yield gains in physical consistency under frame changes [24], [25]. These successes suggest that the benefits of symmetry extend well beyond reconstruction tasks. Future work will investigate how the local inductive bias and translation equivariance of CNNs influence their ability to capture multiscale isotropy in turbulent flows. More broadly, our study underscores the importance of reasoning about distributional symmetries in the data itself, alongside architectural design, as a key ingredient for effective and physically consistent learning.
We acknowledge the support of the National Science Foundation under Cooperative Agreement PHY-2019786 (The NSF AI Institute for Artificial Intelligence and Fundamental Interactions). Julia Balla was supported by the Department of Defense (DoD) through the National Defense Science & Engineering Graduate (NDSEG) Fellowship Program. Jeremiah Bailey was supported by the MIT Summer Research Program (MSRP). Elyssa Hofgard was supported by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research, Department of Energy Computational Science Graduate Fellowship under Award Number DE-SC0024386. Ryley McConkey was supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) and the Thornton Family Fund.
In turbulent flows, the velocity field \(\mathbf{U}(\mathbf{x}, t)\) is a time-dependent random vector field, and its symmetries are naturally expressed in terms of distributions. For a vector field \(\mathbf{U}:\Omega\!\to\!\mathbb{R}^d\), a group \(G\) acts on both coordinates and components as \((\rho(g)\mathbf{U})(\mathbf{x}) \;=\; g\,\mathbf{U}\!\left(g^{-1}\mathbf{x}\right)\). The field is said to be statistically homogeneous if all statistics are invariant under shifts in the origin of the coordinate system. Formally, for any translation \(\mathbf{r}\in\mathbb{R}^d\) and any \(N\), the \(N\)-point joint distribution is invariant: \[\left(\mathbf{U}(\mathbf{x}_1, t), \ldots, \mathbf{U}(\mathbf{x}_N, t)\right) \stackrel{d}{=} \left(\mathbf{U}(\mathbf{x}_1 + \mathbf{r}, t), \ldots, \mathbf{U}(\mathbf{x}_N + \mathbf{r}, t)\right).\] If, in addition, the distribution is invariant under rotations and reflections, the field is statistically isotropic, i.e., for any rotation \(g \in O(3)\), \[(\mathbf{U}(\mathbf{x}_1, t), \ldots, \mathbf{U}(\mathbf{x}_N, t)) \stackrel{d}{=} (g\mathbf{U}(g^{-1}\mathbf{x}_1, t), \ldots, g\mathbf{U}(g^{-1}\mathbf{x}_N, t)).\] Even in anisotropic flows, Kolmogorov’s local isotropy hypothesis posits that small-scale motions recover statistical isotropy when the Reynolds number—the ratio of inertial to viscous forces—is sufficiently high.
We use a fixed CNN super-resolution architecture for all of our experiments. The network contains two successive upsampling layers, each enlarging the input by a factor of \(2\), resulting in an overall scale factor of \(s=4\). 3D convolutions use kernels of size \(3\) with reflection padding of one pixel on each side, followed by ReLU activations. All hidden layers have 128 channels. Models are trained with the Adam optimizer with learning rate \(3 \times 10^{-4}\) and batch size \(16\).
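Combining these hyperparameters with the architecture sketched in the main text, a skeletal training loop looks as follows; the placeholder dataset stands in for the JHTDB subdomains.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# placeholder (low-res, high-res) pairs; real data come from the JHTDB
lo = torch.randn(32, 3, 16, 16, 16)
hi = torch.randn(32, 3, 64, 64, 64)
loader = DataLoader(TensorDataset(lo, hi), batch_size=16, shuffle=True)

model = SRNet()                         # the sketch from the main text
opt = torch.optim.Adam(model.parameters(), lr=3e-4)
mae = torch.nn.L1Loss()                 # MAE training objective

for x, y in loader:
    opt.zero_grad()
    loss = mae(model(x), y)
    loss.backward()
    opt.step()
```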
All evaluations in this work are carried out with respect to the octahedral group \(O\) (rotations without inversions). In a continuum, the Navier–Stokes equations are invariant under the full 3D rotation group \(SO(3)\); discretization and downsampling on a Cartesian grid break this invariance, leaving only the rotational symmetries of the cube. Filtering operations used to generate low-resolution inputs further introduce deviations, so the data contain small but unavoidable rotational artifacts. Although in principle it is possible to test equivariance with respect to the entire \(SO(3)\) group using interpolation schemes, such approaches introduce additional complexity and ambiguity in the comparison. We leave both \(SO(3)\) evaluations and the inclusion of mirror and inversion symmetries for future work.