September 24, 2025
Recent efforts have extended our view of the number and properties of satellite galaxies beyond the Local Group firmly down to \(\rm M_\star\sim 10^6 M_\odot\). A similarly complete view of the field dwarf population has lagged behind. Using the background galaxies sample from the Satellites Around Galactic Analogs (SAGA) Survey at \(z<0.05\), we take inventory of the dwarf population down to \(\rm M_\star \sim 5\times10^6 M_\odot\) using three metrics: the stellar mass function (SMF) as function of environment, the stellar-to-halo mass relation (SHMR) of dwarf galaxies inferred via abundance matching, and the quenched fraction of highly isolated dwarfs. We find that the low-mass SMF shape shows minimal environmental dependence, with the field dwarf SMF described by a low-mass power-law index of \(\alpha_1=-1.44\pm0.09\) down to \(\rm M_\star \sim 5\times10^6 M_\odot\), and that the quenched fraction of isolated dwarfs drops monotonically to \(f_{q} \sim 10^{-3}\) at \(\rm M_\star \sim \rm 10^{8.5} M_\odot\). Though slightly steeper than estimates from kinematic measures, our inferred SHMR agrees with literature measurements of satellite systems, consistent with minimal environmental dependence of the SHMR in the probed mass range. Finally, although most contemporary cosmological simulations against which we compare accurately predict the SAGAbg-SMF SHMR, we find that big-box cosmological simulations largely over-predict isolated galaxy quenched fractions via a turnaround in \(f_q(\rm M_\star)\) at \(\rm 10^8\lesssim M_\star/M_\odot\lesssim 10^9\), underscoring the complexities in disentangling the drivers of galaxy formation and the need for systematic multidimensional observations of the dwarf population across environments.
The galaxy stellar mass function (SMF), the differential number density of galaxies as a function of their stellar mass, is a key observable by which we inventory the content of the Universe. Though there are substantial uncertainties in the estimation of stellar masses at both the individual galaxy and population level, observed SMFs have shown remarkable consistency across the literature for massive low-redshift galaxies [1]–[5].
Measurements of the SMF at low stellar masses, however, have been elusive due to the difficulty in obtaining large samples of so-called “dwarf” galaxies, typically defined as galaxies with stellar masses \(\rm M_\star \lesssim 10^9 M_\odot\). While these galaxies are the most common type of galaxy at all redshifts, they remain at the observational frontier of extragalactic astronomy, both because they are intrinsically faint systems, and because obtaining reliable redshifts from photometry alone can be challenging for systems outside the resolved star regime. As such, constructing statistical samples of dwarf galaxies over large volumes remains a resource-intensive endeavor.
In the nearby (\(z\sim 0\)) and low-redshift (\(z\lesssim 0.2\)) Universe, searches for dwarf galaxies associated with more massive galaxies have led to increasingly precise stellar mass function measurements of satellite dwarfs [6]–[11]. Measurements of the SMF for field dwarfs outside the influence of a massive host have remained observationally challenging. This is due in part to the difficulty of selecting low-mass, low-redshift objects from imaging alone [12]–[15]. Nevertheless, it i s of substantial importance to understand the SMF across extragalactic environments due to the key role that the SMF plays in constraining galaxy formation models and probing the stellar-to-halo mass relation [16]–[22].
The relationship between the baryonic and dark matter content of low-mass galaxies has been a key, if elusive, ingredient in our understanding of both cosmology and extragalactic astrophysics. Dwarf galaxies have long been identified as key testing grounds of \(\Lambda\)CDM [23]–[31]; many accepted or proposed solutions to these challenges invoke baryonic feedback prescriptions to redistribute dark matter or modulate the galaxy–halo connection [32]–[43]. The connection between dark matter and baryons in low-mass galaxies is thus key not only to understanding the structure and evolution of dwarf galaxies, but to our understanding of the nature of dark matter at large.
Like the stellar mass function, measuring the stellar-to-halo mass relation (SHMR) of the dwarf galaxy population outside the influence of a more massive host is a long-standing observational challenge. This problem is, again, largely technical in nature: nearby dwarf galaxies for which we can measure dark matter halo masses via kinematics are typically associated with the Milky Way or Local Group [17], [44]–[46]. Individual measurements of halo masses via kinematic means also incur large uncertainties, and are necessarily directly measurable only out to the radius at which reliable kinematic properties can be securely measured. Recent progress has been made in measuring halo profiles of dwarf galaxies from weak lensing; however, this relies on stacked photometric samples rather than individual spectroscopic measurements [47].
In this work, we leverage the background spectra taken as part of the Satellites Around Galactic Analogs (SAGA) Survey to measure the stellar mass function of field dwarf galaxies down to \(\rm M_\star \sim 5\times10^6 M_\odot\). The recently completed SAGA Survey executed a census of the satellite systems of 101 nearby Milky Way-like analogs with the goal of placing the MW satellite system into cosmological context [11], [12], [22], [48], [49]. To identify the \(\sim 400\) satellites of the SAGA hosts identified in the main survey, more than 40000 total spectra were taken of candidate dwarf satellites; due to the photometric selection of these candidates, the objects that comprise the SAGA Background Sample (SAGAbg) are preferentially low-mass, low-redshift galaxies.
We presented the SAGAbg in and of this series, where we leveraged the high completeness of the SAGAbg sample for low-redshift dwarfs to estimate the mass-loading factor of dwarf galaxies and the late-time evolution of the Star-Forming Sequence, respectively. Here, we further leverage our characterization of the selection function of the SAGAbg galaxies to measure the stellar mass function of dwarf galaxies out to \(z\sim 0.05\).
Our approach to constructing SMF as a function of environment is presented across two sections: 2, which delineates the SAGAbg-SMF sample within the SAGA Survey, and 3, which describes the observational corrections we derive to measure the underlying SMF. We compute spectroscopic targeting corrections that account for the effective targeting volume of the SAGA Survey for each galaxy in the sample 3.1, and establish that photometric targeting incompleteness is highly subdominant in the construction of our observation-corrected SMF 3.2.
We fit the form of the SMF and use the measured SMF to derive two further results in 4: the stellar-to-halo mass relation of field dwarf galaxies down to \(\rm M_\star \sim 5\times10^6 M_\odot\) via abundance matching in 4.2, and the mass-dependence of quenching in isolated dwarf galaxies down to \(\rm M_\star \sim 4\times 10^8 M_\odot\) in 4.3. To guide the reader through this methodology, we present this outline in schematic form in 8.
These measurements push SMF and SHMR estimates of field dwarfs into the classical dwarf regime (\(\rm 10^5<M_\star<10^7M_\odot\), [30]), both extending our understanding of this extragalactic mass inventory in central galaxies and providing environmental context to previous measurements of the SMF and SHMR in satellite dwarf systems.
We adopt a flat \(\Lambda\)CDM cosmology with \(\Omega_m=0.3\) (\(\Omega_\Lambda=0.7\)), and \(H_0=70\;{\rm km\ s^{-1}}{}\;{\rm Mpc^{-1}}\). We use a [50] initial mass function (IMF) unless otherwise specified, and convert literature results that use other IMFs to a [50] IMF as stated in the text.
The SAGAbg sample, as presented in , is operationally defined as any spectrum within the SAGA spectroscopic catalog that is not associated with a SAGA host. In the present work, we introduce the SAGAbg-SMF sample: the set of all SAGAbg galaxies with a secure redshift at \(0<z<0.05\) that are not a SAGA satellite, but lie within the targeting-complete footprint of a SAGA host (a projected distance of \(<300\) kpc at the host redshift, or \(\sim 0.5\) deg).
SAGAbg-SMF, a set of 3097 galaxies at \(5\times 10^6 < \rm M_\star < 2\times 10^{11} M_\odot\), contains many of the same low-redshift galaxies as the SAGAbg-A sample introduced in of this series, but does not impose the same spectrum source requirements as SAGAbg-A. and of this series focused on SAGAbg-A, the subset of all SAGAbg galaxies for which redshifts were obtained either from archival wide-field surveys or from dedicated SAGA fiber spectroscopy with AAT/2dF or MMT/Hectospec. In [t:samplenumbers] we outline the sizes and differences of the various SAGAbg samples.
We set an upper redshift limit of \(z=0.05\) for the SAGAbg-SMF sample for three reasons: environment classification accuracy, SMF redshift evolution, and low-mass SAGAbg completeness. First, we use the HyperLEDA database [51] to assign environmental classifications based on physical proximity to any \(M_{K_s}<-23\) galaxy. For an extended source magnitude limit of \(m_{K_s}=13.5\), the 2MASS survey is complete down to \(M_{K_s}=-23\) out to \(z=0.046\), modulo \(K\)-corrections. We thus cannot accurately diagnose the environment of the SAGAbg-SMF sample beyond \(z\sim0.05\).
Second, the stellar mass function is known both theoretically and observationally to evolve with redshift [52]. In of this series, we defined the \(z\sim0\) approximation as the redshift range over which negligible galaxy evolution has occurred at the observational precision of that sample. We then demonstrated that for the SAGAbg-A sample, the maximum redshift at which the \(z\sim0\) approximation holds for the Star-Forming Sequence (SFS) is \(z=0.05\). We therefore adopt \(z=0.05{}\) as the maximum redshift for the SAGAbg-SMF sample.
Finally, the SAGAbg sample mass sensitivity is strongly redshift-dependent, with the limiting detectable mass rising to \(\rm M_\star \sim 10^8 M_\odot\) at \(z=0.05\). Because the focus of this work is on the dwarf galaxy population, we therefore restrict our analysis to \(z<0.05\).
We use the photometric stellar mass-to-light ratio and color prescriptions of [53], which are updated calibrations tuned for unbiased performance in stellar mass recovery of simulated galaxies at \(\rm M_\star < 10^9 M_\odot\): \[{M}_\star = 1.433(g-r) + 0.00153 M_g^2 - 0.335 M_g + 2.072,\] where \(M_g\) is the restframe \(g\)-band absolute magnitude. The updated stellar mass prescriptions induce a moderate systematic offset in our stellar mass estimates with respect to the color–mass relation used in [11], with a median difference of \(\rm \langle \log_{10}(M_\star/M_\odot)_{SAGA,DR3} - \log_{10}(M_\star/M_\odot)_{dlR}\rangle_{50} = -0.13\) over our sample.
In order to measure the SMF as a function of environment, we must first establish environmental criteria for the sample. To do so, we flag potential massive galaxies in the vicinity of the SAGAbg-SMF galaxies by querying the HyperLEDA database for galaxies at heliocentric velocities \(v<15000\;{\rm km\ s^{-1}}{}\) (\(z\lesssim 0.05\)) and absolute \(Ks\)-band magnitudes of \(M_{Ks}<-23\) (\(\log_{10} M_\star/M_\odot\gtrsim 10.4\)). Again, we emphasize that galaxies classified as satellites of the SAGA hosts themselves (i.e., the SAGA satellites presented in [11]) are not included in this sample.
We define three environmental classifications for galaxies in the SAGAbg-SMF sample at \(\rm M_\star < 10^{10} M_\odot\), where the upper limit of the stellar mass range is set by the approximate minimum host mass defined by the [54] criterion:
galaxies are those at \(\rm M_\star < 10^{10} M_\odot\) at \(c|z - z_{\rm massive}| > V_{\rm 200, massive}\) and \(R_{\rm proj}(z_{\rm massive}) > 1\) Mpc of any \(M_{K_s} < -23\) galaxy.
galaxies are those at \(\rm M_\star < 10^{10} M_\odot\) that do not meet the criteria of a field galaxy. Because we define potential hosts as galaxies with \(M_{K_s}<-23\) (\(\log_{10}({\rm M_\star/M_\odot})\) \(\sim 10\)), we do not assign field or satellite status to galaxies above this stellar mass threshold.
A more isolated subset of the field galaxies, following [54]. These galaxies must be at \(c|z - z_{\rm massive}| > 1000\;{\rm km\ s^{-1}}{}\) and \(R_{\rm proj}(z_{\rm massive}) > 1.5\) Mpc of a \(M_{K_s} < -23\) galaxy.
Here, \(c\) is the speed of light, \(z\) is the redshift of each SAGAbg-SMF galaxy, and \(z_{\rm massive}\) is the redshift of the potential host. This classification scheme requires an estimate of halo velocity \(V_{200}\) for the potential hosts in the sample. To approximate \(V_{200}\), we adopt a \(Ks\)-band mass-to-light ratio of \({M}_\star / L_{K_s} = 1 {M}_\odot/L_\odot\) and compute stellar masses from 2MASS \(Ks\)-band absolute magnitudes. We then use the stellar-to-halo mass relation of [19] to estimate halo masses, \(M_{200}\), from \(M_\star\). Finally, we relate \(M_{200}\) to \(V_{200}\) for an NFW halo [55] by: \[\label{e:v200} \begin{align} {M}_{200} &= \frac{V_{200}^3}{10 H_0 G},\\ \end{align}\tag{1}\] which follows from taking \(\rho_{\rm crit}(t) = \left(8\pi G H(t)\right)/3\), and relating the circular velocity of a NFW halo at \(R_{200}\) to \({M}_{200}\) and \(R_{200}\) via \(V_{200}^2 = G {M}_{200}/R_{200}\).
We refer to measurements over all galaxies in the SAGAbg-SMF sample as “SAGAbg-SMF (all)” – however, it should be noted that the SAGA survey fields were selected to avoid known background clusters. As such, our satellite and full samples are skewed away from the richest group and cluster environments.
We place the SAGAbg-SMF sample in the context of other wide-field spectroscopic surveys in 1 by comparing the SAGAbg-SMF galaxies to the redshift, apparent \(r\)-band magnitude, and stellar mass distributions of the Galaxy Mass and Assembly [5] and the SDSS-derived NASA Sloan Atlas [56]. To minimize systematic offsets, we recompute stellar masses for both the NSA and GAMA catalogs using the prescription of [53]. In both catalogs, we adopt Petrosian magnitudes measured from SDSS imaging1 to compute stellar masses.
As expected from its fainter \(r\)-band limits, the SAGAbg-SMF sample reaches lower in stellar mass than GAMA and SDSS at fixed redshift. To make a rough comparison of the stellar mass completeness limit across surveys, we estimate the stellar mass that corresponds to the limiting \(r\)-band magnitude of each survey (\(m_{r,\rm lim}=\{20.75,19.8,17.77\}\) for SAGA, GAMA, and SDSS, respectively) as a function of redshift based off the mean stellar mass-to-light ratio of the SAGAbg-SMF galaxies. The results of this estimate are shown in 1 as a function of redshift in the main left panel, and at \(z=0.05\) in the top right panel.
Although the SAGA Survey covers a smaller on-sky area (\(\Omega_{\rm SAGA}\sim 90\;\rm deg^2\)) than GAMA (\(\Omega_{\rm GAMA} \sim 180\rm \;deg^2\)) and SDSS (\(\Omega_{\rm SDSS}\sim 8500\rm\;deg^2\)), the fainter targeting limit and dwarf-focused photometric targeting of the SAGA Survey produce a low-mass, \(z<0.05\) galaxy sample that is comparable in number to GAMA at \(\log_{10}({\rm M_\star/M_\odot})\) \(\lesssim 8\) (\(N_{\rm SAGAbg-SMF}/N_{\rm GAMA,z<0.05}\approx 0.8\), where \(N_{\rm SAGAbg-SMF}=473\)).
This rough estimate allows us to compare the spectroscopic targeting limit between surveys as a function of stellar mass, but does not fully describe the implications of the SAGA selection function (which is not a purely magnitude-limited survey like GAMA and SDSS). Because the SAGA selection function is of tantamount importance to our measurement of the stellar mass function, we will defer our analysis of the SAGAbg-SMF photometric selection to a standalone discussion in 3.1.
With our SAGAbg-SMF sample in hand, we can now consider the measurement of the SMF as a function of environment. For readability, we separate our discussion of SMF measurement methodology into several sections: first, we construct corrections for spectroscopic incompleteness in 3.1. These corrections account for the multivariate spectroscopic selection function imposed by the SAGA photometric targeting scheme. This targeting scheme means in practice that the SAGA Survey targeting deviates substantially from a magnitude-limited sample for which classical \(1/V_{\max}\) corrections — that is, corrections based on the maximum volume \(V_{\rm max}\) over which some galaxy can be observed — are valid.
Next, we estimate corrections for photometric surface brightness incompleteness in 3.2. These corrections account for the existence of galaxies within the survey footprint at our mass and redshift range of interest that were not included in the original photometric targeting catalogs. As we will demonstrate, the photometric completeness correction is small compared to the spectroscopic completeness correction (and does not enter into our estimate of the isolated dwarf quenched fraction), but can play a significant role in setting the slope of the SMF at the lowest masses we consider in this work. In 3.3, we derive corrections for the size of the SAGA Survey on-sky footprint.
Finally, in 3.4, we bring the considerations developed throughout the section together to construct the framework with which we measure the stellar mass function of the SAGAbg-SMF sample. To guide the reader, we include a schematic figure of this methodology in 8 that summarizes the selection and correction functions constructed to derive the SMFs presented in this work.
rccl SAGAbg-A && 28281 &
SAGAbg-SMF & All & 3097 & 2.4
&Field & 1562 & 2.3
&Satellite & 1093 & 2.3
&Isolated & 908 & 2.3
The SAGA Survey targeting strategy was multidimensional, with tiered targeting priorities based on host proximity, surface brightness, \(r\)-band magnitude, and \(g-r\) color. The targeting selection also changed substantially over the course of the survey as the photometric properties of the SAGA satellites became better understood [11], [12], [48].
As such, we modify the traditional \(1/V_{\rm max}\) method to calculate incompleteness corrections by computing the effective comoving volume, \(V_{\rm eff}\), over which each individual galaxy in the sample would have been included in the sample from the probability that it would have been included as a redshifted SAGA background galaxy, \({\rm Pr[obs}|\vec{\theta}_i(z)]\), given its expected photometric properties as a function of redshift \(\theta_i(z)\).
The SAGA host volumes are explicitly excluded from the SAGAbg sample, and so we include an additional term \({\rm Pr[SAGAbg}|z]\) such that \({\rm Pr[SAGAbg}|z]=0\) within the redshift range corresponding to the SAGA satellite volume. This yields an effective volume defined as:
\[\label{e:veff} V_{{\rm eff},i} = \int_0^{z_{\rm max}} \frac{dV}{dz}{\rm Pr[SAGAbg}|z]{\rm Pr[obs}|\vec{\theta}_i(z)] dz,\tag{2}\] where \(dV/dz\) is the derivative of comoving volume with respect to redshift and \(z_{\rm max}=0.05\). \(\vec{\theta}_i(z)\) refers to the predicted photometric properties of the \(i\)th galaxy in the sample as a function of redshift: \[\vec{\theta}_i(z)=\left[ \mu_{r}^p(z,\mu_{r,i}), m_{r}^p(z, m_{r,i}),(g-r)^p(z, (g-r)_i)\right]\] where the \(p\) superscript indicates the predicted values of the \(r\)-band surface brightness, \(r\)-band magnitude, and \((g-r)\) color as a function of redshift. Here, the redshift dependence of the photometric properties reflects observational considerations and not physical redshift evolution. In the magnitude-limited case, our definition of \(V_{\rm eff}\) becomes equivalent to the classic \(V_{\rm max}\), modulo the exclusion of the SAGA host volume.
We empirically determine the spectroscopic success probability over photometric parameter space, \({\rm Pr[obs}|\vec{\theta}]\), by measuring the fraction of galaxies in the base photometric catalog that were redshifted (SAGA DR3 redshift quality flag of ZQUALITY\(\geq3\); see [11]) in bins of surface brightness (bins defined as \(16<\mu_r<26\) mag arcsec\(^{-2}\), \(d\mu_r=0.5\)), magnitude (\(10<m_r<21\), \(dm_r=0.55\)), and \((g-r)\) color (\(-0.2<g-r<2\), \(d(g-r)=0.11\)). The success fraction is shown in 2 in each projection of the three-dimensional space considered. In each panel, the size of the marker reflects the relative number of sources with the given photometric properties in the SAGA base catalog. The black contours show the distribution of the SAGAbg-SMF sample as observed, while the black arrow in the bottom left indicates the average direction that a galaxy moves as a function of redshift due to observational effects.
As expected, the SAGA Survey completeness shows a substantial color dependence; redder galaxies are less likely to be included in the sample. This is unsurprising given that: (1) the photometric targeting strategy of the SAGA Survey prioritizes bluer galaxies that are more likely to be low-redshift satellites of SAGA hosts [11], and (2) red galaxies are more likely to be quiescent galaxies for which reliable redshifts must be based on absorption features, which have a lower success rate at fixed object magnitude.
Below the SDSS spectroscopic targeting limit, the SAGA Survey is more incomplete for higher surface brightness objects. These high surface brightness galaxies are ostensibly easier to spectroscopically confirm, however, the SAGA Survey was substantially less likely to target them because the probability for these objects to be satellites of a SAGA host is low. The low completeness of high surface-brightness objects can then be understood as a photometric selection choice, rather than an observational constraint. With an empirical mapping of \({\rm Pr[obs}|\vec{\theta}]\) in hand, we can now consider the observational redshift dependence of the photometric properties of the galaxies in our sample. Our estimate of \(\vec{\theta}_i(z)\) includes three observational effects: the \(1/r^2\) drop-off in spectral flux, the redshift dependence of observed angular size, and the \(g\) and \(r-\)band \(k\)-corrections for each galaxy. We estimate \(k\)-corrections for each galaxy as a function of redshift using the photometric relations of [57]. Between \(z=0.005\) and \(z=0.05\), the change in \(r\)-band magnitude dominates (\(\langle |\Delta m_r|| \rangle=6.7\)) over the change in \(r\)-band surface brightness (\(\langle |\Delta \mu_r|\rangle = 1.1\)) and \(g-r\) color (\(\langle \Delta |g-r|\rangle = 0.02\)).
The probability that a galaxy will be classified as a SAGA satellite, \({\rm Pr[SAGAbg}|z]\), is straightforward to compute: for each galaxy, we exclude redshifts within 275 \({\rm km\ s^{-1}}{}\) of the SAGA host in whose footprint the galaxy was observed.
Unlike and of this series, which dealt with the relative selection function of the SAGA background galaxies as defined with respect to the lowest redshift sub-sample of the SAGA background galaxies, the work at present concerns the absolute completeness of the SAGAbg-SMF selection function.
As such, we consider whether a correction is neeeded for galaxies that were not identified as sources in the DESI Imaging catalog, which was used as SAGA’s original photometric targeting catalog. To do so, we construct a simple model based off of the observed mass-size and mass-to-light ratio distributions of the SAGAbg-SMF galaxies to predict the distribution of apparent magnitude and surface brightness for the low-mass end of SAGAbg-SMF sample and estimate the occurrence fraction of galaxies brighter than our spectroscopic targeting limit of \(m_r<20.75\) that may be missing from our photometric targeting catalog.
Here we sketch out the form of the model and incompleteness correction results; a full explanation of this incompleteness exercise can be found in 9. To estimate the relative number density of galaxies brighter than our spectroscopic targeting limit that are below our photometric catalog detection limits, we would like to know the distribution in apparent magnitude and surface brightness of the underlying galaxy population. And more specifically, at a 5\(\sigma\) limiting depth of \(m_r=23.9\) [58], the photometric catalogs are substantially deeper than our spectroscopic limit in apparent magnitude; as such, we focus on the expected surface brightness incompleteness.
We forward model the joint distribution over stellar mass, surface brightness, and redshift of the population using the size-mass relation of Asali et al., 2025 (submitted) and an assumed mass-to-light ratio of \({\rm M_\star}/L_r=0.9 {\rm M}_\odot/L_\odot\) (the median value of the observed SAGAbg-SMF galaxies). Through Monte Carlo resampling, we can then estimate the probability that galaxies at \(m_r<20.75\) of a given stellar mass and redshift will be fainter than our nominal surface brightness limit of \(\mu_{r, lim}=24.9\) mag arcsec\(^{-2}\), as approximated from the DESI Legacy Imaging survey depth and seeing.
This model for photometric incompleteness explicitly assumes a lognormal form to the mass-size relation; as such, a hidden population of low surface brightness galaxies (LSBGs) could cause an underestimate of our photometric incompleteness. Such a possibility has been explored in depth by [11], wherein it was concluded via a comparison to the Systematically Measuring Ultra-diffuse Galaxies [59] catalog as well as visual inspection of the SAGA fields that no substantial population of LSBGs brighter than the spectroscopic targeting limit was missed in the SAGA photometric catalogs.
We find that the effects of photometric incompleteness are small down to our stellar mass limit of \(\rm M_\star = 5\times10^6 M_\odot\). Indeed, when integrated over redshift, the photometric incompleteness correction factor is no more than 1.4 at \({\rm M_\star>5\times 10^6 M_\odot}\), and no more than 1.1 at \({\rm M_\star>5\times 10^8 M_\odot}\). This completeness correction for any given mass bin is smaller than the counting uncertainty incurred in that bin. However, because there is a mass dependence to the correction, we choose to include it in our SMF analysis despite the fact that the spectroscopic completeness correction dominates at all masses.
Because the main science goal of the SAGA Survey was to complete a census of the satellite systems around nearby MW-like hosts, the survey has a very high completeness of targets at \(10<r_{\rm proj}<300\) kpc in projected distance from a SAGA host. This high spatial completeness makes estimating the total area of the SAGA Survey straightforward: the total area is simply the sum of the on-sky area between \(10<r_{\rm proj}<300\) kpc from each host for a total of \(\Omega_{\rm SAGA}=86.67\) deg\(^2\) over 102 hosts (101 SAGA Survey DR3 hosts and 1 additional complete host). This produces a final sample of 3097 galaxies in our SAGAbg-SMF sample, as tabulated in [t:samplenumbers].
We define the stellar mass function \(\phi(\rm M_\star)\) to be the differential number density of galaxies with respect to \(\log_{10}({\rm M_\star/M_\odot})\). In practice, we bin our galaxy sample by stellar mass such that a bin contains no fewer than 10 objects and is no narrower than \(\Delta m \equiv \Delta(\log_{10}(\rm M_\star/M_\odot))=0.1\). In each bin, the differential number density of the \(i^{\rm th}\) bin, \(\phi(\rm M_{\star,i})\), is:
\[\phi({\rm M}_{\star,i}) = f_{\rm phot}({\rm M}_{\star,i})\frac{\Omega_{\rm sky}}{\Omega_{\rm SAGA}}\sum_j^N \frac{1}{V_{{\rm eff},j} \Delta m_i},\] where \(\Omega_{\rm sky} = 4\pi\) sr, \(j=0,1,...,N\) indexes the \(N\) galaxies within the \(i^{\rm th}\) stellar mass bin and \(\Omega_{\rm SAGA}\) is the on-sky footprint of the SAGA Survey as discussed in 3.3.
We consider two sources of uncertainty in our differential number densities: counting uncertainties due to the finite size of our galaxy sample, and weighting uncertainties due to our empirical method of determining the SAGAbg-SMF selection function.
To estimate counting uncertainties on \(\phi(\rm M_\star)\) when the raw number of galaxies observed in the sample is small or zero, we model the posterior probability of the underlying counts \(\lambda\) as a Gamma distribution with shape parameter \(k+0.5\) for \(k\) observed counts. This formulation is proportional to a Poisson distribution centered on \(k\) observed counts with a Jeffreys’ prior where \(P(\lambda)\propto\lambda^{-1/2}\), allowing us to defined reasonable confidence intervals when the number of observed galaxies is low.
Because our photometric catalog is finite in size, this empirical method to estimate the spectroscopic completeness function is itself uncertain; we quantify this uncertainty as the 68% confidence interval on the proportion of galaxies with a high-quality redshift assuming a Jeffreys prior on a binominal counting population. We propagate this uncertainty in the individual galaxy weights to our measurement of the SMF as: \[\begin{align} \sigma_{upper, i}^2 &= \sqrt{\sigma_{{\rm weights},i}^2 + \left[W_i\left(\frac{f(0.84,C_i + 0.5)}{C_i} - 1\right)\right]^2},\\ \sigma_{lower, i}^2 &= \sqrt{\sigma_{{\rm weights},i}^2 + \left[W_i\left(1 - \frac{f(0.16,C_i + 0.5)}{C_i} \right)\right]^2}, \end{align}\] where for each stellar mass bin \(i\), \(\sigma_{{\rm weights},i}\) is the uncertainty of the weights themselves, \(C_i\) is the observed galaxy counts in the \(i^{\rm th}\) bin, \(W_i\) is the sum of the weights associated with the galaxies in the \(i^{\rm th}\) bin, and \(f(\alpha,k)\) is the percent point function of the Gamma distribution with \(k=C_i+0.5\).
It should be noted that this is an ad hoc method to preserve asymmetric uncertainties; in our case, we find that uncertainty budget is strongly dominated by the SMF counts themselves over the uncertainties from the weights and so \(\sigma_{SMF} \approx \sigma_{\rm counting}\). This statement can be seen visually in 2: the SAGAbg-SMF sample (black contours) occupies regions of photometric parameter space that are well-sampled by the photometric and spectroscopic catalogs. Thus, the uncertainty in our estimate of galaxy differential number densities is dominated by the counting statistics on the galaxies themselves, rather than our uncertainty on the volume over which each galaxy would be observed by the SAGA Survey.
Having established our measurement of the SMF from the SAGA Survey, we now proceed with three steps. First, we fit standard functional forms to facilitate comparison with existing literature. Second, we present these results in a format that will make our measurements more accessible for future studies. Finally, we apply abundance matching techniques to estimate the relationship between stellar and halo mass across our sample.
We fit the SAGAbg-SMF stellar mass functions with a double Schechter function [3], [5], defined as:
\[\begin{align} \phi(M_\star) \equiv \frac{dN}{d\log_{10}M_\star} &= \ln10 \left[ \phi_{0,1}\left( \frac{M_\star}{M_{ch}}\right)^{1+\alpha_1} \right. \\ &+ \left. \phi_{0,2}\left( \frac{M_\star}{M_{ch}}\right)^{1+\alpha_2} \right]e^{\frac{-M_\star}{M_{ch}}}, \end{align}\] where \(\phi_{0,1}\) describes the normalization of the SMF \(\phi(\rm M_\star)\), \(\rm M_{ch}\) its characteristic mass and \(\alpha_1\) its mass dependence. The subscript indicates whether these parameters are associated with the first or second component of the Schechter function.
In our fiducial fitting procedure, we fit a double Schechter function directly to the binned, observed SMF assuming a Gaussian likelihood. We estimate SMF fit parameters using the emcee [60] implementation of Markov Chain Monte Carlo sampling using 32 walkers. Because we do not sample the SMF well at stellar masses above the expected value of \(\rm M_{ch}\), we adopt a Gaussian prior centered on \(\log_{10}(\rm M_{ch}/M_\odot)=10.8\) and \(\sigma=2\) based on literature fits to the SMF at higher masses [1], [2]. We assume a log-uniform prior for \(\phi_{0}\) bounded by \(10^{-4}<\phi_0<10^{-1}\) Mpc\(^{-1}\), and a uniform prior for \(\alpha\) bounded by \(-3<\alpha<0\).
To prevent walkers from settling at local extrema, we reinitialize our sampler after 3000 steps using a normally distributed initial walker position whose standard deviation is set by the difference between the 60th and 40th percentiles for each parameter in the preceding chain. We repeat until convergence is achieved, as indicated by a maximum Gelman–Rubin statistic of no larger than \(R=1.2\) across the fit parameters.
To interpret the implications of our SMF results in the context of the galaxy–halo connection of dwarf galaxies, we make an estimate of the stellar-to-halo mass relation of our field galaxy sample via simple abundance matching of halo and galaxy number densities.
Following classical abundance matching [61], we map halo masses to stellar masses by finding the halo and stellar mass pair (\(M_{\rm peak}\),\(M_\star\)) such that \[\rm \int_{M_{\rm peak}}^\infty \phi_{h}(M)dM = \int_{M_\star}^\infty \phi_\star(M)dM,\] where \(\phi_h({\rm M},\epsilon)\) and \(\phi_\star({\rm M})\) are the peak halo mass and stellar mass functions, respectively. The peak halo mass is the maximum halo mass reached by each halo and subhalo over cosmic time. In our fiducial method, we assume that there is no scatter in the SHMR (that is, exact top-down galaxy-halo matching). We have additionally verified that allowing for scatter in the SHMR does not change the inferred SHMR at the 1\(\sigma\) level, but that the SAGAbg-SMF sample does not provide meaningful constraints on this scatter, which is an expected result given the size of the sample [62], [63]. Thus, though we note that the log-normal constant scatter model allows for SHMR scatter that is consistent with previous results [19], we choose to adopt the fiducial model in the remainder of this work.
We derive our halo mass function from the BolshoiP simulation, which has a box size of 250 Mpc/\(h\) on a side with a mass resolution of \(1.55\times 10^8 {M}_\odot/h\).2 Because this value is not small with respect to the lowest halo masses probed by the sample, we choose to fit a functional form to the simulated HMF rather than using the HMF directly. We also choose to use a functional form in our abundance matching approach because the simple matching scheme we adopt here does not account for environmentally driven mass loss that has been shown to be necessary for subhalo abundance matching [63]–[65].
The form of the halo velocity (and mass) function adopted by [66] for the bolshoip simulations behaves approximately as a power-law at the expected halo masses of the SAGAbg-SMF sample, which are much lower than the characteristic halo velocity (mass) at the knee of the halo mass function. We thus adopt fit a power-law form to the HMF over \(10.5<\rm M_h/M_\odot< 11.5\), the mass regime where the BolshoiP simulations are well-resolved (\(\rm M_h\gtrsim 10^9 M_\odot\)). Below this minimum halo mass, we assume that the HMF is well-approximated by an extrapolation of the best-fit power-law down to \(\log_{10}(\rm M_h/M_\odot)\sim 9\).
In the main text, we perform abundance matching for the full SAGAbg-SMF sample – in 12, we carry out an additional exercise wherein we apply a modified abundance matching technique to the SAGAbg-SMF field galaxies alone.
cccccccc SAGAbg-SMF: all & 6.7 & 11.3 & \(0.97^{+0.62}_{-0.38}\) & \(-1.37^{+0.05}_{-0.06}\) & \(2.30^{+1.86}_{-1.04}\) & \(-0.41^{+0.29}_{-0.39}\) & \(11.21^{+0.55}_{-0.45}\)
SAGAbg-SMF: satellite & 10 & 7 & \(0.64^{+0.70}_{-0.36}\) & \(-1.32^{+0.11}_{-0.13}\) & \(3.13^{+3.39}_{-1.74}\) & \(-0.52^{+0.38}_{-0.36}\) & \(10.81^{+0.83}_{-0.96}\)
SAGAbg-SMF: field & 6.7 & 10 & \(0.49^{+0.59}_{-0.27}\) & \(-1.44^{+0.09}_{-0.09}\) & \(2.42^{+3.81}_{-1.52}\) & \(-0.41^{+0.29}_{-0.39}\) & \(10.99^{+0.69}_{-0.93}\)
SAGAbg-SMF: isolated & 6.9 & 10 & \(0.28^{+0.33}_{-0.14}\) & \(-1.48^{+0.11}_{-0.13}\) & \(1.45^{+2.14}_{-0.93}\) & \(-0.42^{+0.30}_{-0.37}\) & \(10.68^{+0.77}_{-0.79}\)
[5] (GAMA, all) & 6.75 & 11.5 & \(0.63^{+0.10}_{-0.09}\) & \(-1.53^{+0.03}_{-0.03}\) & \(3.65^{+0.14}_{-0.13}\) & \(-0.47^{+0.07}_{-0.07}\) & \(10.75^{+0.02}_{-0.02}\)
[2] (GAMA, all) & 8 & 11.6 & \(0.8^{+0.23}_{-0.23}\)& \(-1.47^{+0.05}_{-0.05}\) & \(4.0^{+0.34}_{-0.34}\) & \(-0.4^{+0.18}_{-0.18}\) & \(10.66 ^{+0.05}_{-0.05}\)
[4] (GAMA, all) & 7.5 & 11.7 & \(0.6^{+0.1}_{-0.1}\)& \(-1.5^{+0.15}_{-0.15}\) & \(2.9^{+0.4}_{-0.4}\) & \(-0.62^{+0.15}_{-0.15}\) & \(10.8 ^{+0.2}_{-0.2}\)
[1] (SDSS) & 8.8 & 11.8 & \(0.66^{+0.09}_{-0.09}\) & \(-1.6
^{+0.12}_{-0.12}\) & \(4.0^{+0.12}_{-0.12}\) & \(0.52^{+0.04}_{-0.04}\) & \(10.67^{+0.01}_{-0.01}\)
[10] (M31 sats) & 4.1 & 8.7 &– & \(-1.5^{+0.1}_{-0.1}\) & – & – & –
[20] (MW sats) & 2.5 & 7.4 &– & [-1.46, -1.39] & – & – & –
[7] (M101 sats) & \(5.2\) & \(10.3\) & &– & \(-1.1^{+0.1}_{-0.1}\) & – & –
[8] (Cen A sats) & \(5.1\) & \(10.3\) & &– & \(-1.1^{+0.2}_{-0.2}\) & – & –
[6] (M81 sats) & \(\sim 5\) & \(\sim 10.5\) & &– & \(-1.27^{+0.04}_{-0.04}\) & – & –
[67] (GAMA groups) & 8.3 & 12 & – & \(-1.5^{+0.13}_{-0.13}\) & – & \(-0.81 ^{+0.04}_{-0.04}\) & \(10.87 ^{+0.01}_{-0.01}\)
[3] (SDSS sats) & 9 & 11.5 & \(0.04^{+0.08}_{-0.03}\) & \(-1.83^{+0.23}_{-0.23}\) & \(1.2_{-0.9}^{+3.7}\) & \(-0.84^{+0.08}_{-0.08}\) & \(10.71^{+0.02}_{-0.02}\)
[68] (Virgo cluster) & 7 & 11.8 &– & \(-1.35^{+0.02}_{-0.02}\) & – & – &\(11.33^{+0.17}_{-0.11}\)
[69] (Perseus cluster) & 7 & 11.5& – & \(-1.20^{+0.02}_{-0.02}\) & – & – & \(11.2^{+0.1}_{-0.1}\)
[67] (GAMA ungrouped) & 8.3 & 12 & – & \(-1.57^{+0.03}_{-0.03}\) & – & \(-0.22 ^{+0.06}_{-0.06}\) & \(10.41 ^{+0.01}_{-0.01}\)
[3] (SDSS centrals) & 9 & 12 & \(0.9_{-0.20}^{+0.25}\) & \(-1.16
^{+0.05}_{-0.05}\) & \(1.6^{+0.9}_{-0.6}\)& \(-0.51 ^{+0.09}_{-0.09}\) & \(10.80 ^{+0.01}_{-0.01}\)
From our measurement of the SMF of isolated low-mass galaxies in 3, we can also measure the volume-corrected quenched fraction of these isolated systems. We follow [49] in our definition of quenched; that is, we require that quenched galaxies have an H\(\alpha\) equivalent width of \(\rm EW(H\alpha)<2\;\AA\), and a NUV specific star formation rate of \(\rm SFR_{NUV}/M_\star < 10^{-11}\;yr^{-1}\). Conversely, we classify a galaxy as star-forming if either star formation indicator surpasses its threshold for quenched classification. We restrict our analysis to \(\log_{10}({\rm M_\star/M_\odot})\) \(\geq 8.5\); below this stellar mass, the uncertainty on measured H\(\alpha\) equivalent widths for at least half of the considered spectra are greater than \(2\rm \AA\).
Unlike the SAGA satellite sample and SAGAbg-A sample analyzed in and of this series, not all of the SAGAbg-SMF galaxies have an associated spectrum, as non-satellite galaxies with secure archival redshifts were not necessarily re-observed. As such, for this analysis we consider only the intersection between the SAGAbg-A and SAGAbg-SMF samples, reducing the number of isolated galaxies considered from 936 to 803. To bolster this number, we additionally include the \(z<0.05\) isolated galaxies from the equatorial GAMA survey in the survey regions that do not overlap with the SAGA footprint, adding 2347 galaxies to the sample.
The effective volume \(V_{\rm eff}\) over which each GAMA galaxy is observable is computed in the same way as the SAGAbg-SMF SMFs described in 3, where the GAMA stellar masses and photometric properties are calculated as in 2.4, and the spectroscopic completeness function is calculated as the ratio between the differential surface density of spectrosopic targets with respect to \(r\)-band magnitude in GAMA and that of the deeper SAGAbg sample down to the SAGAbg magnitude limit. In practice, this approach yields a sigmoidal completeness function with a boundary near the GAMA magnitude limit (best-fit logistic function parameters: \(m_{r,0}=19.8\), \(k=1.5\) where the completeness can be described as \({\rm Pr[GAMA}|m_r] = 1-(1 + \exp\{-k(m_r-m_{r,0})\})^{-1}\).
With these completeness functions in hand, we can directly calculate the volume-corrected quenched fractions of the SAGAbg-SMF and GAMA samples as a function of stellar mass. We find no evidence for a statistically significant shift between the surface brightness distribution of the star-forming and quenched dwarfs in the SAGAbg-SMF at fixed mass across environments, and so do not apply a photometric correction to either survey. Each galaxy in the sample is weighted by its corresponding effective volume in our calculation of the quenched fraction. We will return to this metric and its implications in 6.6.
Here we present the stellar mass function of SAGAbg-SMF galaxies as a function of environment down to \(\rm M_\star \sim 5\times10^6 M_\odot\) (5.1); this measurement further enables a measurement of the stellar-to-halo mass relation of field dwarfs (5.2).
We show the stellar mass function of the full, field, satellite, and isolated SAGAbg-SMF samples of 3, sequentially from the top panel. In each panel, our SMF measurements are shown by vertical bars (where the dark- and light-shaded bars show the 68% and 95% confidence intervals, respectively), whereas the dark- and light-shaded regions show the 68% and 95% confidence intervals, respectively, of a double Schechter model fit to the data.
We also show the estimated SMF without volume corrections as grey errorbars in each panel (that is, the SMF one would arrive at by assuming \(V_{\rm eff} = V(z=0.05)\) for all galaxies where \(V(z)\) denotes comoving volume out to redshift \(z\)). In this mass range, the photometric completeness correction is small (\(f_{\rm phot} \sim 2\)) relative to the volume correction. However, we note that the two address different effects: the former accounts for galaxies missing from the parent photometric sample, while the latter adjusts the volume over which number densities are computed to reflect the actual detection limit for each galaxy. A further discussion of the mass-dependence of SAGAbg-SMF effective volume and the raw galaxy counts for each subsample are given in 10.
In the top panel, we compare the SMF of the full SAGAbg-SMF sample to previous measurements of the SMF from other spectroscopic surveys. This represents an important consistency check if we are to then consider SMFs based on environmental classifications. As shown by the red points, our measured stellar mass function is consistent with lower limits estimated from previous GAMA [2] and Local Volume [70] estimates, as well as the most recent GAMA survey measurement [5].
There is evidence for a local cosmic under-density at comoving distances less than \(\sim 100\) Mpc [71], which encompasses a substantial fraction of our total redshift range. It has also been recently suggested by [72] using an analysis of photometric objects near DESI Year One spectroscopic targets that the stellar mass function may be more steeply rising that is implied by surveys whose low-mass SMF is primarily constrained by nearby galaxies. To estimate whether local large-scale structure in the vicinity of the Local Group and the presence of the SAGA hosts may bias our measurements, we compute the stellar mass function of the full SAGAbg sample at \(0.03<z<0.1\) in 13 using the same methodology presented above. At this higher redshift range, our limiting stellar mass rises to \(\rm M_\star \sim 10^8 M_\odot\); in the overlapping stellar mass range, we see good agreement between the SAGAbg-SMF SMF and that of the SAGAbg galaxies at \(0.03<z<0.1\), indicating that the local under-density does not significantly affect our measured SMF.
To place our measured SMFs in literature context, in [t:smf] we give the best-fit parameters for the SAGAbg-SMF fit to a double Schechter function along with recent results from spectroscopic surveys (middle section) and recent measurements of the low-mass SMF as a function of environment (lower section, ordered from highest to lowests density environments).
We caution that a direct comparison of parameter inferences should be tempered by two sources of systematic uncertainty: the mass range of each study, and possible mass-dependent stellar mass estimation biases. First, for the observed range of SMF shapes seen in the literature, the low-mass slope dominates the overall shape of the SMF only at \(\rm M_\star \lesssim 10^8\;M_\odot\). Second, as noted by [4], a mass-dependent bias in stellar mass estimations can introduce substantial systematic uncertainty in the slope of the SMF – it has furthermore been shown recently that such mass-dependent biases may be exacerbated in the dwarf mass regime [53].
Nevertheless, it is instructional to compare our results to those in the literature. Two main points emerge: first, we find broad consistency between our results and those from the literature to the level of existing study-to-study variation. The faint-end slope that we infer (\(\alpha_1\)) at \(\rm M_\star> 5\times 10^6 M_\odot\) from our double Schechter fit is fairly consistent with previous measurements from GAMA [2] and SDSS [1] (zero lies within the 73% and 84% credible intervals, respectively, of the difference between our posterior for \(\alpha_1\) and their reported values, assuming Gaussian uncertainties for literature values). We do find a significantly shallower low-mass slope than [5], though we note that our directly measured SMFs are consistent, as shown in 3.
Second, there is no clear evidence in the literature for a correlation between environment and the faint-end SMF slope in the mass range explored by the SAGAbg-SMF sample. With our environmentally-stratified SAGAbg-SMF sample, we now consider the question of environmental dependence for a set of homogeneously selected and measured SMFs.
In the second panel of 3, we show the stellar mass function of satellite galaxies. Again, the SAGA satellites themselves are explicitly excluded from the sample — the satellites considered here are associated with galaxies other than the SAGA hosts. We overplot the SDSS-based results of [3] as the aqua curve, and the scaled results of the SAGA satellite mass function [11] as aqua points. Scaling the SAGA satellite mass function is required because the number densities of the SAGA satellite mass function refer specifically to the volume within 300 kpc of a MW-like host (as opposed to the stellar mass function, which is the number density of galaxies that are satellites of some host).
We find generally good agreement in the three mass functions in the region of stellar mass overlap. At \(\rm M_\star \gtrsim10^{10}M_\odot\), the divergence between the [11] and [3] results is expected from the choice of SAGA hosts (that is, a “satellite” of a MW-analog that exceeds the stellar mass of the MW-like host is ill-defined). One may note from [t:smf] that the measured low-mass slope of our SMF differs significantly from that of [3] as a function of environment, but we emphasize that given the stark difference in mass range covered between the two samples, one should compare results as a function of directly predicted differential number density rather than best-fit values of \(\alpha_1\).
In the third panel of 3 we show the stellar mass function of only low-mass field galaxies, as defined in 2.3. By definition, here, our stellar mass function constraints truncate at \(\rm M_\star=10^{10}M_\odot\). We again overplot the measurements of [3], now for central galaxies, in red. We find a somewhat lower field galaxy SMF at the high-mass end of our sample (that is, the [3] SMF is greater than the 95% CI of our estimated number densities for \(\rm M_\star \gtrsim 10^{9.2} M_\odot\)), but we note that this may be due to the difference in our definition of “field” for galaxies that approach our massive host stellar mass limit.
Finally, in the lower panel of 3, we show the SMF of only isolated low-mass galaxies. As a reminder, these are a subset of field galaxies that are at least 1.5 Mpc in projected distance and 1000 \({\rm km\ s^{-1}}{}\) in velocity difference from their nearest massive neighbor; these distance criteria follow those presented in [54] as the threshold separation at which environmental processing from massive hosts is negligible. We find no statistically significant difference in the shape of the isolated SMF as compared to the field SMF, consistent with a minimal impact of environmental processing on stellar mass assembly in the field sample.
cccccc 6.75 & 9.93 & 9.99 & 10.05 & 10.11 & 10.16
7.25 & 10.16 & 10.20 & 10.24 & 10.29 & 10.33
7.75 & 10.36 & 10.40 & 10.44 & 10.47 & 10.51
8.25 & 10.55 & 10.59 & 10.63 & 10.67 & 10.72
8.75 & 10.74 & 10.79 & 10.84 & 10.89 & 10.95
9.25 & 10.93 & 10.99 & 11.06 & 11.12 & 11.19
9.75 & 11.13 & 11.22 & 11.31 & 11.38 & 11.47
As described in 4.2, we estimate the stellar-to-halo mass relation implied by our measured stellar mass functions using abundance matching.
In 4, we compare the results of our SHMR to stellar-to-halo mass relations estimated via semi-analytic and empirical modeling at left, and to subhalo abundance matching of nearby satellite systems at right. These two categories do not represent completely disjoint sets, but the visual separation is created to allow the reader to distinguish broadly between works that infer the SHMR using semi-analytic or empirical modeling across environments (top), subhalo and forward modeling of the MW/M31 systems (left), and satellite systems beyond the MW/M31 pair (right).
In 5, we compare our inferred SHMR via abundance-matching to the state of the observational literature from kinematic [17], [44]–[46] and weak lensing [47] measurements. The most notable feature in our comparison is that our inferred SHMR lies systematically below mass modeling estimates at \(5\times10^6 \lesssim \rm M_\star \lesssim 5\times 10^7 M_\odot\) — that is, our SHMR implies lower integrated galaxy formation efficiencies (\(\rm M_\star/M_{halo}\)) than kinematic estimates. We will explore this divergence in greater detail in 6.2. At higher masses, the SAGAbg-SMF SHMR is in overall good agreement with weak lensing and high-mass kinematic measurements at the level of existing study-to-study scatter in the literature.
We report a non-parametric description of our posterior distribution over the stellar-to-halo mass relation in [t:shmr]. The best-fit power-law3 that corresponds to this relation is: \[\label{e:shmr} \rm \log_{10}(M_{halo}) = 0.40\log_{10}(M_\star)+7.36,\tag{3}\] for a stellar mass range of \(5\times10^6\lesssim \rm M_\star <10^{10}\).
With our measurements of the galaxy stellar mass function, dwarf quenched fraction, and stellar-to-halo mass relation in hand, we now consider the implications of our measurements on the contemporary picture for dwarf galaxy star formation and assembly.
As summarized visually in 3, we find that the low-mass slope (\(\alpha_1\) in [t:smf]) of the galaxy SMF is relatively insensitive to environment at the precision of our SMF measurements; we find that \(\alpha_1\) is statistically consistent between the full, satellite, and field measurements of the SAGAbg-SMF SMF. This finding is in agreement with a recent analysis of the GAMA survey at higher masses (\(\log_{10}({\rm M_\star/M_\odot})\) \(\gtrsim 8\)) who find the slope of the low-mass SMF to be insensitive to whether or not galaxies reside in groups [67].
This minimal environmental dependence of the low-mass slope of the SMF is a result that has been implied by previous cross-sample analyses of satellite systems [11] and clusters [68], as well as a comparison between our results and the recent literature measurements of environmentally partitioned SMFs in [t:smf], but is here shown with a homogeneous approach in both source data and analysis.
Although the magnitude of the change between any given environmental bin is not statistically significant, we do see weak evidence for a steepening low-mass slope as a function of distance from a massive host: when drawing from the posterior distribution of \(\alpha_1\) for each environmental classification considered, the satellite SMF (\(\alpha_1=-1.32\pm0.12\)) is the steepest in only \(\sim\! 5\%\) of draws. Because star formation activity in dwarfs correlates strongly with environment, we now consider whether a difference in slope between the satellite and isolated dwarf samples can be explained by a correlation between star formation and low-mass SMF slope.
Using the same star formation rate indicators as described in 4.3, we classify all available SAGAbg-SMF galaxies as either star-forming or quenched, and measure the resulting SF-dependent SMFs with the same methodology as 3. The full details of this exercise can be found in 13; we find that the low-mass slope of the star-forming SMF, as well as the star-forming satellite SMF, are consistent with \(\alpha_1\) measured for the field and isolated SAGAbg-SMF samples. The quenched galaxies, meanwhile, are consistent with (and formally slightly shallower than) \(\alpha_1\) measured for the satellite SAGAbg-SMF SMF.
We therefore suggest that the slight change in slope between the isolated and satellite SAGAbg-SMF SMFs is attributable to the difference in star formation properties of the galaxies in these subsamples. This finding is consistent with work in literature with higher mass galaxies, where it was found that has found that the SMF at \(\log_{10}(\rm M_\star/M_\odot)\gtrsim 9\) steepens for galaxies with higher specific star formation rate increases [3] or bluer optical colors [2]. We note that recent work on the Virgo cluster has found the opposite trend [68]; though a unified analysis would be necessary to demonstrate this result, such a reversal could reflect a difference in the mass-dependence of the dominant quenching mechanisms experienced in clusters versus lower-density environments.
Our abundance matching-estimate of the dwarf SHMR extends down to the lowest stellar masses for which stellar-to-halo mass ratios have been measured via kinematic modeling. As discussed above, this sensitivity has revealed an apparent tension, wherein our abundance matching-derived SHMR implies a steeper SHMR than previous mass-modeling results.
Possibilities for such an apparent discrepancy have been previously reported in the literature. For example, [73] proposed that the inherent prerequisite of gas-richness in HI kinematic modeling biases the low-mass end of HI mass-modeling samples towards unusually late-forming systems with anomalously high stellar-to-halo mass ratios, while [74] suggested that the over-representation of LG dwarfs in studies could bias these samples towards galaxies with more tidal processing. Similarly, [75] suggested that sampling individual dwarf halo masses at fixed stellar mass biases the apparent SHMR to low-mass halos due to the steeply declining HMF.
Another possibility, of course, is that the assumptions we make to perform abundance matching begin to break down at the low-mass end of our sample. This potential breakdown could include factors such as a rising, mass-dependent scatter in the dwarf SHMR (as opposed to our test of a constant lognormal scatter) or other violations to the assumption that stellar mass is rank-ordered with halo mass. Previous work predicts, for example, that an extrapolation of our best-fit SHMR down to lower mass (\(\rm M_{halo} \lesssim 3\times10^9\;M_\star\)) would likely be invalid due a steep decline in the fraction of halos that actually host a galaxy, breaking the rank-ordered assumption [76]. It could also be the case that we under- or over-estimate the slope of the stellar or halo mass functions at the very lowest masses in our sample, where both the relative mass resolution of the simulation from which we estimate the HMF and our observational signal-to-noise in the SMF are lowest.
Though we cannot comment more definitively here on the origin of a possible bias, the hypothesis proposed in previous works that a discrepancy between simulated dwarf SHMRs and individual mass modeling results is partially observational (or selection-related) in nature is supported by the discrepancy between individual field dwarf modeling and our statistical measurement of the field dwarf SHMR.
Our abundance matching-based estimate of the dwarf SHMR extends more than an order of magnitude lower in stellar mass than previous statistical measurements of dwarfs [17], [22] based on shallower SDSS data, allowing the sample to act as a bridge between previous modeling efforts to infer the dwarf SHMR and observational measures of dwarf halo masses.
The literature on abundance matching in the low-mass regime has previously been dominated by higher density environments (satellites, groups, and clusters), where the efficiency of low-mass galaxy identification is bolstered by the known distance of the more massive neighbors. Recent simulations have suggested that the \(z=0\) stellar mass to peak halo mass relations of dwarfs varies with environment due to differences in halo assembly history [77]; as these effects manifest in the relationship between the stellar mass and peak halo mass, the predicted effect is separate from differences in the expected environmental dependence of the \({\rm M_\star}(z=0)/{\rm M_{halo}}(z=0)\) due to preferential mass loss from the dark matter halo from processes such as tidal and ram-pressure stripping [78]. To test this prediction, we compare our abundance matched SHMR of dwarfs to the abundance matching-driven SHMRs inferred for satellites of the Milky Way [20], [75], SDSS groups [17], and nearby MW-like galaxies [65].
We do not find strong evidence for a systematic difference between the SHMR we infer from abundance matching the SAGAbg-SMF sample and that of satellite systems from the literature. This result is consistent with minimal environmental evolution of the low-mass SHMR, though we caution that a direct comparison against literature results may incur substantial systematics due both to the galaxy-halo modeling scheme adopted and to variation in the nature of the SHMR being fit (e.g., MW-like systems in [65], galaxy groups in [17], and the MW itself in [21]). To test whether this result could be attributable to the presence of satellite dwarfs in the SAGAbg-SMF sample, we perform an additional abundance matching exercise to estimate the SHMR of field dwarfs alone and find no statistically significant difference with the SHMR inferred for the full sample (12).
A minimal environmental dependence of the dwarf SHMR is potentially in tension with previous predictions for satellite galaxies to show higher \(\rm M_\star/M_{peak}\) values compared to their field counterparts ([77] using the MARVELOUS simulation suite), as the magnitude of the predicted difference in SHMR at a host separation of 50 kpc (satellite-like) and 1 Mpc (field-like) would nominally be statistically significant given our SHMR uncertainties. That being said, other observational modeling — for example, [79] modeling of the Local Group and Local Field mass functions) — do imply that Local Field dwarfs may be characterized by a steeper SHMR than Local Group satellites.
Due to the variation in SHMR inferred for dwarf systems across the literature, we thus suggest that precisely ruling in or out an environmental dependence of the SHMR will require joint modeling of satellite and field systems where the same methodology for environment selection and SHMR inference is used for the observed and simulated galaxy samples. This should be possible in the near future with the large dwarf galaxy samples that probe the full range of environments avaialable from DESI [14].
As shown in 6, we find that most contemporary cosmological simulations successfully reproduce our dwarf field SHMR down to our limiting mass and precision. Of the numerical works considered, only the NIHAO [80] and EDGE [73] simulations deviate from the 95% confidence interval of our measured SHMR at any mass. The NIHAO SHMR, furthermore, shows a similar scaling to the SAGAbg-SMF SHMR (\(\rm M_{\rm halo}\propto M_\star^{0.43}\) in the NIHAO suite); the log-additive offset from our results may reflect systematics in stellar and/or halo mass calibrations or definitions, rather than differences in galaxy formation physics.
Given the general success of simulations in predicting our observed SHMR, it is worth asking how these simulations capture the physics that modulate \(M_\star/M_{\rm peak}\), and what we can learn from the dwarf SHMR. For dwarfs that are sufficiently massive to have evaded formation suppression from reionization or the cosmic UV background [81], star formation and stellar feedback are thought to be the main modulators of galaxy formation efficiency [30]. The nature of simulated star formation-driven outflows — and, in particular, the multiphase nature of the outflows — that serve to regulate star formation can depend sensitively on numerical resolution. Resolution-based challenges in simulating stellar feedback manifest largely in two regimes: resolving supernova remnant evolution when the outflow is launched [82]–[85], and — for codes that adopt fixed-mass rather than fixed-spatial resolution — when the low-density outflow reaches large galactocentric radii [86], [87].
This is especially true in light of the fact that, to achieve the large volumes necessary to assemble a cosmological sample of simulated galaxies, most of the simulations referenced in 6 invoke some subgrid model or effective equation of state to model stellar feedback and regulate star formation. Thus, while the low-mass SHMR is a key observable that quantifies overall galaxy formation efficiency, there appears to be a multiplicity of avenues through which a realistic SHMR can be generated down to \(\rm M_\star\gtrsim10^6M_\odot\). As such, understanding the mechanisms of the regulation requires a multi-pronged approach with other tracers of feedback, as we will discuss in the following section.
Though it is widely accepted that stellar feedback processes such as supernovae-driven outflows and winds dominate the integrated galaxy formation efficiency of the low-mass galaxy population, the mechanisms behind this regulation are still a highly active area in the literature [88], [89]. Quantifying the relative roles of ejective feedback via strongly mass-loaded winds (characterized by high mass-loading factors \(\eta_m \equiv \rm\dot{M}_{out}/ SFR\)) and preventative feedback via strongly energy-loaded winds (characterized by high energy-loading factors \(\eta_E\equiv \dot{E}_{\rm out}/\rm SFR\)) is in particular crucial to understanding the physical mechanisms that modulate formation efficiency in the dwarf regime.
In of this Series, we demonstrated that the low-mass galaxies in the SAGAbg-A sample at \(z<0.2\) are well-described by a mass-loading factor of \(\eta_m=0.92\). Indeed, there is growing observational support for the idea that bright (\(10^8 \lesssim \rm M_\star \lesssim 10^9\;M_\odot\)) dwarfs are characterized by relatively low (order unity) mass-loading factors relative to the high \(\eta_m\) schemes that many of the above simulations employ [90]–[96].
If we posit that the SAGAbg-SMF sample is described by weakly mass-loaded winds, we can consider further implications for how the galaxy–halo connection is regulated in this mass regime. A complete characterization of the mass- and energy-loading factors of the SAGAbg-SMF galaxies would require a more comprehensive joint analysis with numerical experiments. Here, however, we note that when considered in conjunction with the mass-loading factor measured in , the declining galaxy formation efficiency of the SAGAbg-SMF SHMR is most naturally described by a model in which preventative star formation feedback, cosmic ray pressure and/or high energy-loading factors efficiently regulates galaxy formation at low mass.
This picture aligns well with recent results on preventative and ejective feedback, including work by [97], who use an analytic gas-regulator model to argue that the SHMR is dominated by an energy-loading factor that rises steeply with declining halo mass and only weakly dependent on the mass-loading factor; by [98], who showed that the inclusion of non-thermal pressure support sources such as cosmic rays can support both bursty star formation and moderately (\(\eta_m\sim1\)) mass-loaded outflows; and by [87], who used the Arkenstone wind-launching model to show that regulation via preventative feedback from high-specific energy winds (i.e. star formation efficiently prevents gas from accreting) is able to reproduce the stellar-to-halo mass relation seen in simulations with highly ejective feedback schemes.
The presence of nearby, massive galaxies is understood to play a predominant role in the quenching of star formation in low-mass galaxies; observationally, self-quenching mechanisms have been demonstrated to be highly inefficient at \(\rm M_\star\lesssim 10^9 M_\odot\) [54]. Numerical studies have since suggested that both internal feedback processes [99], large-scale effects from the cosmic web [100], and backsplash populations from massive hosts [101] have the potential to contribute substantively to the assembly of an isolated, quenched dwarf population. Indeed, a growing number of isolated, quenched dwarfs have since been discovered below this mass threshold, but it has remained challenging to pin down quenching mechanisms for individual systems without a broader statistical picture of the quenched population [103]–[108].
The quenched fraction implied by the SAGAbg-SMF and GAMA samples is shown in 7, along with previous measurements from SDSS [54]. Our measurement of the quenched fraction demonstrates the existence of a small population of quenched and isolated dwarfs down to \(\log_{10}({\rm M_\star/M_\odot})\) \(=8.5\), contextualizing serendipitous discoveries and allowing for a comparison to theoretical predictions of \(f_q(\rm M_\star)\), the mass-dependent quenched fraction of isolated systems. We find a total of 13 quenched, isolated galaxies at \(\log_{10}({\rm M_\star/M_\odot})\) \(<10\) between the two samples; two of the SAGAbg-SMF galaxies are additionally at \(\log_{10}({\rm M_\star/M_\odot})\) \(<9\) (all of the GAMA quenched and isolated galaxies have stellar masses of \(10^9<\rm M_\star/M_\odot<10^{10}\)).
In addition to previous observations of quenched, isolated systems, we also compare to semi-empirical modeling results at left, and recent results from cosmological simulations at right. In particular, at left we show the UniverseMachine semi-empirical modeling of quenched, isolated systems by [22], UM-SAGA, which uses both the SAGA satellite data and [54] isolated dwarf sample to update the UniverseMachine environmental quenching model. We find that the UM-SAGA model is broadly consistent with observational results: the 68% confidence interval of the UniverseMachine prediction overlaps with the 68% credible interval of our quenched fraction over the majority of the domain (and our measurement is contained within the 95% CI of the UniverseMachine prediction at all stellar masses considered).
Conversely, mock observations of isolated Illustris-TNG100 and SIMBA [109] made by [99] and star-formation splits of central galaxies in Illustris-TNG50 [100], [101] and FIREBox [102] predict a substantively different trend for \(f_q(\rm M_\star)\). At higher masses, FIREBox, TNG50 and TNG100 successfully reproduces the quenched fraction of isolated galaxies, but predict a turnaround in \(f_q(\rm M_\star)\) at a factor of a few about \(\rm M_\star= 10^9 M_\odot\). SIMBA, conversely, predicts a systematically higher quenched fraction for more massive galaxies but intersects with our measurement just below \(\rm M_\star=10^9 M_\odot\) before hinting at a turnaround in \(f_q(\rm M_\star)\). The other simulation considered in [99], EAGLE [110]–[113] and SIMBA, predicts a similar turnaround behavior of \(f_q(\rm M_\star)\), but maintains an overall higher quenched fraction at all masses (\(f_q>0.1\), above the upper y-limit of 7); an analysis of the NewHorizon simulations by [114] also predict higher quenched fractions (albeit for dwarfs in the field, not necessarily isolated to the degree that we consider), with quenched fraction increasing for decreasing stellar mass at \(\rm M_\star \lesssim 10^{10} M_\odot\).
One caveat to our comparison is potential for systematics originating from the simulation works’ definition of galaxy quenching. [99] created mock spectra from their simulated galaxies to closely emulate the observational selection of [54]; [100] defined quenched galaxies as any system with an instantaneous specific SFR \(<10^{-11}\) yr\(^{-1}\); [101] defined quenched galaxies as any system at least 1 dex below the Star-Forming Sequence at its \(z=0\) stellar mass; and [102] defined quenched as a galaxy with a 100 Myr-averaged specific star formation rate of \(<10^{-10.5}\) yr\(^{-1}\) (they also consider a lower threshold of \(<10^{-11}\) yr\(^{-1}\), which results in a lower \(f_q\) than their fiducial values but is still elevated relative to our observations).
Despite variations in how each work defines the threshold for quenched galaxy classification, we do not see strong evidence that this variation could account for the discrepancy between the simulated and observed quenched fractions. We in particular highlight that the results of [99], who explicitly created mock-observed spectra for a near apples-to-apples comparison with observatiodsns, produces the same trend as seen in the works that define quenching based on intrinsic SFR. We also note that [100] and [101], who both use the TNG50 simulations, produce consistent \(f_q(\rm M_\star)\) relations despite choosing different quenched classifiers.
This consistency indicates that many contemporary cosmological simulations systematically predict a turnaround in self-quenching efficiency at around \(\rm 10^8<M_\star /M_\odot <10^{10}\), resulting in apparent over-quenching of isolated dwarfs at \(\rm M_\star \sim 4\times 10^8 M_\odot\) compared both to our sample and previous upper limits placed by [54]. The apparent over-quenching in simulations is of particular interest because the cosmological simulations considered in 6 are able to broadly and consistently reproduce the dwarf SHMR, which should also trace galaxy formation feedback physics. One potential mechanism to explain both the simulated SHMR and \(f_q(\rm M_\star)\) predictions is that over-vigorous stellar feedback (as suggested in [99]) is coupled with elevated star formation rates in low-mass progenitors at higher redshifts (as suggested observationally by of this Series), allowing for a build-up of a simulated quiescent dwarf population while still producing a \(M_\star(z=0) /M_{\rm peak}\) relation in line with observations.
This being said, however, it is also important to recall the potential role of numerical effects, particularly when the total galaxy mass becomes close to the mass resolution of the simulation4. The potential for a tight coupling between resolution and emergent galaxy properties can be straightforwardly understood for the case of stellar feedback, where both the energetics of individual supernovae and their effect on the multiphase interstellar medium are largely implemented via subgrid models [116].
As such, both variations in the presumed physical models for stellar feedback and the coupled numeric effects of their implementation can substantially impact star formation structure and behavior in simulated galaxy. These factors are expected to be particularly strong at dwarf masses, both because the overall number of resolution elements associated with the galaxy is smaller, and because stellar feedback is expected to dominate galaxy formation efficiency [117]. Indeed, [99] find that increasing simulation resolution can substantially impact their predicted quenched fractions, though the general trend of quenched fraction as a function of mass remains largely intact [99].
High resolution simulations of dwarfs also point towards lower quenched fractions in isolated systems at these masses: the EDGE simulations [73], [118] and MARVEL-ous Dwarfs + DC JUSTICE LEAGUE + ROMULUS25 suite [77] predict ubiquitously star-forming populations down to \(\rm M_\star \sim 10^7 M_\odot\), and FIRE [81] zoom-in simulations of \(\rm M_{halo}\sim 10^{10} M_\odot\) galaxies find two out of 14 galaxies to be self-quenched in the field at \(\rm 10^5\lesssim M_\star/M_\odot\lesssim 10^7\). Similarly, the DARKLIGHT semi-empirical modeling by [118] — which is constrained by both observations and, at \(\rm M_\star \lesssim 10^7\;M_\odot\), by the high-resolution EDGE cosmological simulations [73], [119]) — predicts a quenched fraction of \(f_q=0\) for their model galaxies at \(\rm M_\star \gtrsim 5\times 10^6\;M_\odot\). Assuming binomial counting uncertainties with a Jeffreys’ prior, we arrive at a 68% upper limit of \(f_q\lesssim 0.01\) between \(\rm 10^8 \lesssim M_\star/M_\odot \lesssim 10^9\), which is statistically consistent with our observations. This apparent relation between simulation resolution and isolated dwarf quenched fraction implies that the stellar mass turnaround in \(f_q(\rm M_\star)\) may be impacted by both numerical resolution and the included physics [118].
That cosmological simulations can successfully reproduce the SAGAbg-SMF dwarf SHMR while over-predicting the quenched fraction of isolated dwarfs underscores the complexity of building stellar feedback models to modulate dwarf evolution. This complexity leads to the need for multidimensional observations — such as the mass-loading factor of , the Star-Forming Sequence time evolution of , and the SHMR and quenched fractions presented here — to identify and distinguish between physically reasonable recipes for extragalactic astrophysical processes.
In this work, we leverage the background spectra associated with the Satellites Around Galactic Analogs (SAGA) survey to compute three key diagnostics of the low-mass galaxy population:
The galaxy stellar mass function (SMF) down to \(\rm M_\star \sim 5\times10^6 M_\odot\) separated by extragalactic environment (5.1; 3 and best-fit model parameters given in [t:smf]).
The stellar-to-halo mass relation of field dwarfs down to \(\rm M_\star \sim 5\times10^6 M_\odot\) (5.2, 5 and tabulated in [t:shmr]).
The quenched fraction of isolated field dwarfs with measurements down to \(\rm M_\star = 4\times 10^8 M_\odot\) (6.6; 7).
Our measurements with SAGAbg-SMF provide new statistical leverage to connect \(\rm M_\star/M_{halo}\) measured for individual LV dwarfs with that of previous wide-field surveys, compare the SHMR of dwarf satellites with that of field dwarfs, and contextualize serendipitous discoveries of quenched field dwarfs below the empirical self-quenching mass scale of \(\log_{10}({\rm M_\star/M_\odot})\) \(\sim 9\). In particular:
We observe minimal environmental dependence of the dwarf galaxy SMF, with some evidence that the steeper (shallower) SMF observed in the isolated dwarfs (satellites) at \(\rm 10^7 \lesssim M_\star/M_\odot \lesssim 10^{10}\) can be explained by a correlation between SMF shape and star formation activity. (6.1).
Our inferred SHMR may be steeper than implied by mass modeling of individual galaxies at \(\log_{10}({\rm M_\star/M_\odot})\) \(\sim 7\); such a discrepancy may imply a systematic bias in kinematic measurements of dwarf halo masses, or a divergence between the low-mass SHMR and abundance matching assumptions (5.2, 6.2).
We do not find strong evidence for higher integrated galaxy formation efficiency (\(\rm M_\star/M_{\rm peak}\)) in satellite dwarfs as compared to field dwarfs, as has been recently predicted from cosmological simulations (6.3).
In combination with the near-unity mass-loading factor measured in of this series, the SAGAbg-SMF SHMR is naturally explained by high energy-loading factors and largely preventative star formation feedback. Despite this, the field dwarf SHMR is consistent with many cosmological simulations (at the 68% CI level) that employ a largely ejective feedback scheme (6.5).
The SAGAbg-SMF includes two quenched, isolated galaxies down to \(\rm M_\star \sim 4\times 10^8 M_\odot\), implying that the isolated quenched fraction continues to drop below \(\rm M_\star \sim 10^9 M_\odot\). This is contrast to contemporary big-box cosmological simulations, which largely predict a turnaround in \(f_q(\rm M_\star)\) at these masses – even when they accurately reproduce the dwarf SHMR (6.6).
The field dwarf population presents an opportune testbed in which to identify how and where competing methods to regulate galaxy and halo assembly succeed or fail. As summarized above, physically reasonable numerical recipes for galaxy formation can simultaneously reproduce the stellar-to-halo mass relation of field dwarfs, and over-predict the quenched fraction of dwarfs in isolation. This mismatch emphasizes the complexity of evaluating prescriptions for baryonic feedback and cycling at galactic scales — especially given that we lack ab initio theories for many of the key astrophysical processes involved — and the need for multidimensional observations to differentiate between competing theories for dwarf galaxy formation regulation.
In the three papers of this series, we have used the SAGAbg sample to build this kind of observational dataset: in , we showed that the SAGAbg dwarfs are characterized by moderately mass-loaded winds, while in , we demonstrated that the Star-Forming Sequence of dwarfs has evolved significantly at \(z<0.2\). Finally, in this work, we measured the integrated galaxy formation efficiency and quenched fraction of dwarfs outside the influence of a massive halo.
Each of these works aims to illuminate a different aspect of the dwarf formation problem to drive towards a complete picture of low-mass assembly across environment and cosmic time. As discussed in 6.5 and 6.6, the complexity of the problem calls for a holistic view of the dwarf population to advance our understanding of the processes that govern the assembly of low-mass galaxies and their halos. Further elucidating the physical mechanisms of dwarf regulation will require not just a broader, but a multi-pronged view of the low-mass Universe. Fortunately, one need not wait overly long; with new and upcoming wide-field views of properties including but not limited to star formation [15], [120], dwarf halo structure [121], and interstellar and circumgalactic medium enrichment [122], and the significant expansion in the number of spectroscopically identified galaxies from large surveys like DESI [14], our view of the low-mass Universe is set to not only exponentially expand in number, but greatly broaden in parameter space over the coming years.
The ideas underpinning this work took shape during and benefited greatly from discussion at the Dwarf Galaxies, Star Clusters, and Streams Workshop hosted by the Kavli Institute for Cosmological Physics at the University of Chicago. The authors would like to thank Drummond Fielding, Pratik Gandhi, Stacy Kim, Andrey Kravtsov, Sergio Martin-Alvarez, Viraj Pandya, and Andrew Pontzen for insightful discussions that contributed substantially to the development of this work. The authors would additionally like to thank the anonymous referee for their helpful feedback and suggestions that substantially improved the quality of this manuscript.
M.G.and Y.A.were supported in part by a grant to M.G from the Howard Hughes Medical Institute (HHMI) through the HHMI Professors Program.
This research used data from the SAGA Survey (Satellites Around Galactic Analogs; sagasurvey.org). The SAGA Survey is a galaxy redshift survey with spectroscopic data obtained by the SAGA Survey team with the Anglo-Australian Telescope, MMT Observatory, Palomar Observatory, W. M. Keck Observatory, and the South African Astronomical Observatory (SAAO). The SAGA Survey also made use of many public data sets, including: imaging data from the Sloan Digital Sky Survey (SDSS), the Dark Energy Survey (DES), the GALEX Survey, and the Dark Energy Spectroscopic Instrument (DESI) Legacy Imaging Surveys, which includes the Dark Energy Camera Legacy Survey (DECaLS), the Beijing-Arizona Sky Survey (BASS), and the Mayall z-band Legacy Survey (MzLS); redshift catalogs from SDSS, DESI, the Galaxy And Mass Assembly (GAMA) Survey, the Prism Multi-object Survey (PRIMUS), the VIMOS Public Extragalactic Redshift Survey (VIPERS), the WiggleZ Dark Energy Survey (WiggleZ), the 2dF Galaxy Redshift Survey (2dFGRS), the HectoMAP Redshift Survey, the HETDEX Source Catalog, the 6dF Galaxy Survey (6dFGS), the Hectospec Cluster Survey (HeCS), the Australian Dark Energy Survey (OzDES), the 2-degree Field Lensing Survey (2dFLenS), and the Las Campanas Redshift Survey (LCRS); HI data from the Arecibo Legacy Fast ALFA Survey (ALFALFA), the FAST all sky HI Survey (FASHI), and HI Parkes All-Sky Survey (HIPASS); and compiled data from the NASA-Sloan Atlas (NSA), the Siena Galaxy Atlas (SGA), the HyperLeda database, and the Extragalactic Distance Database (EDD). The SAGA Survey was supported in part by NSF collaborative grants AST-1517148 and AST-1517422 and Heising–Simons Foundation grant 2019-1402. SAGA Survey’s full acknowledgments can be found at https://sagasurvey.org/ack.
The CosmoSim database used in this paper is a service by the Leibniz-Institute for Astrophysics Potsdam (AIP). The MultiDark database was developed in cooperation with the Spanish MultiDark Consolider Project CSD2009-00064.
The authors gratefully acknowledge the Gauss Centre for Supercomputing e.V. (www.gauss-centre.eu) and the Partnership for Advanced Supercomputing in Europe (PRACE, www.prace-ri.eu) for funding the MultiDark simulation project by providing computing time on the GCS Supercomputer SuperMUC at Leibniz Supercomputing Centre (LRZ, www.lrz.de). The Bolshoi simulations have been performed within the Bolshoi project of the University of California High-Performance AstroComputing Center (UC-HiPACC) and were run at the NASA Ames Research Center.
GAMA is a joint European-Australasian project based around a spectroscopic campaign using the Anglo-Australian Telescope. The GAMA input catalogue is based on data taken from the Sloan Digital Sky Survey and the UKIRT Infrared Deep Sky Survey. Complementary imaging of the GAMA regions is being obtained by a number of independent survey programmes including GALEX MIS, VST KiDS, VISTA VIKING, WISE, Herschel-ATLAS, GMRT and ASKAP providing UV to radio coverage. GAMA is funded by the STFC (UK), the ARC (Australia), the AAO, and the participating institutions. The GAMA website is https://www.gama-survey.org/ .
The CosmoSim database used in this paper is a service by the Leibniz-Institute for Astrophysics Potsdam (AIP). The MultiDark database was developed in cooperation with the Spanish MultiDark Consolider Project CSD2009-00064.
The authors gratefully acknowledge the Gauss Centre for Supercomputing e.V. (www.gauss-centre.eu) and the Partnership for Advanced Supercomputing in Europe (PRACE, www.prace-ri.eu) for funding the MultiDark simulation project by providing computing time on the GCS Supercomputer SuperMUC at Leibniz Supercomputing Centre (LRZ, www.lrz.de). The Bolshoi simulations have been performed within the Bolshoi project of the University of California High-Performance AstroComputing Center (UC-HiPACC) and were run at the NASA Ames Research Center.
To guide the reader in both the flow of the paper and the connection between the SAGA Survey and the SAGAbg-SMF sample, we provide a schematic description of our approach to deriving each of these quantities in 8. From top to bottom, the figure walks through the process from sample selection and collection, selection function correction, and data product generation. The red panels show processing related primarily to the SAGA Survey itself, while blue panels show processing and analysis related specifically to the SAGAbg sample. The grey panel denotes the use of the Bolshoi-Planck dark-matter only cosmological simulation to measure halo mass functions for abundance matching. Each panel notes both the relevant section and, where applicable, a graphical representation of the process step.
As discussed in 3.2, the absolute completeness of the SAGAbg-SMF must be evaluated over both spectroscopic and photometric completeness. The DESI Legacy Imaging survey, from which the SAGA photometric catalogs are constructed, has a 5\(\sigma\) point source depth of \(m_r=23.9\) with an average 1.2” seeing in the \(r\)-band [58].
For the sake of a rough estimate of whether photometric incompleteness should be considered, let us adopt the limiting surface brightness of compact objects to be approximately \(\mu_{r,\lim}=24.9\) mag arcsec\(^{-2}\). The \(r-\)band luminosity at which a galaxy reaches the surface brightness limit, then, can be written as: \[L_{r,\lim} = 3.4\times10^{12} (1+z)^4 R_{\rm eff}^2 g_{AB}^\nu 10^{-0.4 \mu_{r,\lim}}\] For a galaxy of size \(R_{\rm eff} = 1\) kpc, this corresponds to a limiting \(r-\)band luminosity of \(L_{r,\lim} = 2.8\times10^7\;L_\odot\) given a solar absolute magnitude of \(M_{r,\odot}=4.61\). The \(\log_{10}({\rm M_\star/M_\odot})\) \(<9\) galaxies in the SAGAbg-SMF have a median mass-to-light ratio in the \(r-\)band of \({\rm M}_\star/L_r = 0.9 {\rm M}_\odot/L_\odot\), implying that a photometric correction term may be necessary for the low-mass end of our sample.
To actually compute this term, we must model the allowable number density galaxies in the SAGAbg-SMF volume that are both missing from the photometric targeting catalog, and at an apparent magnitude brighter than our \(m_r<20.75\) limit. We choose a forward modeling approach to this problem, wherein we sample the observed stellar mass-to-light relation of the SAGAbg-SMF sample and the mass-size relation of Asali et al., 2025 (submitted) 10,000 times over a grid of stellar masses at \(6<\) \(\log_{10}({\rm M_\star/M_\odot})\) \(<10\) and redshifts \(0.0015<z<0.05\). We find the mass-to-light relation of the SAGAbg-SMF galaxies to be well-described by a normally distributed linear relation wherein \(\log_{10}({\rm M_\star/M_\odot})\) \(-M_r = 3.21 (\pm0.01)\log_{10}(\rm M_\star/M_\odot) \;-2.2(\pm0.1)\) with \(\sigma_{M/L}=0.24\pm0.01\).
For a given instantiation, the mass, redshift, size and stellar mass-to-light ratio of the model galaxy immediately yields the apparent magnitude and \(r\)-band surface brightness. We discard galaxies fainter than \(m_r=20.75\) from the sample, and define the probability that a random galaxy at a given mass and redshift to be detected in the photometric targeting catalog as the fraction of model galaxies brighter than the surface brightness detection limit. Finally, we marginalize over redshift assuming a uniformly distributed galaxy density within the sphere to arrive at the probability that a galaxy of a given stellar mass will be detected in the photometric targeting catalog.
The results of this simple exercise are shown in 9. At a given mass, the completeness correction small compared to the counting uncertainty incurred in that bin. We thus conclude that photometric completeness is a secondary consideration for this sample as compared to spectroscopic volume completeness.
The \(V_{\rm eff}\) volume correction is fundamental to accurately estimating differential number densities from galaxy samples because it quantifies the volume fraction over which a given galaxy is observable in the survey, compensating for our intrinsic property-dependent ability to observe galaxies at a given distance.
To give the reader a sense of the comoving volume probed by the SAGAbg-SMF sample as a function of stellar mass, in the top panel of 10 we convert our \(V_{\rm eff}\) values to the redshift that they would correspond to in a magnitude-limited sample – that is, the redshift \(z_{\rm max}\) such that \(V_{\rm eff} = \int_0^{z_{\rm max}} \frac{dV}{dz} dz\). This shows that galaxies above \(\rm M_\star \gtrsim 10^8 M_\odot\) are generally complete across the full redshift range of the SAGAbg-SMF sample. This can also be seen in the lower panel of 10, which shows that the raw counts in the sample are consistent with a turnover at \(\rm M_\star \sim 10^8 M_\odot\) for each of the sub-samples considered in this work.
Below this stellar mass, our volume-completeness drops substantially – the corresponding maximum detectable redshift \(z_{\rm max}\) drops to around \(z=0.01\) at the lowest stellar masses considered. However, we note that it is important to remember that the \(V_{\rm eff}\) method is not a way to account for galaxies that the sample has missed (as is the case for the much-smaller photometric completeness correction described in 3.2 and 9), but a correction to the volume over which we compute the number density of the galaxies that the sample does contain.
To test our implementation of the \(1/V_{\rm eff}\) method, we also implement a modification of the STY parametric fitting method [123]–[125] as described below in 4.1. We adopt the \(1/V_{\rm eff}\) method as our fiducial fitting approach: as detailed below, the modified STY approach requires stronger assumptions about how the photometric properties of galaxies in the sample change as a function of mass, whereas the \(1/V_{\rm eff}\) method estimates this completeness as a function of distance, which is more directly predictable from observables. However, we implement both methods to show that our inferred SMF fitting parameters are consistent between the two methods.
In the original STY method it is assumed that there exists a sample-wide, redshift-dependent mass completeness limit, such that the probability of any object \(i\) in a given galaxy sample being drawn from a parametric SMF \(\Phi({\rm M_\star}|\vec{X})\) described by parameters \(\vec{X}\) can be written as
\[\label{e:sty} {\rm Pr}[{\rm M_{\star,i}, z}|\vec{X}] \propto \frac{\phi({\rm M_{\star,i}}|\vec{X})}{\int_{\rm M_{\star, \rm min}(z)}^{\rm M_{\star, \rm max} (z)} \phi({\rm M_\star}|\vec{X}) d{\rm M_\star}}.\tag{4}\] where \(\rm M_{\star, min}(z)\) and \(\rm M_{\star, max}(z)\) are the minimum and maximum stellar masses for which the sample is complete as a function of redshift \(z\). The non-monotonicity of the mass completeness of the SAGAbg selection function at fixed redshift means that we cannot exactly define these minimum and maximum stellar masses. As such, we adopt a more general form of 4 such that:
\[{\rm Pr}[{\rm M_{\star,i}, z},\vec{X}] \propto \frac{\phi({\rm M_{\star,i}},\vec{X})}{\int_{0}^{\infty} {\rm Pr}[{\rm obs}|\vec{\theta}_i(\rm M_\star)]\phi({\rm M_{\star}},\vec{X})d{\rm M_\star}},\]
where \({\rm Pr}[{\rm obs}|\vec{\theta}_i(\rm M_\star)]\) takes the same for as 2 , and is the probability that a galaxy will observed with photometric properties \(\vec{\theta}_i(\rm M_\star)\) as a function of stellar mass \(\rm M_\star\). Here, we calculate the expected photometric properties of the \(i\)th galaxy as a function of its stellar mass, as opposed to as a function of redshift as in 2 .
In practice, we hold the color fixed and allow the apparent magnitude and surface brightness to vary as a function of stellar mass, assuming a fixed mass-to-light ratio. We adopt the scaling of the Asali et al. 2025 (submitted) mass-size relation (where effective radius scales with stellar mass approximately as \(\rm R_{eff} \propto M^{0.3}\)) to estimate a rough scaling between stellar mass and r-band surface brightness of \(\mu_r \propto -\log_{10}(\rm M_\star/M_\odot)\). We can then write the likelihood of all galaxies in the sample being drawn from \(\phi({\rm M_\star},\vec{X})\) as:
\[\mathcal{L}(\text{SAGAbg-SMF}|\vec{X}) \propto \prod_{i}^{N_{\rm gal}} {\rm Pr}[{\rm M}_{\star,i}|\vec{X}].\]
Due to the construction of the likelihood for the STY method, the fitting procedure cannot be used directly to constrain the absolute scale of the SMF. Although there are methods to compute this normalization after fitting, we forgo this step because the \(1/V_{\rm eff}\) method includes this normalization. We adopt the same priors for the STY method as assumed for the \(1/V_{\rm eff}\) method.
In [t:smf] we report fitting parameters and uncertainties from the fiducial approach described in 4.1. We also report the inferred parameters obtained via the modified STY approach in [t:smfsty], and here note that all inferred parameters are within the 68% credible interval of our fiducial inference.
In addition to the abundance matching presented in the main text (4.2), we infer the SHMR that best describes the field dwarfs of the SAGAbg-SMF sample. To estimate the halo mass function as a function of the environment, we apply a modified version of the environmental classification scheme presented in 2.3 to the \(z=0\) snapshot of the BolshoiP [126] dark-matter only simulations. We define potential host halos to be those with \(\log_{10}(\rm M_{vir}/M_\odot)>12.14\), which corresponds to the lowest estimated halo mass of the HyperLEDA hosts used for the observational sample. For a randomly drawn sample of 100,000 halos in the simulation, we compute the projected radius and line-of-sight velocity difference to each potential host halo as:
\[\begin{align} R_{\rm proj}(\vec{r}_{i},\vec{r}_{j}) &= \sqrt{(x_i-x_j)^2+(y_i-y_j)^2}\\ \Delta v_{i,j} &= H_0(z_i-z_j) + (v_{i,z}-v_{j,z}) \end{align}\]
Where the \(i\) and \(j\) subscripts indicate the test object and potential host halo, respectively, \(\vec{r}=(x,y,z)\) are the simulated positions of each halo, and \(v_{z}\) is the peculiar velocity of the halo in the \(z\)-direction. To be consistent with our observed halo classification, we also use the halo mass-based estimate of \(V_{\rm 200}\) given in 1 for the simulated host halos, as we find that it is 16% (median fractional offset) systematically higher than the true simulation maximum circular velocity reported in the simulation. We then use the simulated estimates of \(R_{\rm proj}\) and \(\Delta v\) to determine environment in the same way as for the observed galaxies in the sample.
We find that the fraction of satellite and field galaxies in the simulation closely matches the observed sample (\(f_{\rm sample}^{\rm sim}=[0.47,0.53]\) for simulated satellite and field halos, respectively, compared to \(f_{\rm sample}^{\rm obs}=[0.45,0.55]\) in the observed sample). However, the simulated halos predict a substantially lower fraction of isolated dwarfs (\(f^{\rm sim}_{\rm sample}=0.16\), compared to \(f^{\rm obs}_{\rm sample}=0.32\)). This discrepancy is because the isolated criterion uses a constant velocity difference cutoff to determine environment (unlike the field classification, where the minimum velocity difference required to trigger the satellite condition scales with the potential host’s mass). As a result, the isolated classifier is more sensitive to the lower halo mass limit chosen for potential hosts.
The slope of the low-mass field galaxy HMF and isolated galaxy HMF are nearly identical in our test (\(\beta(\text{field})=-0.96\), \(\beta(\text{isolated})=-1.02\)); the slope of the satellite HMF (for our definition of “satellite”) is slightly shallower, at \(\beta(\text{satellite})=-0.86\). As expected, the slope of the environmentally-averaged HMF is between these values, at \(\beta(\text{all})=-0.91\). Our form for the HMF is not a good approximation for the HMF functional form of [66] for very massive halos (\(\rm M_h\gtrsim 10^{13} M_{\odot}\)), where the true HMF declines more steeply than the low-mass power-law slope. To verify that the divergence in our approximation to the HMF does not affect our abundance matching results, we also perform abundance matching for our environmentally-averaged sample against the analytic form of the HMF given by [66], and verify that our inferred SHMR does not significantly change.
We find that the SHMR implied via this exercise for the field dwarfs alone is statistically consistent with the abundance matching-based SHMR of the full SAGAbg-SMF sample, supporting the idea that the presence of satellite galaxies in the SAGAbg-SMF sample is not expected to fully explain the consistency between the SHMR we estimate for the SAGAbg-SMF sample and previous measurements of the SHMR for Local Group and Local Volume satellite systems.
In this final appendix section, we present machine-readable tabulations of our measured SMFs and consider additional subsamples of the SAGAbg sample over which SMFs can be constructed. In particular, we compute the SMF of the SAGAbg sample at higher redshift (\(0.03<z<0.1\)), and as a function of star formation activity. In all cases we use the same methodology as outlined in 3, and classify star formation activity in the same way as described in 4.3.
First, we present the machine-readable SMFs measured from the SAGAbg-SMF sample as a function of environment in [t:smfmrt]. Only the first few rows of the table are shown here; the reader can find a full version of the table associated with the online version of this article.
The redshift range of the SAGAbg-SMF sample, \(z<0.05\), corresponds to a maximum comoving distance of \(d\approx 210\) Mpc; the exclusion of the SAGA host volume at the low-redshift end of this range further reduces the effective comoving volume probed by the sample. To check whether considerations such as cosmic variance [127] substantially affect our measured stellar mass functions, we additionally measure the stellar mass function of SAGAbg galaxies at \(0.03<z<0.1\) using the same methodology presented in 3.
Because we do not have a complete census of \(\log_{10}({\rm M_\star/M_\odot})\) \(\gtrsim 10.5\) hosts in this redshift range, we only measure the environmentally-averaged SMF at \(0.03<z<0.1\). As shown in 11, our measurement of the SMF out to \(z=0.1\) is consistent both with our SAGAbg-SMF SMF and with previous literature measurements down to our limiting stellar mass in this redshift range (\(\log_{10}({\rm M_\star/M_\odot})\) \(\sim 8\)). We thus conclude that volume effects such as cosmic variance do not play a substantial role in setting the shape or normalization of the SAGAbg-SMF stellar mass functions.
We now consider the SAGAbg-SMF SMF as a function of star formation activity for both the environmentally-averaged sample and the satellite subsample. Because we have very few quenched field dwarfs in the sample, the field and isolated dwarf SMFs would be unaffected by a cut in SFR; we thuse do not recalculate the field or isolated dwarf SMF as a function of star formation activity.
The resulting SMFs are shown in 12; their accompanying double Schechter fits are detailed in [t:smfsf]. We find that the star-forming SMF of both the environmentally-averaged and satellite samples are consistent in their low-mass slope with that of the full, field, and isolated SAGAbg-SMF SMF. The quenched subsamples, meanwhile, suggest a slightly shallower SMF – though it should be noted that the dynamic range over which the quenched SMFs are fit is around an order of magnitude smaller than that of the star-forming galaxies due to a lack of low-mass quenched systems in the sample.
cccccccc & 7.05 & 2.655e-02 & 3.754e-02 & 5.340e-02 & 7.301e-02 & 9.474e-02 & all
7.05 & 7.20 & 4.753e-02 & 6.748e-02 & 9.378e-02 & 1.285e-01 & 1.665e-01 & all
7.20 & 7.30 & 2.775e-02 & 3.950e-02 & 5.413e-02 & 7.401e-02 & 9.604e-02 & all
⋮& ⋮& ⋮& ⋮& ⋮& ⋮& ⋮& ⋮
ccccccc & All & \(-1.46^{+0.03}_{-0.07}\) & \(-0.3^{+0.2}_{-0.3}\) & \(10.8^{+0.3}_{-0.1}\) & &
& Satellite & \(-1.49^{+0.03}_{-0.03}\) & \(-0.93^{+0.43}_{-0.06}\) & \(10.52^{+0.29}_{-0.02}\) & &
& Field & \(-1.51^{+0.04}_{-0.04}\) & \(-0.39^{+0.24}_{-0.29}\) & \(10.74^{+0.21}_{-0.16}\) & &
& Isolated & \(-1.55^{+0.11}_{-0.05}\) & \(-0.58^{+0.27}_{-0.26}\) & \(10.8^{+0.2}_{-0.2}\) & &
ccccccc Star-forming, all & 2322 & \(0.98^{+0.65}_{-0.52}\) & \(-1.41^{+0.06}_{-0.07}\) & \(1.91^{+1.33}_{-0.92}\) & \(-0.56^{+0.39}_{-0.33}\) & \(10.51^{+0.85}_{-0.24}\)
Quenched, all & 90 & \(0.15^{+0.09}_{-0.04}\) & \(-1.17^{+0.11}_{-0.18}\) & \(0.49^{+0.72}_{-0.26}\) & \(-0.37^{+0.26}_{-0.35}\) & \(10.58^{+0.89}_{-0.58}\)
Star-forming, satellites & 921 & \(0.50^{+0.56}_{-0.28}\) & \(-1.31^{+0.12}_{-0.13}\) & \(2.44^{+2.89}_{-1.42}\) & \(-0.49^{+0.35}_{-0.37}\) & \(10.78^{+0.86}_{-0.99}\)
Quenched, satellites & 71 & \(0.16^{+0.11}_{-0.05}\) & \(-1.16^{+0.12}_{-0.22}\) & \(0.61^{+1.14}_{-0.35}\) & \(-0.37^{+0.26}_{-0.35}\) & \(10.38^{+1.04}_{-0.80}\)
We draw catalog photometry for 1 from the nsa_v0_1_2 and InputCatAv07 for SDSS and GAMA, respectively.↩︎
The cosmology adopted in BolshoiP is consistent with our adopted cosmology (\(h=0.7\), \(\Omega_m=0.3\)).↩︎
We do not report uncertainties on the power-law fit, as the non-parametric SHMR posterior is better suited for uncertainty analysis.↩︎
Of the simulations considered by [99], the TNG100 baryonic mass resolution is \(\rm M_{bary}\sim 2\times10^6 M_\odot\), and the SIMBA resolution is an order of magnitude higher at \(\rm M_{bary}\sim 2\times10^7M_\odot\). TNG50 and FIREBox around two orders of magnitude higher resolution in bayronic mass resolution compared to TNG100 [102], [115]↩︎