• No se han encontrado resultados

The clustering of Ly alpha emitters in a Lambda CDM Universe

N/A
N/A
Protected

Academic year: 2020

Share "The clustering of Ly alpha emitters in a Lambda CDM Universe"

Copied!
16
0
0

Texto completo

(1)Mon. Not. R. Astron. Soc. 391, 1589–1604 (2008). doi:10.1111/j.1365-2966.2008.14010.x. The clustering of Lyα emitters in a CDM Universe Alvaro Orsi,1,2 Cedric G. Lacey,1 Carlton M. Baugh1 and Leopoldo Infante2 for Computational Cosmology, Department of Physics, University of Durham, South Road, Durham DH1 3LE de Astronomı́a y Astrofı́sica, Facultad de Fı́sica, Pontificia Universidad Católica de Chile, Casilla 306, Santiago 22, Chile. 2 Departamento. Accepted 2008 September 24. Received 2008 September 16; in original form 2008 July 17. ABSTRACT. We combine a semi-analytical model of galaxy formation with a very large simulation which follows the growth of a large-scale structure in a  cold dark matter (CDM) universe to predict the clustering of Lyα emitters. We find that the clustering strength of Lyα emitters has only a weak dependence on Lyα luminosity but a strong dependence on redshift. With increasing redshift, Lyα emitters trace progressively rarer, higher density regions of the universe. Due to the large volume of the simulation, over 100 times bigger than any previously used for this application, we can construct mock catalogues of Lyα emitters and study the sample variance of current and forthcoming surveys. We find that the number and clustering of Lyα emitters in our mock catalogues are in agreement with measurements from current surveys, but there is a considerable scatter in these quantities. We argue that a proposed survey of emitters at z = 8.8 should be extended significantly in solid angle to allow a robust measurement of Lyα emitter clustering. Key words: methods: N-body simulations – methods: numerical – galaxies: evolution – galaxies: high-redshift – large-scale structure of Universe.. 1 I N T RO D U C T I O N The study of galaxies at high redshifts opens an important window on the process of galaxy formation and conditions in the early universe. The detection of populations of galaxies at high redshifts is one of the great challenges in observational cosmology. Currently, three main observational techniques are used to discover high redshift, star-forming galaxies: (i) the Lyman-break dropout technique, in which a galaxy is imaged in a combination of three or more optical or near-infrared (near-IR) bands. The longer wavelength filters detect emission in the rest-frame ultraviolet from ongoing star formation, whereas the shorter wavelength filters sample the Lymanbreak feature. Hence, a Lyman-break galaxy appears blue in one colour and red in the other (Steidel et al. 1996, 1999). By shifting the whole filter set to longer wavelengths, the Lyman-break feature can be isolated at higher redshifts; (ii) submillimetre emission, due to dust being heated when it absorbs starlight (Smail, Ivison & Blain 1997; Hughes et al. 1998). The bulk of the energy absorbed by the dust comes from the rest-frame ultraviolet and so the dust emission is sensitive to the instantaneous star formation rate (iii) Lyα line emission from star-forming galaxies, typically identified using either narrowband imaging (Hu, Cowie & McMahon 1998; Kudritzki et al. 2000; Gawiser et al. 2007; Ouchi et al. 2008) or long-slit spectroscopy of gravitationally lensed regions (Ellis et al. 2001; Santos et al. 2004; Stark et al. 2007). The Lyα emission.  E-mail: alvaro.orsi@durham.ac.uk  C. C 2008 RAS 2008 The Authors. Journal compilation . is driven by the production of Lyman-continuum photons and so is dependent on the current star formation rate. The Lyman-break drop out and submillimetre detection methods are more established than Lyα emission as a means of identifying substantial populations of high-redshift galaxies. Nevertheless, in the last few years, there have been a number of Lyα surveys which have successfully found high-redshift galaxies (e.g. Hu et al. 1998; Kudritzki et al. 2000). The observational samples have grown in size such that statistical studies of the properties of Lyα emitters have now become possible: for example, the Subaru/XMM–Newton Deep Survey (SXDS) (Ouchi et al. 2005, 2008) has allowed estimates of the luminosity function (LF) and clustering of Lyα emitters in the redshift range 3 < z < 6, and the Multiwavelength Survey by YaleChile (MUSYC) (Gawiser et al. 2007; Gronwall et al. 2007) has also produced clustering measurements at z ∼ 3. Furthermore, the highest redshift galaxy (z = 6.96) robustly detected to date was found using the Lyα technique (Iye et al. 2006). Taking advantage of the magnification of faint sources by gravitational lensing, Stark et al. (2007) reported six candidates for Lyα emitters in the redshift range 8.7 < z < 10.2, but these have yet to be confirmed. The Dark Ages Z Lyman-alpha Explorer (DAzLE) Project (Horton et al. 2004) is designed to find Lyα emitters at z = 7.73 and z = 8.78. However, the small field of view of the instrument (6.83 × 6.83 arcmin2 ) makes it difficult to use if for studying large-scale structure (LSS) at such redshifts. On the other hand, the Emission-Line galaxies with (Visible and Infrared Survey Telescope for Astronomy) (VISTA) Survey (ELVIS) (Nilsson et al. 2007a,b) would appear to offer a more promising route to study the LSS of very high-redshift galaxies (z = 8.8).. Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. 1 Institute.

(2) 1590. A. Orsi et al..  C. galaxy bias as a function of Lyα luminosity. However, these results depend on an analytical model for the halo bias (Sheth, Mo & Tormen 2001), and furthermore the linear bias assumption breaks down on small scales. Here, we will present an explicit calculation of the clustering of galaxies by implementing the semi-analytical model on top of a large N-body simulation of the hierarchical clustering of the dark matter distribution. This allows us to predict the spatial distribution of Lyα emitting galaxies, and to create realistic maps of Lyα emitters at different redshifts. These maps can be analysed with simple statistical tools to quantify the spatial distribution and clustering of galaxies at high redshifts. The N-body simulation used in this work is the Millennium Simulation, carried out by the Virgo Consortium (Springel et al. 2005). The simulation of the spatial distribution of Lyα emitters is tested by creating mock catalogues for different surveys of Lyα emitting galaxies in the range 3 < z < 9. The clustering of Lyα emitters in our model is analysed with correlation functions and halo occupation distributions (HODs). Taking advantage of the large volume of the Millennium Simulation, we also compute the errors expected on correlation function measurements from various surveys due to cosmic variance. The outline of this paper is as follows. Section 2 gives a brief description of the semi-analytical galaxy formation model and describes how it is combined with the N-body simulation. In Section 3, we establish the range of validity of our simulated galaxy samples by studying the completeness fractions in the model Lyα LFs. Section 4 gives our predictions for the clustering of Lyα emitters in the range 0 < z < 9. In Section 5, we compare our simulation with recent observational data and we also make predictions for future measurements (clustering and number counts) expected from the ELVIS. Finally, Section 6 gives our conclusions. 2 THE MODEL We use the semi-analytical model of galaxy formation, GALFORM, to predict the properties of the Lyα emission of galaxies and their abundance as a function of redshift. The GALFORM model is fully described in Cole et al (2000) (see also the review by Baugh 2006), and the variant used here was introduced by Baugh et al (2005) (see also Lacey et al. 2008, for a more detailed description). The model computes star formation histories for the whole galaxy population, following the hierarchical evolution of the host dark matter haloes. The merger histories of dark matter haloes are calculated by using Monte Carlo method, following the formalism of the extended Press & Schechter theory (Press & Schechter 1974; Lacey & Cole 1993). When using Monte Carlo merger trees, the mass resolution of dark matter haloes can be arbitrarily high, since the whole of the computer memory can be devoted to one tree rather than a population of trees. In contrast, N-body merger trees are constrained by a finite mass resolution due to the particle mass, which is usually poorer than that typically adopted for Monte Carlo merger trees. Discrepancies between the model predictions obtained with Monte Carlo trees and those extracted from a simulation only become evident fainter than some luminosity which is set by the mass resolution of the N-body trees, as we will see in the next section (see also Helly et al. 2003). A critical assumption of the Baugh et al. model is that stars formed in starbursts have a top-heavy initial mass function (IMF), where the IMF is given by dN/d ln (m) ∝ m−x and x = 0. Stars formed quiescently in discs have a solar neighbourhood IMF, with the form proposed by Kennicut (1983): x = 0.4 for m < 1 M and x = 1.5 for m > 1 M . Both IMFs cover the mass range 0.15 C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation . Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. Despite these observational breakthroughs, predictions of the properties of star-forming Lyα emitting galaxies are still in the relatively early stages of development. Often these calculations employ crude assumptions about the galaxy formation process to derive a star formation rate, and hence a Lyα luminosity or use hydrodynamical simulations, which, due to the high computational overhead, study relatively small cosmological volumes. Haiman & Spaans (1999) made predictions for the escape fraction of Lyα emission and the abundance of Lyα emitters using the Press–Schechter formalism and a prescription for the dust distribution in galaxies. Radiative transfer calculations of the escape fraction have been made by Zheng & Miralda-Escudé (2002), Ahn (2004) and Verhamme, Schaerer & Maselli (2006) for idealized geometries, while Tasitsiomi (2006) and Laursen & Sommer-Larsen (2007) applied these calculations to galaxies taken from cosmological hydrodynamical simulations. Barton et al (2004) and Furlanetto et al (2005) calculated the number density of Lyα emitters using hydrodynamical simulations of galaxy formation. Nagamine et al. (2006, 2008) used hydrodynamical simulations to predict the abundance and clustering of Lyα emitters. The typical computational boxes used in these calculations are very small (∼10–30 h−1 Mpc), which makes it impossible to evolve the simulation accurately to z = 0. Hence, it is difficult to test if the galaxy formation model adopted produces a reasonable description of present-day galaxies. Furthermore, the small box size means that the reliable clustering predictions can only be obtained on scales smaller than the typical correlation length of the galaxy sample. As we will show in this paper, small volumes are subject to significant fluctuations in clustering amplitude. The semi-analytical approach to modelling galaxy formation allows us to make substantial improvements over previous calculations of the properties of Lyα emitters. The speed of this technique means that the large populations of galaxies can be followed. The range of predictions which can be made using semi-analytical models is, in general, broader than that produced from most hydrodynamical simulations, so that the model predictions can be compared more directly with observational results. A key advantage is that the models can be readily evolved to the present day, giving us more faith in the ingredients used, i.e. we can be reassured that the physics underpinning the predictions presented for a high-redshift population of galaxies would not result in too many bright/massive galaxies at the present day. The first semi-analytical calculation of the properties of Lyα emitters based on a hierarchical model of galaxy formation was carried out by Le Delliou et al. (2005). This is the model used throughout this work, which has been shown to be successful in predicting the properties of Lyα emitters over a wide range of redshifts. The semi-analytical model allows us to connect Lyα emission to other galaxy properties. Le Delliou et al. (2006) showed that this model successfully predicts the observed Lyα LFs and equivalent widths (EWs), along with some fundamental physical properties, such as star formation rates, gas metallicities and stellar and halo masses. In Nilsson et al. (2007a), we used the model to make further predictions for the LF of very high-redshift Lyα emitters and to study the feasibility of current and forthcoming surveys which aim to detect such high-redshift galaxies. Kobayashi, Totani & Nagashima (2007) developed an independent semi-analytical model to derive the LFs of Lyα emitters. The focus of this paper is to use the model introduced by Le Delliou et al. (2005) to study the clustering of high-redshift Lyα emitting galaxies and to extend the comparison of model predictions with current observational data. Le Delliou et al. (2006) already gave an indirect prediction of the clustering of Lyα emitters by studying.

(3) The clustering of Lyα emitters. (i) The integrated stellar spectrum of the galaxy is calculated, based on its star formation history, including the effects of the distribution of stellar metallicities, and taking into account the IMFs adopted for different modes of star formation. (ii) The rate of production of Lyman continuum photons is computed by integrating over the stellar spectrum, and assuming that all of these ionizing photons are absorbed by neutral hydrogen within the galaxy. We calculate the fraction of Lyα photons produced by these Lyman continuum photons, assuming the case B recombination (Osterbrock 1989). (iii) The observed Lyα flux depends on the fraction of Lyα photons which escape from the galaxy (f esc ), which is assumed to be constant and independent of galaxy properties. Calculating the Lyα escape fraction from first principles by following the radiative transfer of the Lyα photons is very demanding computationally. A more complete calculation of the escape fraction would have into account the structure and kinematic properties of the interstellar medium (Zheng & Miralda-Escudé 2002; Ahn 2004; Verhamme et al. 2006). In our model, we adopt the simplest possible approach, which is to fix the escape fraction, f esc , to be the same for each galaxy, without taking into account its dust properties. This results in a surprisingly good agreement between the predicted number counts and LFs of emitters and the available observations at 3  z  7 (Le Delliou et al. 2005, 2006). Le Delliou et al. (2005) chose f esc = 0.02 to match the number counts at z ≈ 3 at a flux f ≈ 2 × 10−17 erg cm−2 s−1 . The same value is used in this work. This value for the Lyα escape fraction seems very small, but is consistent with direct observational estimates for low-redshift galaxies. Atek et al. (2008) derive escape fractions for a sample of nearby star-forming galaxies by combining measurements of Lyα, Hα and Hβ, and find that most have escape fractions of 3 per cent  C. or less. Le Delliou et al. (2006) also showed that if a standard solar neighbourhood IMF is adopted for all modes of star formation, then a substantially larger escape fraction would be required to match the observed counts of Lyα emitters, and even then the overall match would not be as quite good as it is when the top-heavy IMF is used in bursts. Once we obtain the galaxy properties from the semi-analytical model, we plant these galaxies into a N-body simulation, in order to add information about their positions and velocities. The simulation used here is the Millennium Simulation (Springel et al. 2005). This simulation adopts concordance values for the parameters of a flat CDM model, m = 0.25 and b = 0.045 for the densities of matter and baryons at z = 0, h = 0.73 for the present-day value of the dimensionless Hubble constant, σ 8 = 0.9 for the rms linear mass fluctuations in a sphere of radius 8 h−1 Mpc at z = 0 and n = 1 for the slope of the primordial fluctuation spectrum. The simulation follows 21603 dark matter particles from z = 127 to z = 0 within a cubic region of comoving length 500 h−1 Mpc. The individual particle mass is 8.6 × 108 h−1 M , so the smallest dark halo which can be resolved has a mass of 2 × 1010 h−1 M . Dark matter haloes are identified using a Friends-Of-Friends (FOF) algorithm. To populate the simulation with galaxies from the semi-analytical model, we use the same approach as in Benson et al. (2000). First, the position and velocity of the centre of mass of each halo are recorded, along with the positions and velocities of a set of randomly selected dark matter particles from each halo. Secondly, the list of halo masses is fed into the semi-analytical model in order to produce a population of galaxies associated with each halo. Each galaxy is assigned a position and velocity within the halo. Since the semi-analytical model distinguishes between central and satellite galaxies, the central galaxy is placed at the centre of mass of the halo, and any satellite galaxy is placed on one of the randomly selected halo particles. Once galaxies have been generated, and positions and velocities have been assigned, it is a simple process to produce catalogues of galaxies with spatial information and any desired selection criteria. The combination of the semi-analytical model with the N-body simulation is essential to study the detailed clustering of a desired galaxy population, although the clustering amplitude on large scales can also be estimated analytically (Le Delliou et al. 2006). An example of the output of the simulation is shown in the four images of Fig. 1 which show redshifts z = 0, 3.3, 5.7 and 8.5. The dark matter distribution (shown in green) becomes smoother as we go to higher redshifts, due to the gravitational growth of structures. As shown in Fig. 1, for this particular luminosity cut, the number density of Lyα emitters varies at different redshifts. As we will show in the next section, these catalogues at high redshift are not complete at faint luminosities, so we have to restrict our predictions to brighter luminosities as we go to higher redshifts. 3 LUMINOSITY FUNCTIONS The model presented by Le Delliou et al. (2005, 2006) differs in two main ways from the one presented in this paper. (i) There is a slight difference in the values of the cosmological parameters used. (ii) The earlier work used a grid of halo masses together with an analytical halo mass function rather than the set of haloes from an N-body simulation. In Section 3.1, we investigate the impact of the different choice of cosmological parameters on the LF of Lyα emitters, to see if the very good agreement with observational data obtained by Le Delliou et al. (2006) is retained on adopting the. C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation . Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. < m < 125 M . Within the framework of CDM, Baugh et al. argued that the top-heavy IMF is essential to match the counts and redshift distribution of galaxies detected through their submillimetre emission, whilst retaining the match to galaxy properties in the local Universe, such as the optical and far-IR LFs and galaxy gas fractions and metallicities. Nagashima et al. (2005a,b) showed that such a topheavy IMF also results in predictions for the metal abundances in the intracluster medium and in elliptical galaxies in much better agreement with observations. Lacey et al. (2008) showed that the same model predicts galaxy evolution in the IR in good agreement with observations from Spitzer, and also discussed independent observational evidence for a top-heavy IMF. Reionization is assumed in our model to occur at zreion = 10 (Kogut et al. 2003; Dunkley et al. 2008). The photoionization of the intergalactic medium is assumed to suppress the collapse and cooling of gas in haloes with circular velocities V c < 60 km s−1 at redshifts z < zreion (Benson et al. 2002). Recent calculations (Hoeft et al. 2006; Okamoto, Gao & Theuns 2008) imply that the above parameter values overstate the impact of photoionization on gas cooling, and suggest that photoionization only affects smaller haloes with V c  30 km s−1 . Our model predictions, for the range of Lyα luminosities we consider in this paper, are not significantly affected by adopting the lower Vc cut. We neglect any attenuation of the Lyα flux by propagation through the intergalactic medium. This effect would suppress the observed Lyα flux mainly for z  zreion , so this should only affect our results for very high redshifts (z  10). The model used to predict the luminosities and EWs of the Lyα galaxies is identical to that described in Le Delliou et al. (2005, 2006). The Lyα emission is computed by the following procedure.. 1591.

(4) 1592. A. Orsi et al.. Millennium cosmology. In Section 3.2, we assess the completeness of our samples of Lyα emitters due to the finite mass resolution of the Millennium Simulation. 3.1 Comparison of model predictions with observed LFs In this section, we investigate the impact on the model predictions of the choice of cosmological parameters by re-running the model of Le Delliou et al. (2005, 2006), keeping the galaxy formation parameters the same but changing the cosmological parameters to match those used in the Millennium Simulation. To recap, the original Le Delliou et al. (2006) model used m = 0.3,  = 0.7, b = 0.04, σ 8 = 0.93 and h = 0.7. In Fig. 2, we compare  C. the cumulative LFs obtained with GALFORM for the two sets of cosmological parameters with current observational data in the redshift range 3 < z < 7. The observational data are taken from: Kudritzki et al. (2000) (crosses), Cowie & Hu (1998) (asterisks), Gawiser et al. (2007) (diamonds) and Ouchi et al. (2008) (triangles and squares) in the z = 3.3 panel; Ajiki et al. (2003) (pluses), Maier et al. (2003) (asterisks), Hu et al. (2004) (diamonds), Rhoads et al. (2003) (triangles), Shimasaku et al. (2006) (squares) and Ouchi et al. (2008) (crosses) in the z = 5.7 panel; and Taniguchi et al. (2005) (crosses) and Kashikawa et al. (2006) (asterisks and diamonds) in the z = 6.7 panel. At z = 3.3, the two model curves agree very well, and are consistent with the observational data shown. At z = 5.72, the two models do not match as well as in the previous case, but C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation . Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. Figure 1. The spatial distribution of Lyα emitting galaxies (coloured circles) in a slice from the Millennium Simulation, with the dark matter distribution in green. The four panels are for redshifts in the range 0 < z < 8.5, as indicated in each panel. The colour of the circles changes with the Lyα luminosity of the galaxies, as shown in the key in the upper-right corner of the first panel. Only galaxies brighter than log (LLyα [erg s−1 h−2 ]) = 42.2 are plotted. Each image covers a square region 100 × 100 h−1 Mpc2 across and having a depth of 10 h−1 Mpc, which is less than 1000 the volume of the full simulation box..

(5) The clustering of Lyα emitters. 1593. 3.2 The completeness of the Millennium galaxy catalogues. Figure 2. The cumulative LFs of Lyα emitters at redshifts z = 3.3 (top), z = 5.7 (centre) and z = 6.7 (bottom). The blue points correspond to observational data (as indicated by the key with full references in the text). The black and red curves correspond, respectively, to the GALFORM predictions using the cosmological parameters of the Millennium Simulation and those adopted in Le Delliou et al.. The Millennium Simulation has a halo mass resolution limit of 1.72 × 1010 h−1 M . In a standard GALFORM run, a grid of haloes which extends to lower mass haloes than the Millennium resolution is typically used, with M res = 5 × 109 h−1 M at z = 0. A fixed dynamic range in halo mass is adopted in these runs, but with the mass resolution shifting to smaller masses with increasing redshift: for our standard setup, we have M res = 7.8 × 107 and 1.4 × 107 h−1 M at z = 3 and 6, respectively. Therefore, when putting GALFORM galaxies into the Millennium, our sample does not contain galaxies which formed in haloes with masses below the resolution limit of the Millennium. This introduces an incompleteness into our catalogues when compared to the original GALFORM prediction. The incompleteness of the galaxy catalogues is more severe for low luminosity galaxies because they tend to be hosted by low-mass haloes, as will be shown in the next section. Hereafter, we will use N-body sample to refer to the GALFORM galaxies planted in the Millennium haloes, to distinguish them from the pure GALFORM catalogues generated using a grid of haloes masses. In order to quantify the incompleteness of the N-body sample as a function of luminosity, we define the completeness fraction as the ratio of the cumulative LF for the N-body sample to that obtained for a pure GALFORM calculation, and look for the luminosity at which the completeness fraction deviates from unity. The panels of Fig. 3 give different views of the completeness of the N-body samples. The top panel shows the luminosity above which a catalogue can be considered as complete: we define the completeness limit as the luminosity at which the completeness fraction first drops to 0.85. The figure clearly shows how the luminosity corresponding to this completeness limit becomes progressively brighter as we move to higher redshifts. For z > 9, the N-body sample is incomplete at all luminosities plotted. The bottom panel of Fig. 3 shows how the sample becomes more incomplete at any redshift as we consider fainter fluxes. A sample with galaxies brighter than log [F Lyα (erg s−1 cm−2 )] = −19 is less than 70 per cent complete at all redshifts z > 5, while a sample with galaxies brighter than log [F Lyα (erg s−1 cm−2 )] = − 17 is always over 90 per cent complete for z < 9. The completeness fraction monotonically decreases with increasing redshift until z ∼ 6 for very faint fluxes. For z > 6, the completeness rises again: the shape of the bright end of the LF at this redshift is sensitive to the choice of the redshift of reionization. In summary, the requirement that our samples be at least 80 per cent complete restricts the range of validity of the predictions from the Millennium Simulation to redshifts below 9, and fluxes brighter than log [F Lyα (erg s−1 cm−2 )] > −17.5. 4 CLUSTERING PREDICTIONS In this section, we present clustering predictions using Lyα emitters in the full Millennium volume. To study the clustering of galaxies,.  C. C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation . Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. both are still consistent with the observational data. Finally, at z = 6.7, the differences are small and both curves are consistent with observational data. The conclusion from Fig. 2 is that there is not a significant change in the model predictions on using these slightly different values of the cosmological parameters. Furthermore, the observational data are not yet sufficiently accurate to distinguish between the two models or to motivate the introduction of further modifications to improve the level of agreement, such as using a different Lyα escape fraction..

(6) 1594. A. Orsi et al.. we calculate the two-point correlation function, ξ (r), of the galaxy distribution. In order to quantify the evolution of the clustering of galaxies, we measure the correlation function over the redshift interval 0 < z < 9. To calculate ξ (r) in the simulation, we use the standard estimator (e.g. Peebles 1980): 1 + ξ (r) =. DD , (1/2)Ngal n V (r). (1). where DD stands for the number of distinct data pairs with separations in the range r to r + r, n is the mean number density of galaxies, Ngal is the total number of galaxies in the simulation volume and V(r) is the volume of a spherical shell of radius r and thickness r. This estimator is applicable in the case of periodic boundary conditions. In the correlation function analysis, we consider two parameters which help us to understand the clustering behaviour of Lyα galaxies: the correlation length, r0 , and the galaxy bias, b, both of which are discussed below. 4.1 Correlation length evolution A common way to characterize the clustering of galaxies is to fit a power-law to the correlation function:  −γ r , (2) ξ (r) = r0  C. where r0 is the correlation length and γ = 1.8 gives a good fit to the slope of the observed correlation function over a restricted range of pair separations around r0 at z = 0 (e.g. Davis & Peebles 1983). The correlation length can also be defined as the scale where ξ = 1, and quantifies the amplitude of the correlation function when the slope γ is fixed. Fig. 4 shows the correlation function of Lyα emitting galaxies, ξ gal (solid black curves) of the full catalogues down to the completeness limits at each redshift, calculated using equation (1). The red curve shows ξ dm , the correlation function of the dark matter. At z = 0, ξ dm is larger than ξ gal , but for z > 0 ξ dm is increasingly below ξ gal . We will study in detail the comparison of the dark matter and Lyα galaxy correlation functions in Section 4.2. Another notable feature of Fig. 4 is that ξ gal (r) differs considerably from a power law, particularly on scales greater than 10 h−1 Mpc. When fitting equation (2) to the correlation functions plotted in Fig. 4, we use only the measurements in the range [1,10] h−1 Mpc, where ξ gal (r) behaves most like a power law. We fix the slope γ = 1.8 for all ξ gal (r) to allow a comparison between different redshifts, although we note that for z < 5, the slope of ξ gal (r) is closer to γ = 1.6. By using the power-law fit, we can compare the clustering amplitudes of different galaxy samples. To determine the clustering evolution of Lyα emitters, we split the catalogues of Lyα emitters into luminosity bins. For each of these subsamples, we calculate the correlation function and then we obtain r0 by fitting equation (2) as described. Fig. 5 (top) shows the dependence of r0 on C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation . Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. Figure 3. Completeness of the Millennium galaxy catalogues with respect to Lyα luminosity or flux. Top: the minimum luminosity down to which the catalogues are 85 per cent complete. Bottom: the completeness fraction as a function of redshift for a range of fluxes −19 < log [F Lyα (erg s−1 cm−2 )] < −17, as indicated by the key.. Figure 4. The correlation function predicted for Lyα emitters (black solid curve) for a range of redshifts, as indicated in each panel. Lyα emitters are included down to the completeness limit at each redshift shown in Fig 3. The solid red curve shows the correlation function of the dark matter at the same epochs. The blue dashed line shows the power-law fit of equation (2), evaluated in the range 1 < r(Mpc h−1 ) < 10, as delineated by the vertical dashed lines..

(7) The clustering of Lyα emitters. 1595. 4.2 The bias factor of Lyα emitters. Figure 5. Top: the evolution of the correlation length r0 as a function of Lyα luminosity for several redshifts in the range 0 < z < 9, as indicated by the key. The thin solid coloured lines show the errors on the correlation length. Bottom: the evolution of the median mass of haloes which host Lyα emitting galaxies as a function of Lyα luminosity, for the same range of redshifts as above.. luminosity for different redshifts in the range 0 < z < 9. The errors are shown by the area enclosed by the thin solid lines for each set of points, and are calculated as the 90 per cent confidence interval of the χ 2 fit of the correlation functions to equation (2) (ignoring any covariance between pair separation bins). The range of luminosities plotted is set by the completeness limit of the simulation described in the previous section. We also discard galaxy samples with fewer than 500 galaxies, as in such cases, the errors are extremely large and the correlation functions are poorly defined. The clustering in high-redshift surveys of Lyα emitters is sensitive to the flux limit that they are able to reach, as shown by Fig. 5. The model predictions show modest evolution of r0 with redshift for most of the luminosity range studied. Over this redshift interval, on the other hand, the correlation length of the dark matter changes dramatically, as shown by Fig. 4. Typically, at a given redshift, we find that r0 shows little dependence on luminosity until a luminosity of LLyα ∼ 1042 (erg s−1 h−2 ) is reached, brightwards of which there is a strong increase in clustering strength with luminosity. This trend is even more pronounced at higher redshifts. Galaxies at z = 0 are less clustered than galaxies in the range 3 < z < 7, except at luminosities close to LLyα ∼ 1040 (erg s−1 h−2 ). At z = 8.5, r0 increases from r0 ∼ 5 h−1 Mpc at LLyα ∼ 1042 (erg s−1 h−2 ) to r0 ∼ 12 h−1 Mpc at LLyα > 1042.5 (erg s−1 h−2 ). The growth of r0 with limiting luminosity is related to the masses of the haloes which host Lyα galaxies. As shown in the bottom panel  C. The galaxy bias, b, quantifies the strength of the clustering of galaxies compared to the clustering of the dark matter. One way to calculate the bias is by taking the ratio of ξ gal and ξ dm , ξ gal = b2 ξ dm . Both correlation functions are estimated using equation (1). Since the simulation contains 10 billion dark matter particles, a direct pair-count calculation of ξ dm would demand a prohibitively large amount of computer time, so we extract dilute samples of the dark matter particles, selecting randomly ∼107 particles. In this way, we only enlarge the pair-count errors on ξ dm (which nevertheless are still much smaller than for ξ gal ) but obtain the correct amplitude of correlation function itself. To obtain the bias parameter of Lyα emitters as a function of luminosity, we split the full catalogue of galaxies at each redshift into luminosity bins. For each of these bins, we calculate ξ gal and divide by ξ dm to get the square of the bias. Due to non-linearities, the ratio of ξ gal and ξ dm is not constant on all scales. As a reasonable estimation of the bias we chose the mean value over the range 6 < r < 30 h−1 Mpc. Over these scales, the bias does seem to be constant and independent of scale. This range is quite similar to the one used by Gao, Springel & White (2005) to measure the bias parameter of dark matter haloes in the Millennium Simulation. The bias parameter can also be calculated approximately using various analytical formalisms (Mo & White 1996; Sheth et al. 2001; Mandelbaum et al. 2005). These procedures relate the halo bias to σ (m, z), the rms linear mass fluctuation within a sphere which on average contains mass m. The bias factor for galaxies of a given luminosity is then obtained by averaging the halo bias over the haloes hosting these galaxies. Le Delliou et al. (2006) used the analytical expression of Sheth et al. (2001) to calculate the bias parameter for the semi-analytical galaxies. This gives a reasonable approximation to the large-scale halo bias measured in N-body simulations (e.g Angulo, Baugh & Lacey 2008). Fig. 6 shows the bias parameter as a function of luminosity for redshifts in the range 0 < z < 9, and compares the direct calculation using the N-body simulation (solid lines) with the analytical estimation (dashed lines). In order to calculate the uncertainty in our value  of the bias, we assume an error on ξ gal (r) of the form ξgal = 2 (1 + ξgal )/DD (Hewitt 1982; Baugh et al. 1996), and assuming a negligible error in ξ dm we get  1 + b2 ξdm 1 b= , (3) bξdm DD. C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation . Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. of Fig. 5, there is not a simple relation between the median mass of the host halo and the luminosity of Lyα emitters. For a given luminosity, Lyα galaxies tend to be hosted by haloes of smaller masses as we go to higher redshifts. In addition, for redshifts z > 0, there is a trend of more luminous Lyα emitters being found in more massive haloes. The key to explaining the trends in clustering strength is to compare how the effective mass of the haloes which host Lyα emitting galaxies is evolving compared to the typical or characteristic mass in the halo distribution (M ∗ ) (Mo & White 1996); if Lyα emitters tend to be found in haloes more massive than M∗ , then they will be more strongly clustered than the dark matter. This difference between the clustering amplitude of galaxies and mass is explored more in the next section. In a hierarchical model for the growth of structures, haloes more massive than M∗ are more clustered, and thus we expect a strong connection between the evolution of r0 and the masses of the haloes. Fig. 5 shows that the dependence of r0 (and host halo mass) on luminosity becomes stronger at higher redshifts..

(8) 1596. A. Orsi et al.. for the error in the bias estimation. This error is shown in Fig. 6 as the range defined by the thin solid lines surrounding the bias measurement shown by the points. The first notable feature of Fig. 6 is the strong evolution of bias with increasing redshift: from z = 0 to 8.5, the bias factor increases from b(z = 0) ∼ 0.8 to b(z = 8.5) ∼ 12, which means that the clustering amplitude of Lyα emitters at z = 8.5 is over 140 times the clustering amplitude of the dark matter at this redshift. Another interesting prediction is the dependence of bias on Lyα luminosity. For z > 3, there seems to be a strong increase of the bias with luminosity for bins where LLyα > 1042 (erg s−1 h−2 ). The agreement between the analytic calculation of the bias and the simulation result is reasonable over the range 0 < z < 5, but becomes less impressive as higher biases are reached. A similar discrepancy was also noticed by Gao et al. (2005), where they compared the halo bias extracted from the simulation with different analytic formulae (see also Angulo et al. 2008). Another way to describe galaxy clustering is through the HOD (Benson et al. 2000; Cooray & Sheth 2002; Berlind et al. 2003). The HOD gives the mean number of galaxies which meet a particular observational selection as a function of halo mass. For flux-limited samples, the HOD can be broken down into the contribution from central galaxies and satellite galaxies. In a simple picture, the mean number of central galaxies is zero below some threshold halo mass, Mmin , and unity for higher halo masses. With increasing halo mass, a second threshold is reached, Mcrit , above which a halo can also host a satellite galaxy. The number of satellites is usually described by a power-law of slope β. In the simplest case, three parameters are needed to describe the HOD (Berlind & Weinberg 2002; Hamana  C. Figure 7. The HOD of Lyα emitters at z = 3.3 (top) and z = 4.9 (bottom). Each set of points represents a model sample with a different luminosity limit, as given by the key in the upper panel. The dashed line in each panel corresponds to a ‘best’ fit using the Berlind et al. (2003) parametrization.. et al. 2004); more detailed models have been proposed to describe the transition from 0 to 1 galaxy (Berlind et al. 2003). We can compute the HOD directly from our model. The results are shown in Fig. 7, where we plot the HOD at two different redshifts for different luminosity limits. For comparison, we plot the HOD parametrization of Berlind et al. (2003) against our model predictions. In general, this HOD does a reasonable job of describing the model output, and is certainly preferred over a simple three parameter model. However, for the z = 3.3 case (top panel of Fig. 7), the shape of the model HOD for log (M/M ) > 13 is still more complicated than can be accommodated by the Berlind et al. parametrization, showing a flattening in the number of satellites as a function of increasing halo mass. There is less disagreement in the z = 4.9 case (bottom panel), but our model HOD becomes very noisy for large halo masses. 5 M O C K C ATA L O G U E S In this section, we make mock catalogues of Lyα emitters for a selection of surveys. In the previous section, we used the full simulation box to make clustering predictions, exploiting the periodic boundary conditions of the computational volume. The simulation is so large that it can accommodate many volumes equivalent to those sampled by current Lyα surveys, allowing us to examine the fluctuations in the number of emitters and their clustering. The characteristics of the surveys we replicate are listed in Table 1. C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation . Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. Figure 6. The galaxy bias as a function of Lyα luminosity at different redshifts, as indicated by the key. The solid lines show the results from the simulation and the dashed lines show the analytical expression of Sheth et al. (2001). The area enclosed by the thin solid lines shows the error on the bias estimation for each redshift..

(9) The clustering of Lyα emitters. 1597. Table 1. Summary of survey properties and simulation results. (1) Survey. ELVIS. (3) zsimulation. (4) z. (5) Area (arcmin2 ). (6) EW obs (Å). (7) F Lyα (erg s−1 cm−2 ). (8) N obs. (9) N median mock. (10) 10–90 per cent. (11) Cv. 3.1 3.1 3.7 5.7 8.8. 3.06 3.06 3.58 5.72 8.54. 0.04 0.06 0.06 0.10 0.10. 961 3538 3474 3722 ∼3160. 80 328 282 335 100. 1.5 × 10−17 1.1 × 10−17 2.7 × 10−17 7.4 × 10−18 3.7 × 10−18. 162 356 101 401 –. 142 316 80 329 20. 89–207 256–379 60–110 255–407 14–29. 0.41 0.19 0.31 0.23 0.37. Column (1) gives the name of the survey; (2) and (3) show the redshift of the observations and nearest output from the simulations, respectively; (4) shows the redshift width of the survey, based on the full width at half-maximum filter width; (5) shows the area covered by each survey; (6) and (7) show the EW and Lyα flux limits of the samples, respectively; (8) shows the number of galaxies detected in each survey; (9) and (10) show the median of the number of galaxies and the 10-90 percentile range found in the mock catalogues for each survey. Finally, column (11) gives the fractional variation of the number of galaxies, defined in equation (13).. The procedure to build the mock catalogues is the following. (i) We extract a catalogue of galaxies from an output of the Millennium Simulation that matches (as closely as possible) the redshift of a given survey. The simulation output contains 64 snapshots spaced roughly logarithmically in the redshift range [127, 0]. (ii) We choose one of the axes (say, the z-axis) as the line of sight, and we convert it to redshift space, to match what is observed in real surveys. To do this, we replace rz (the comoving space coordinate) with vz (h−1 Mpc), sz = rz + (4) aH(z) where vz is the peculiar velocity along the z-axis, a = 1/(1 + z) and H(z) is the Hubble parameter at redshift z. (iii) We then apply the flux limit of the particular survey, to mimic the selection of galaxies. Table 1 shows the flux limits of the surveys considered. (iv) Then, we extract many mock catalogues using the same geometry as the real survey. We extract slices of a particular depth z (different for each survey), and within each slice we extract as many mock catalogues as possible using the same angular geometry as the real sample. z is determined using the transmission curves of the narrow-band filters used in each survey. To derive the angular sizes we use Dt (θ, z) = dc (z) θ,  z dz c  dc (z) = , H0 0 m (1 + z )3 + . (5) (6). where Dt is the transverse comoving size in h−1 Mpc, dc is the comoving radial distance, c and H0 are the speed of light and the Hubble constant, respectively, m and  are the density parameters of matter and the cosmological constant, respectively. Equation (5) is valid for θ 1 (rad), which is the case for the surveys we analyse in this work. We assume a flat cosmology. (v) From the line-of-sight axis, we invert equation (6) to obtain the redshift distribution of Lyα galaxies within each mock catalogue, converting galaxy position to redshift. This information is then used to take into account the shape of the filter transmission curve for each survey, which controls the minimum flux and EW as a function of redshift. The value given in Table 1 corresponds to the minimum flux and EW obs at the peak of the filter transmission curve. For redshifts at which the transmission is smaller (the tails of the curve), the minimum flux and EW obs required for a Lyα emitter to be included are proportionally bigger. (vi) Finally, we allow for incompleteness in the detection of Lyα emitters at a given flux due to noise in the observed images (where  C. this information is available). To do this, we randomly select a fraction of galaxies in a given Lyα flux bin to match the completeness fraction reported for the survey at that flux. Real surveys of Lyα emitters usually lack detailed information about the position of galaxies along the line of sight. Hence, instead of measuring the spatial correlation function defined in equation (1), it is only possible to estimate the angular correlation function, w(θ), which is the projection on the sky of ξ (r). We estimate w(θ ) from mock catalogues using the following procedure, which closely matches that used in real surveys. To compute the angular correlation function, we use the estimator (Landy & Szalay 1993): wLS (θ ) =. DD(θ) − 2DR(θ ) + RR(θ) , RR(θ ). (7). where DR stands for data-random pairs, RR indicates the number of random–random pairs and all of the pair counts have been appropriately normalized. In the case of a finite volume survey, this estimator is more robust than the one defined in equation (1) because it is less sensitive to errors in the mean density of galaxies, such as could arise from boundary effects. In practice, the measured angular correlation function can be approximated by a power law:  −δ θ , (8) w(θ ) = Aw 1◦ where Aw is the dimensionless amplitude of the correlation function, and δ is related to slope of the spatial correlation function, γ , from equation (2) by δ = γ − 1. A relation between r0 and Aw can be obtained using a generalization of Limber’s equation (Simon 2007). Surveys of Lyα emitters typically cover relatively small areas of sky and can display significant clustering even on the scale of the survey. As a result, the mean galaxy number density within the survey area will typically differ from the cosmic mean value. If the number of galaxies within the survey is used to estimate the mean density, used in equation (7), rather than the unknown true underlying density, this leads to a bias in the estimated correlation function. This effect is known as the integral constraint (IC) bias. Landy & Szalay (1993) show that when their estimator is used, the expected value of the estimated correlation function wLS (θ ) is related to the true correlation function w(θ ) by wLS (θ ) =. w(θ ) − w , 1 + w. where the IC term w is defined as  1 d1 d2 w(θ12 ), w ≡ 2 . C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation . (9). (10). Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. MUSYC SXDS. (2) zsurvey.

(10) 1598. A. Orsi et al.. integrating over the survey area, and is equal to the fractional variance in number density over that area. When the clustering is weak, equation (9) simplifies to wLS (θ )  w(θ ) − w  . This motivates the additive IC correction which is customarily used in practice: (11). We use this to correct the angular correlation functions from our mock catalogues. In order to estimate the term w , we approximate the true correlation function as a power law, as in equation (8), and use  −δ i RRi θi (12) w  Aw  RRi  (Daddi et al. 2000), where RR is the same random pairs as used in the estimate of wLS (θ ). To quantify the sample variance expected for a particular survey, we use the mock catalogues to calculate a coefficient of variance (Cv ), which is a measure of the fractional variation in the number of galaxies found in the mocks N90 − N10 , (13) Cv = 2Nmed where N10 and N90 are the 10 and 90 percentiles of the distribution of the number of galaxies in the mocks, respectively, and N med is the median. The value of Cv allows us to compare the sampling variance between different surveys in a quantitative way. To analyse the clustering in the mock catalogues, we measured the angular correlation function of each mock catalogue using the procedure explained above. Then we fit equation (8) to each of the mock w(θ) and we choose the median value of Aw as the representative power-law fit. We fix the slope of w(θ ) to δ = 0.8 for all surveys, except for ELVIS, where we found that a steeper slope, δ = 1.2, agreed much better with the simulated data. To express the variation in the correlation function amplitude found in the mocks, we calculate the 10 and 90 percentiles of the distribution of Aw for each set of mock surveys. We also calculate w(θ ) using the full transverse extent of the simulation, with the same selection of galaxies as for the real survey. This estimate of w(θ ), which we call the Model w(θ ), represents an ideal measurement of the correlation function without boundary effects (so there is no need for the IC correction). The surveys we mimic are the following: the MUSYC (Gawiser et al. 2007; Gronwall et al. 2007), which is a large sample of Lyα emitting galaxies at z = 3.1; the SXDS (Ouchi et al. 2005, 2008), which covers three redshifts: z = 3.1, 3.7 and 5.7, and finally, we make predictions for the forthcoming ELVIS (Nilsson et al. 2007a,b), which is designed to find Lyα emitting galaxies at z = 8.8. We now describe the properties of the mock catalogues for each of these surveys in turn.. Figure 8. The observed EWobs distribution of the MUSYC at z = 3.1 (solid black line) and the simulation (solid green line).. To test how well the model reproduces the Lyα emitters seen in the MUSYC, we first compare the predicted (green) and measured (black) distributions of Lyα EWs in Fig. 8. Here, the predicted distribution comes from the full simulation volume. Overall, the simulation shows remarkably good agreement with the real data, with a slight underestimation in the range 200 < EW obs (Å) < 400. For EW obs (Å) > 400, both distributions seem to agree well, although the number of detected Lyα emitters in the tail of the distribution is small. For the MUSYC, we built 252 mock catalogues from the Millennium Simulation volume using the procedure outlined above. Fig. 9 shows an example of one of these mock catalogues. Many of the Lyα emitters are found in high dark matter density regions, and thus they are biased tracers of the dark matter. Fig. 10 shows the. 5.1 The MUSYC Survey The MUSYC (Gawiser et al. 2006, 2007; Gronwall et al. 2007; Quadri et al. 2007) is composed of four fields covering a total solid angle of 1 deg2 , each one imaged from the ground in the optical and near-IR. Here we use data from a single MUSYC field consisting of narrow-band observations of Lyα emitters made with the Cerro Toledo Inter-American Observatory (CTIO) 4-m telescope in the Extended Chandra Deep Field South (ECDFS) (Gronwall et al. 2007). The MUSYC field, centred on redshift z = 3.1, contains 162 Lyα emitters in a redshift range of z ∼ 0.04 over a rectangular area of 31 × 31 arcmin2 with flux and EW obs limits described in Table 1.  C. Figure 9. An image of a mock catalogue of the MUSYC of Lyα emitters at z = 3.1. The colour format and legend are the same as used in Fig. 1. The angular size of the image is 31 × 31 arcmin2 . C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation . Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. wcorr (θ ) = wLS (θ ) + w ..

(11) The clustering of Lyα emitters. distribution of the number of galaxies in the ensemble of mocks. The green line shows the number detected in the real survey (162), which falls within the 10–90 percentile range of the mock distribution and is close to the median (142). The 10–90 percentile range spans an interval of 89 < N gal < 207, indicating a large cosmic variance for this survey configuration, with Cv = 0.41. The next step is to compare the clustering in the simulations with the real data. Fig. 11 plots the correlation functions from the mock catalogues alongside that measured in the real survey (Gawiser et al. 2007). There is reasonable agreement between the mock catalogue results and the observed data. The median w(θ ) from the mocks is. Figure 11. Angular clustering for the MUSYC. Green circles show w(θ ) calculated from the observed catalogue (Gawiser et al. 2007). The blue circles show the median w(θ ) from all mock catalogues, corrected for the IC effect. The dark and light grey shaded regions, respectively, show the 68 and 95 per cent ranges of the distribution of w(θ ) measured in the mock catalogues. The red open circles show the Model correlation function, obtained using the width of the entire simulation box (and the same EW, flux and redshift limits). The dashed lines show the power-law fit to the observed w(θ ) (green) and the median fit to w(θ ) from the mock catalogues (blue). The amplitudes Aw of these fits are also given in the figure.  C. slightly higher than the observed values, but the observed w(θ ) is within the range containing 95 per cent of the mock w(θ ) values (i.e. between the 2.5 and 97.5 per cent percentiles, shown by the light grey shaded region). We quantified this difference by fitting the power law of equation (8) to both real and mock data. The power-law fits were made over the angular range 1–10 arcmin. We find the value of Aw (equation 8) for each of the mock catalogue correlation functions by χ 2 -fitting (using the same expression as in Section 4.2 for the error on each model data point) and then we plot the power law corresponding to the median value of Aw . We find Aw = 0.53+1.01 −0.33 for the mocks, where the central value is the median, and the range between the error bars contains 95 per cent of the values from the mocks. For the real data, we find the best-fitting Aw and the 95 per cent confidence interval around it by χ 2 -fitting, using the error bars on the individual data points reported by Gawiser et al. This gives Aw = 0.29 ± 0.17 for the real data. We again see that the observed value is within the 95 per cent range of the mocks, and is thus statistically consistent with the model prediction. We also see that the 95 per cent confidence error bar on the observed Aw is much smaller than the error bar we find from our mocks. This latter discrepancy arises from the small errors quoted on w(θ ) by Gawiser et al. (2007), which are based on the modified Poisson pair count errors, but neglect variations between different sample volumes (i.e. cosmic variance). On the contrary, using our mocks, we are able to take cosmic variance fully into account. This underlines the importance of including the cosmic variance in the error bars on observational data, to avoid rejecting models by mistake. The red open circles in Fig. 11 show the correlation function obtained using the full angular size attainable with the Millennium Simulation but keeping the same flux, EW and redshift limits as in the MUSYC (averaging 7 different slices), and so this measurement has a smaller sample variance. The area used here is ∼120 times bigger than the MUSYC area, so IC effects are negligible on the scales studied here. We refer to this as the Model prediction for w(θ ). The median of the mock correlation functions (including the IC correction, blue circles) is seen to agree reasonably well with the Model correlation function (red open circles) for θ < 20 (arcmin). This shows that for this survey it is possible to obtain an observational estimate of the correlation function which is unbiased over a range of scales, by applying the IC correction. However, on large scales, the median w(θ) of the mocks (with IC correction included) lies above the Model w(θ ), which shows that the IC correction is not perfect, even on average. Presumably this failure is due (at least in part) to the fact that the IC correction procedure assumes that w(θ ) is a power law, while the true w(θ ) departs from a power law on large scales. It is also important to note that these statements only apply to the median w(θ ) derived from the mock samples – the individual mocks show a large scatter around the true w(θ ) (as shown by the grey shading), and the IC correction does not remove this. This scatter rapidly increases at both small and large angular scales, so the best constraints on w(θ) from this survey are for intermediate scales, 1  θ  5 (arcmin).. 5.2 The SXDS Survey The SXDS (Ouchi et al. 2005, 2008; Kashikawa et al. 2006) is a multiwavelength survey covering ∼1.3 deg2 of the sky. The survey is a combination of deep, wide area imaging in the X-ray with XMM–Newton and in the optical with the Subaru Suprime-Cam.. C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation . Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. Figure 10. Histogram of the number of Lyα emitters found in the mock MUSYC catalogues. The red line shows the median, the dashed blue lines show the 10–90 percentile range, and the green line shows the number of galaxies detected in the real survey.. 1599.

(12) 1600. A. Orsi et al..  C. Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. Here, we are interested in the narrow-band observations at three different redshifts: 3.1, 3.6 and 5.7 (Ouchi et al. 2008). We build mock SXDS catalogues following the same procedure as outlined above. Fig. 12 shows examples of our mock catalogues for each redshift. As in the previous case, we see that Lyα emitters on average trace the higher density regions of the dark matter distribution. The real surveys have a well-defined angular size. However, the area sampled is slightly different at each redshift. In order to keep the cross-like shape in our mock catalogues and be consistent with the exact area surveyed, we scaled the cross-like shape to cover the same angular area as the real survey at each redshift. Fig. 13 shows the distribution of the number of galaxies in the mock catalogues for the three redshifts surveyed. The median number of galaxies in the mocks at z = 3.1 is 316, which is remarkably similar to the observed number, 356. The 10–90 percentile range of the mocks covers 256–379 galaxies. The coefficient of variation is Cv = 0.19, less than half the value found for the MUSYC mock catalogues, Cv = 0.41. This reduction is due mainly to the larger area sampled by the SXDS. In the second slice (z = 3.6), the redshift is only slightly higher than in the previous case, but the number of galaxies is much lower. Looking at the top panel of Fig. 2, we see that the observed LFs are basically the same for these two redshifts. The difference between the two samples is explained mostly by the different Lyα flux limits [1.2 × 10−17 (erg s−1 cm−2 ) for z = 3.1 and 2.6 × 10−17 (erg s−1 cm−2 ) for z = 3.6]. For the z = 3.6 mocks, we find a median number of 80 and 10–90 per cent range 60–110, in reasonable agreement with the observed number of galaxies, 101. The fractional variation in the number of galaxies, quantified by Cv = 0.31, is larger than in the previous case, due to the smaller number of galaxies. The z = 5.7 case is similar to the lower redshifts. The median number in the mocks is 329, with 10–90 per cent range 255–407, again consistent with the observed number, 401. The coefficient of variation for this survey is Cv = 0.23, so the sampling variance is intermediate between that for the z = 3.1 and 3.6 surveys. The angular correlation functions of the mock catalogues are compared with the real data in Fig. 14. The observational data shown are preliminary angular correlation function measurements in the three SXDS fields with error bars based on bootstrap resampling (Ouchi et al., in preparation). As in our comparison with the MUSYC, we plot the median correlation function measured from the mocks, after applying the IC correction, as a representative w(θ) . As before, we also perform a χ 2 fit of a power law to the w(θ ) measured in each mock, and to the observed values, to determine the amplitude Aw . The fit is performed over the range 1 < θ < 10 (arcmin) as before. The left-hand panel of Fig. 14 shows the correlation functions at z = 3.1. According to both the error bars on the observational data, and the scatter in w(θ) in the mocks (shown by the grey shading), this survey provides useful constraints on the clustering for 1  θ  10 (arcmin), but not for smaller or larger angles, where the scatter becomes very large. The fitted amplitude Aw for the observed correlation function is Aw = (0.32 ± 0.22) (95 per cent confidence, using the error bars reported by Ouchi et al., in preparation), somewhat below the median value found in the mocks, Aw = 0.60, but within the 95 per cent range for the mocks. Figure 12. Mock catalogues of the SXDS for redshifts 3.1 (top), 3.6 (centre) and 5.7 (bottom). The colour scheme and legend are the same as used previously. The angular size of the image is 1.4◦ × 1.4◦ .. C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation .

(13) The clustering of Lyα emitters. 1601. Figure 14. Angular correlation functions for the mock SXDS catalogues at z = 3.1 (left-hand side), z = 3.6 (centre) and z = 5.7 (right-hand side). The blue circles show the median w(θ ) from the mock catalogues (after applying the IC correction). The dark and light grey shaded regions, respectively, show the 68 and 95 per cent ranges of the distribution of w(θ ) measured in the mock catalogues. The red open circles are the Model w(θ ) calculated using the full simulation width, averaged over many slices. The green circles show the observational data from Ouchi et al. (in preparation). The dashed lines show the power-law fit to the observed w(θ ) (green) and the median fit to w(θ ) from the mock catalogues (blue). The amplitudes Aw of these fits are also given in the figure.. (Aw = 0.23 − 1.35). Based on the mocks, the model correlation function is consistent with the SXDS data at this redshift. Comparing these results with those we found for the MUSYC (which has a very similar redshift and flux limit to SXDS at z = 3.1), we see that the results seem very consistent. The MUSYC has a larger sample variance than SXDS, but the measured clustering amplitude is very similar in the two surveys. The middle panel of Fig. 14 shows the correlation function for the z = 3.6 survey. In this case, the error bars on the observational data and the scatter in the mocks are both larger, due to the lower surface density of galaxies in this sample. For the observed correlation amplitude, we obtain Aw = 0.75 ± 0.72, while for the mocks we find a median Aw = 0.99, with 95 per cent range 0.06–2.01, entirely consistent with the observational data. The right-hand panel of Fig. 14 shows the correlation function predictions for z = 5.7. According to the spread of mock catalogue results, the w(θ ) measured here is the most accurate of the three surveys, due to the large number of galaxies. For the mocks, we find a median correlation amplitude Aw = 0.82, with 95 per cent range 0.42–1.49. For the observations, we find Aw = 1.56 ± 0.27,  C. if we assume a slope δ = 0.8. The average correlation function in the mocks agrees well with this slope over the range fitted, but the observational data for w(θ) at this redshift prefer a flatter slope. The model is however still marginally consistent with the observational data at 95 per cent confidence. Similarly flat shapes were also found in some previous surveys (Hayashino et al. 2004; Shimazaku et al. 2004) in the same field, but at redshifts 3.1 and 4.9, respectively. However, these surveys are much smaller in terms of area surveyed and number of galaxies (this is particularly so in Shimazaku et al. 2004). This behaviour in w(θ ) might be produced by the highdensity regions associated with protoclusters in the SXDS fields (Ouchi et al., in preparation), but still this behaviour of w(θ) must be confirmed to prove that it is a real feature of the correlation function. 5.3 ELVIS Survey One of the main goals of the public surveys planned for the VISTA is to find a significant sample of very high-redshift (z ∼ 8.8) Lyα emitters. This program is called the ELVIS (Nilsson et al. 2007a,b).. C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation . Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. Figure 13. The distribution of the number of galaxies in mock SXDS catalogues, for z = 3.1 (left-hand side), z = 3.6 (centre) and z = 5.7 (right). The red line shows the median of the number of galaxies inside the mock catalogues, the blue lines show the 10–90 percentiles of the distribution, and the green line shows the number observed in the SXDS..

(14) 1602. A. Orsi et al.. The plan for ELVIS is to image four strips of 11.6 × 68.27 arcmun2 , covering a total area of 0.878 deg2 , as shown in Fig. 15. This configuration is dictated by the layout of the VISTA IR camera array. The only current detections of Lyα emitters at z > 8 are those of Stark et al. (2007), which have not yet been independently confirmed. Lyα emitting galaxies at such redshifts will provide us with valuable insights into the reionization epoch of the Universe, as well as galaxy formation and evolution. For our mock ELVIS catalogues, we select galaxies with a minimum flux of F Lyα = 3.7 × 10−18 (erg s−1 cm−2 ) and EW obs > 100 Å, as listed in Table 1. (The EW obs limit is just a rough estimate, although our predictions should not be sensitive to the exact value chosen.) Fig. 15 shows the expected spatial distribution of z = 8.5 galaxies in one of the ELVIS mock catalogues. Each mock catalogue has four strips, matching the configuration planned for the real survey. The median number of galaxies within the mock catalogues is 20, with a 10–90 percentile spread of 14–29 galaxies, as shown in Fig. 16. The fractional variation in number between. Figure 16. Histogram of the number of galaxies in mock catalogues expected for the ELVIS. The red line shows the median of the distribution, and the blue dashed lines 10–90 percentiles of the distribution.  C. Figure 17. Angular correlation functions in the mock catalogues of the ELVIS. The blue circles show the median w(θ ) from the mock catalogues, while the orange circles show the mean. The dark and light grey shaded regions, respectively, show the 68 and 95 per cent ranges of the distribution of w(θ ) in the mocks. The red open circles show the Model w(θ ) obtained using the full width of the simulation box. The amplitude and slope of the median power-law fit to the mocks are also given.. different mocks is Cv = 0.37, which is quite large, but no worse than for the MUSYC at z = 3.1, even though that survey has 10 times as many galaxies. The angular correlation functions of the mock ELVIS catalogues were calculated in the same way as before (including the IC correction). Fig. 17 shows the median of the w(θ ) values measured from each mock catalogue (blue circles), and also the mean (orange circles). In this case, the distribution of w(θ ) values within each angle bin is very skewed, due to the small number of galaxies in the mock catalogues, and so the mean and median can differ significantly. The dark and light grey shaded regions show the ranges containing 68 and 95 per cent of the w(θ ) values from the mocks, from which it can be seen that the cosmic variance for this survey is very large. We also show the Model w(θ ) (red circles), which provides our best estimate of the true correlation function based on the Millennium Simulation, and was calculated by averaging over 10 slices of the simulation, using the full width of the simulation box. Even here, the error bars on w(θ ) are quite large, due to the very low number density of galaxies predicted. We see that the mean and median w(θ ) in the mocks lie close to the Model values for 2 (arcmin) < θ < 20 (arcmin), so in this sense they provide an unbiased estimate. The most noticeable feature of Fig. 17 is the large area covered by both the 68 and 95 per cent ranges of the distribution of w(θ ) in the mocks, which extend down to w(θ ) = 0. This indicates that the ELVIS will only be able to put a weak upper limit on the clustering amplitude of z ∼ 8.8 Lyα emitters, if our model is correct. As before, we can quantify this by fitting a power law to w(θ ) in our mocks. We notice that the Model w(θ ) for this sample has a significantly steeper slope, δ = 1.2, than the canonical value δ = 0.8, and so we do our fits to the mocks using δ = 1.2. We find a median amplitude in the mocks Aw = 3.57+12.0 −33.5 , where the error bars give the 95 per cent range. 6 S U M M A RY A N D C O N C L U S I O N S We have combined a semi-analytical model of galaxy formation with a high resolution, large volume N-body simulation to make C 2008 RAS, MNRAS 391, 1589–1604 2008 The Authors. Journal compilation . Downloaded from https://academic.oup.com/mnras/article-abstract/391/4/1589/1746848 by Pontificia Universidad Catolica de Chile user on 12 April 2019. Figure 15. An example of a mock catalogue for the ELVIS. The image shows the observed field of view (four strips). The legend and colour format are the same as in Figs 9 and 12..

Referencias

Documento similar

In addition, precise distance determinations to Local Group galaxies enable the calibration of cosmological distance determination methods, such as supernovae,

Astrometric and photometric star cata- logues derived from the ESA HIPPARCOS Space Astrometry Mission.

The photometry of the 236 238 objects detected in the reference images was grouped into the reference catalog (Table 3) 5 , which contains the object identifier, the right

as a function of the parent galaxy stellar mass are shown in Fig. 12, in three bins of the family classes of bars. In Buta et al. It appears that for the strong bars the inner,

In the context of galaxy formation, two fundamental properties are given by the underlying dark matter structure: the distribution of their masses at a certain redshift as the halo

To do that, we have introduced, for both the case of trivial and non-trivial ’t Hooft non-abelian flux, the back- ground symmetric gauge: the gauge in which the stable SU (N )

Many and different are the techniques employed to obtain nanowires and the choice of one or other is crucial for the properties of the final products, since the growth

An example of projected stellar mass density for the disk, spheroid, and galaxy stellar particles is given in Figure 2 for G-1536, as well as the position of the radial cut we use