4. ORIENTACIONES PARA UNA VIVENCIA EMOCIONAL COMPARTIDA A lo largo de estas páginas hemos ido haciendo un recorrido por los meses de
4.3 Ha llegado el momento
We then generated data using 30 latent variables with 100 % of active weights. Data was simulated for 800 samples and 1600 features and the latent variables explained 75% of the
E.1 Identifiability of the latent structure for sparse and dense factors 175
total variability (due to the higher density of the weights).
Again, we compared three models: a first model with no spike-and-slab prior; a second model with spike-and-slab priors on the weights and a third model with both spike-and-slab priors on the weights and on the factors. Like in Section E.1.1, the three models were fitted three times with random initialisations of the latent variables in order to assess the robustness of the inference process. Correlation of the inferred weights with the simulated weights showed that none of the models could identify the true latent weights (Fig. E.4). Across our three random trials, inference was only reproducible for the model with spike-and-slab priors on both weights and factors (Fig. E.4), although the inferred weights were in any case rotated compared to the true simulated weights.
Fig. E.3 Correlation of the simulated weights with weights inferred with a model without any spike-and-slab prior (left), with spike-and-slab priors on the weights (middle) and with both spike-and-slab priors on the weights and on the factors (right). The generative latent factors were all dense (100% of active weights).
Taken together, these results confirm the usefulness of sparsity-inducing priors for the identification of sparse latent factors, while showing that dense latent factors pose in any case an identifiability problem.
Fig. E.4 Robustness of weights inference for a model without any spike-and-slab prior (left), with spike-and-slab priors on the weights (middle) and with both spike-and-slab priors on the weights and on the factors (right). Each heat map shows the correlation matrix between all inferred weights for all three separate runs, so that off-diagonal blocks correspond to correlation plots between runs. The generative latent factors were all dense (100% of active weights).
References
Achim, K., Pettit, J.-B., Saraiva, L. R., Gavriouchkina, D., Larsson, T., Arendt, D., and Marioni, J. C. (2015). High-throughput spatial mapping of single-cell RNA-seq data to tissue of origin. Nat. Biotechnol., 33(5):503–509.
Ahmed, S., Rattray, M., and Boukouvalas, A. (2017). GrandPrix: Scaling up the Bayesian GPLVM for single-cell data. bioRxiv, page 227843.
Aichler, M. and Walch, A. (2015). MALDI imaging mass spectrometry: current frontiers and perspectives in pathology research and practice. Lab. Invest., 95(4):422–431.
Akerfelt, M., Morimoto, R. I., and Sistonen, L. (2010). Heat shock factors: integrators of cell stress, development and lifespan. Nat. Rev. Mol. Cell Biol., 11(8):545–555.
AM, F., K, F., LM, L., W, C., and RH, S. (2003). Visualization of single molecules of mrna in situ. Nat. Methods, 361:245–304.
Amari, S.-I. (1998). Natural gradient works efficiently in learning. Neural Comput., 10(2):251–276.
Androulakis, I. P., Yang, E., and Almon, R. R. (2007). Analysis of time-series gene expression data: methods, challenges, and opportunities. Annu. Rev. Biomed. Eng., 9:205–228. Angelo, M., Bendall, S. C., Finck, R., Hale, M. B., Hitzman, C., Borowsky, A. D., Levenson,
R. M., Lowe, J. B., Liu, S. D., Zhao, S., Natkunam, Y., and Nolan, G. P. (2014). Multiplexed ion beam imaging of human breast tumors. Nat. Med., 20(4):436–442.
Angermueller, C., Clark, S. J., Lee, H. J., Macaulay, I. C., Teng, M. J., Hu, T. X., Krueger, F., Smallwood, S. A., Ponting, C. P., Voet, T., Kelsey, G., Stegle, O., and Reik, W. (2016). Parallel single-cell sequencing links transcriptional and epigenetic heterogeneity. Nat. Methods, 13:229.
Angulo, M. C., Kozlov, A. S., Charpak, S., and Audinat, E. (2004). Glutamate released from glial cells synchronizes neuronal activity in the hippocampus. J. Neurosci., 24(31):6920– 6927.
Anscombe, F. J. (1948). The transformation of poisson, binomial and Negative-Binomial data. Biometrika, 35(3-4):246–254.
Argelaguet, R., Velten, B., Arnol, D., Dietrich, S., Zenz, T., Marioni, J. C., Buettner, F., Huber, W., and Stegle, O. (2018). Multi-Omics factor analysis—a framework for unsupervised integration of multi-omics data sets. Mol. Syst. Biol., 14(6):e8124.
Arnold, S. J. and Robertson, E. J. (2009). Making a commitment: cell lineage allocation and axis patterning in the early mouse embryo. Nat. Rev. Mol. Cell Biol., 10(2):91–103. Asa-aki Sato, M. (2001). Online model selection based on the Variational. Neural Comput.,
13:1649–1681.
Banerji, S., Ni, J., Wang, S. X., Clasper, S., Su, J., Tammi, R., Jones, M., and Jackson, D. G. (1999). LYVE-1, a new homologue of the CD44 glycoprotein, is a lymph-specific receptor for hyaluronan. J. Cell Biol., 144(4):789–801.
Bangdiwala, S. I. (1989). The wald statistic in proportional hazards hypothesis testing. Biom. J., 31(2):203–211.
Bartholomew, D. J. (1985). Foundations of factor analysis: Some practical implications. Br. J. Math. Stat. Psychol., 38(1):1–10.
Battich, N., Stoeger, T., and Pelkmans, L. (2013). Image-based transcriptomics in thousands of single human cells at single-molecule resolution. Nat. Methods, 10(11):1127–1133. Battich, N., Stoeger, T., and Pelkmans, L. (2015). Control of transcript variability in single
mammalian cells. Cell, 163(7):1596–1610.
Baud, A., Mulligan, M. K., Casale, F. P., Ingels, J. F., Bohl, C. J., Callebert, J., Launay, J.-M., Krohn, J., Legarra, A., Williams, R. W., and Stegle, O. (2017). Genetic variation in the social environment contributes to health and disease. PLoS Genet., 13(1):e1006498. Beal, M. J. and Others (2003). Variational algorithms for approximate Bayesian inference.
university of London London.
Benjamini, Y. and Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Series B Stat. Methodol., 57(1):289–300.
Bishop, C. M. (2006). Pattern Recognition and Machine Learning (Information Science and Statistics). Springer-Verlag, Berlin, Heidelberg.
Biswas, S. (2008). Introduction to coding theory: basic codes and shannon’s theorem. [online] http://www.math.uchicago.edu/~may/VIGRE/VIGRE2008/REUPapers/Biswas.pdf. Blei, D. M., Kucukelbir, A., and McAuliffe, J. D. (2016). Variational inference: A review
for statisticians.
Bochner, S. (1959). Lectures on Fourier integrals. Princeton University Press.
Bodenmiller, B. (2016). Multiplexed Epitope-Based tissue imaging for discovery and healthcare applications. Cell Syst, 2(4):225–238.
Bonnans, J. F., Gilbert, J. C., Lemaréchal, C., and Sagastizábal, C. A. (2006). Numerical Optimization: Theoretical and Practical Aspects (Universitext). Springer-Verlag, Berlin, Heidelberg.