• No se han encontrado resultados

6 1 MARCO TEÓRICO

6.1.8 Infiltración en los suelos.

As discussed in each component of this thesis, extensions and development of the proposed and adapted methods can be followed. Indeed, the development of Bayesian modelling approaches for prediction of patient survival presents many possibilities for future work. As discussed in Chapter 4, it may be worthwhile to pursue the Reversible Jump Markov Chain Monte Carlo (RJMCMC), proposed by Green (1995) and described in a general context by Lunn et al. (2009) as an alter- native approach to post-analysis model comparison and BMA. It may be possible to apply this model comparison by transforming the Weibull model to the extreme value model.

The other research area that is not featured in this thesis but presents opportuni- ties for future work is the use of random censoring (Liu, 2012, Miller, 1998) in the simulation study to generate the data in Chapter 5. This censoring mechanism is also commonly encountered in practice and could have been explored to investigate the impact of different censoring mechanisms on mixture model estimation.

The research in Chapter 6 indicated that BMA approach enables us to provide more precise predictions of patient survival. However, this study only involved three candidate models. More models can be obviously included in the analysis. This

7.2. FUTURE WORK 139

study has also focused on the marginal likelihood p(D | Qs) estimation methods

based on the Laplace approximation. However, other approaches are also possible. Indeed marginal likelihood estimation is possible using nested sampling (Skilling, 2006), where the marginal likelihood is viewed as the expectation, with respect to the prior, of the likelihood. The other approach is Chib’s method (Chib and Greenberg, 1995), that presented a generic method which can be applied to output from the Gibbs sampler. Applying BMA to other datasets, other applications where robust prediction is desired.

References

K. Abrams, D. Ashby, and D. Errington. A Bayesian approach to Weibull survival models: Application to cancer clinical trial. Lifetime Data Analysis, 2:159–174, 1996.

A. O. M. Ahmed, S. Hadeel, Al-Kutubi, and N. A. Ibrahim. Comparison of the Bayesian and maximum likelihood estimation for Weibull distribution. Journal of Mathematics and Statistics, 6(2):100–104, 2010.

H. Akaike. Information theory and extension of the maximum likelihood principle. In B. N. Petrov and F. Csaki. Second International Symposium on Information Theory (pp. 267-281), 1973.

A. A. Alizadeh, M. B. Eisen, and R. E. Davis. Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature, 403:503–511, 2000. A. A. Alizadeh, A. J. Gentles, A. J. Alencar, C. L. Liu, H. E. Kohrt, R. Houot, M. J.

Goldstein, S. Zhao, Y. Natkunam, R. H. Advani, R. D. Gascoyne, J. Briones, R. J. Tibshirani, J. H. Myklebust, S. K. Plevritis, I. S. Lossos, and R.Levy. Prediction of survival in diffuse large B-cell lymphoma based on the expression of 2 genes reflecting tumor and microenvironment. Blood., 118(5):1350–1358, 2011.

P. K. Andersen. Survival analysis 1982-1991: The second decade of the

proportional hazards regression model. Statistics in Medicine, 10:1931–1941, 1991.

R. D. Angelis, R. Cappocaccia, T. Hakulinen, B. Soderman, and A. Verdeccia. Mixture models for cancer survival analysis: Application to population based data with covariates. Biostatistics, 8(3):576–594, 2007.

A. Annest, R. E. Bumgarner, A. E. Raftery, and K. E. Yeung. Iterative Bayesian model averaging: A method for the application of survival analysis to high dimensional microarray data. BMC Bioinformatics, 10(72):1471–2105, 2009. C. C. Australia. Lymphoma, 2013a. [Online; accessed 10-May-2013].

L. Australia. What is lymphoma, 2013b. [Online; accessed 10-May-2013].

E. Bair and R. Tibshirani. Semisupervised methods to predict patient survival from gene expression data. PLOS Biology, 2:511–522, 2004.

P. Baldi and A. D. Long. A Bayesian framework for the analysis of microarray expression data: Regularised t-test and statistical inferences of gene expression changes. Bioinformatics, 17:509–519, 2001.

S. Basu and R. C. Tiwari. Breast cancer survival, competing risk and mixture cure model: A Bayesian analysis. Journal of the Royal Statistical Society, 173(2): 307–329, 2010.

S. Bea, L. Yang, A. Lopez-Guillermo, W. C. Chan, T. C. Greiner, D. D. Weisenburger, J. O. Armitage, R. D. Gascoyne, J. M. Connors, T. M. Grogan, R. Braziel, R. I. Fisher, E. B. Smeland, S. Kvaloy, H. Holte, J. Delabie, R. Simon, J. Powell, W. H. Wilson, E. S. Jaffe, E. Montserrat, H. K. Muller-Hermelink, L. M. Staudt, E. Campo, A. Rosenwald, and for the Lymphoma/Leukemia

Molecular Profiling Project. Diffuse large B-cell lymphoma subgroups

have distinct genetic profiles that influence tumor biology and improve gene expression based survival prediction. Blood, 106(9):3183–3190, 2005.

REFERENCES 143

D. Beer, S. Kardia, C. Huang, T. Giordano, A. Levin, D. Misek, L. Lin, G. Chen, T. Gharib, D. Thomas, M. Lizyness, R. Kuick, S. Hayasaka, J. Taylor, M. Iannettoni, M. Orringer, and S. Hanash. Gene expression profiles predict survival of patients with lung adenocarcinoma. Nature Medicine, 8:816–824, 2002.

J. Berger. Statistical Decision Theory and Bayesian Analysis. Springer, Verlag, 2nd edition, 1985.

J. Berger. Robust Bayesian analysis: Sensitivity to the prior. Journal of Statistical Planning and Inference, 25:303–328, 1990.

J. O. Berger and J. M. Bernardo. Estimating a product of means: Bayesian analysis with reference priors. Journal of the American Statistical Association, 84(405): 200–207, 1989.

J. Berkson and R. P. Gage. Survival curve for cancer patients following treatment. Journal of the American Statistical Association, 47(259):501–515, 1952.

J. M. Bernando and A. F. M. Smith. Bayesian Modeling. John Wiley & Sons, 1994. J. M. Bernardo. Noninformative priors do not exist. Journal of Statistical Planning

and Inference, 65:159–189, 1997.

J. Besag, E. Green, D.Higdon, and K. Mengersen. Bayesian computation and

stochastic systems. Statistical Science, 10(1):3–41, 1995.

A. Bhadra and B. K. Mallick. Joint high dimensional Bayesian variable and

covariance selection with an application to eQTL analysis. Biometrics, 69:447– 457, 2013.

J. W. Boag. Maximum likelihood estimates of the proportion of patients cured by cancer therapy. Journal of the Royal Statistical Society Series B, 11:15–44, 1949.

W. M. Bolstad. Understanding Computational Bayesian Statistics. John Wiley and Sons Inc, New Jersey, 2010.

P. Bonato, V. Baladandayuthapani, B. M. Broom, E. P. Sulman, K. D. Aldape, and K. A. Do. Bayesian ensemble methods for survival predictions in gene expression data. Bioinformatics, 27(3):359–367, 2011.

K. W. Broman and T. P. Speed. A model selection approach for identification of quantitative trait loci in experimental crosses. Journal of Royal Statistical Society, Series B, 64:641–656, 2002.

S. Buckland, K. Burnham, and N. Augustin. Model selection: An integral part of inference. Biometrics, 53:603–618, 1997.

K. Burnham and D. Anderson. Multimodel inference: Understanding AIC and BIC in model selection. Sociological Methods Research, 33:261–304, 2004.

K. P. Burnham and D. R. Anderson. Model Selection and Multimodel Inference: A Practical Information Theoretic Approach. Springer-Verlag, New York, 2002. B. Cai and R. Meyer. Bayesian semiparametric modelling of survival data based on

mixtures of b-spline distributions. Computational Statistics and Data Analysis, 55:1260–1272, 2011.

V. G. Cancho, J. Rodrigues, and M. de Castro. A flexible model for survival data with a cure rate: A Bayesian approach. Journal of Applied Statistics, 38(1): 57–70, 2011.

V. G. Cancho, M. de Castro, and J. Rodrigues. A Bayesian analysis of the Conway- Maxwell-Poisson cure rate model. Statistics Papers, 53:165–176, 2012.

REFERENCES 145

mixture posterior distributions. Journal of the American Statistical Association, 95:957–970, 2000.

M. H. Chen, J. G. Ibrahim, and D. Sinha. A new Bayesian model for survival data with a surviving fraction. Journal of American Statistical Association, 94: 909–919, 1999.

M. H. Chen, D. P. Harrington, and J. G. Ibrahim. Bayesian cure rate models for malignant melanoma: A case study of Eastern cooperative oncology group trial E1690. Journal of Applied Statistics, 51(2):135–150, 2002.

W. C. Chen, B. M. Hill, J. B. Greenhouse, and J. V. Fayos. Bayesian analysis of survival curves for cancer patients following treatment. In: J. M. Bernardo, M. H. Degroot, D. V. Lindley and A. F. M. Smith. Bayesian Statistics 2, pages 299–328, 1985.

S. Chib and E. Greenberg. Understanding the Metropolis Hastings algorithm.

American Statistician, 4:327–335, 1995.

G. Claeskens and N. L. Hjort. Model Selection and Model Averaging. Cambridge University Press, 2008.

M. Clyde. Model uncertainty and health effect studies for particulate matter.

Environmetrics, 11:745–763, 2000.

D. Collet. Modelling Survival Data in Medical Research. Chapman and Hall, 1st edition, 1994.

P. Congdon. Applied Bayesian Modeling. John Wiley & Sons, 2003.

C. Conigliani, J. I. Castro, and A. O’hagan. Bayesian assessment of goodness of fit against nonparametric alternatives. The Canadian Journal of Statistics, 28(2): 327–342, 2000.

F. Corbiere and P. Joly. A SAS macro for parametric and semiparametric mixture cure models. Computer Methods and Programs in Biomedicine, 85(2):173–180, 2007.

D. R. Cox and D. Oakes. Analysis of Survival Data. Chapman and Hall, 1972. P. Damien, J. Wakefield, and S. Walker. Gibbs sampling for Bayesian non-conjugate

and hierarchical models by using auxiliary variables. Journal of Royal Statistical Society, Series B, 61(2):331–344, 1999.

S. Datta and J. Le-Rademacher. Predicting patient survival from microarray

data by accelerated failure time modeling using partial least squares and lasso. Bioinformatics, 63:259–271, 2007.

A. P. Dempster, N. M. Laird, and D. M. Rubin. Maximum likelihood from

incomplete data via EM algorithm (with discussion). Journal of the Royal

Statistical Society: Series B, 39:1–38, 1977.

J. Diebolt and C. P. Robert. Estimation of finite mixture distributions through

Bayesian sampling. Journal of the Royal Statistical Society: Series B, 56:363– 375, 1994.

B. Dodson. Weibull Analysis. American Society Quality, Milwaukee, 1994.

D. Draper. Assessment and propagation of model uncertainty. Journal of the Royal Statistical Society Series B, 57:45–97, 1995.

F. Drouet. Non-hodgkin’s lymphomas. Cancer radiothrapie, 14(1):210, 2007. L. Duchateau and P. Janssen. The Frailty Model. Springer, 2008.

REFERENCES 147

A. Excellence. Protein synthesis. ‘http://www.accessexcellence.org/, 2013. [Online; accessed 05-May-2013].

A. Farcomeni and A. Nardi. A two-component Weibull mixture to model early and late mortality in a Bayesian framework. Computational Statistics and Data Analysis, 54:416–428, 2010.

V. T. Farewell. Mixtures models in survival analysis: Are they worth the risk? Canadian Journal of Statistics, 14:257–262, 1986.

N. Friel and A. N. Pettitt. Marginal likelihood estimation via power posteriors. Journal of the Royal Statistical Society: Series B, 70:589–607, 2008.

S. Fruhwirth-Schnatter. Markov Chain Monte Carlo estimation of classical and dynamic switching and mixture models. Journal of the American Statistical Association, 96:194–209, 2000.

A. Gelman. A Bayesian formulation of exploratory data analysis and goodness of fit testing. International Statistical Review, 71(2):369–382, 2003.

A. Gelman. Objections to Bayesian statistics. Bayesian Analysis, 3(3):445–450, 2008.

A. Gelman, J. B. Carlin, H. S. Stern, and D. B. Rubin. Bayesian Data Analysis. Chapman and Hall, 2004.

S. Geman and D. Geman. Stochastic relaxation, gibbs distribution and the Bayesian restoration of images. Pattern Analysis and Machine Intelligence, 6:721–742, 1984.

W. R. Gilks, S. Richardson, and D. J. Spiegelhalter. Markov Chain Monte Carlo in Practice. Chapman and Hall, 1996.

A. I. Goldman. Estimation in a proportional hazards cure model. Biometrics, 56: 227–336, 2000.

R. Gottardo. Statistical analysis of microarray data: A Bayesian approach.

Biostatistics, 4:597–620, 2003.

P. J. Green. Reversible jump Markov Chain Monte Carlo computation and Bayesian model determination. Biometrika, 82(4):711–732, 1995.

R. Griffiths, M. Gleeson, K. Knopf, and M. Danese. Racial differences in treatment and survival in older patients with diffuse large B-cell lymphoma (dlbcl). BMC Cancer, 10(625), 2010.

S. Y. Grodzenskii and V. G. Domrachev. Estimating of the parameters of a

mixture of Exponential and Weibull distributions with progressive censoring. Measurement Techniques, 45(11):1115–1118, 2002.

Y. Gu, D. Sinha, and S. Banerjee. Analysis of cure rate survival data under

proportional odds model. Life Time Data Analysis, 17(1):123–134, 2011.

A. Gupta, B. Mukherjee, and S. K. Upadhyay. Weibull extension model: A Bayes study using Markov Chain Monte Carlo simulation. Reliability Engineering and System Safety, 93(10):1434–1443, 2008.

T. E. Hanson. Modeling censored lifetime data using a mixture of gammas baseline. Bayesian Analysis, 1(3):575–594, 2006.

W. K. Hastings. Monte carlo sampling methods using Markov Chains and their applications. Biometrika, 1:97–109, 1970.

REFERENCES 149

G. B. Hazen and M. Huang. Parametric sensitivity analysis using large sample approximate Bayesian posterior distributions. Decision Analysis, 3(4):208–219, 2006.

N. L. Hjort and G. Claeskens. Frequentist model average estimators (with

discussion). Journal of the American Statistical Association, 98:879–899, 2003. J. Hoeting, D. Madigan, A. E. Raftery, and C. Volinsky. Bayesian model averaging:

A tutorial. Statistical Science, 14:382–417, 1999.

D. W. Hosmer and S. Lemeshow. Applied Survival Analysis: Regression Modeling of Time to Event Data. John Wiley and Sons, 1999.

P. Hougaard. Frailty models for survival data. Lifetime Data Analysis., 1(3):255– 273, 1995.

P. Hougaard. Analysis of Multivariate Survival Data. Springer-Verlag, 2000. J. Z. Huang, W. G. Sanger, and T. C. Greiner. The t(14;18) defines an unique subset

of diffuse large B-cell lymphoma with a germinal center B-cell gene expression profile. Blood, 99:2285–2290, 2002.

K. E. Hunt and K. K. Reichard. Diffuse large B-cell lymphoma. Archives of

Pathology and Laboratory Medicine, 132:118–124, 2008.

J. G. Ibrahim and P. W. Laud. On Bayesian analysis of general linear models using jeffreys prior. Journal of the American Statistical Association, 86:981–986, 1991. J. G. Ibrahim, M. H. Chen, and D. Sinha. Bayesian semiparametric models for

survival data with a cure fraction. Biometrics, 57:383–388, 2001a.

J. G. Ibrahim, M. H. Chen, and D. Sinha. Bayesian Survival Analysis. Springer, 2001b.

J. G. Ibrahim, M. H. Chen, and R. J. Gray. Bayesian models for gene expression with DNA microarray data. Journal of American Statistical Association, 97:88– 99, 2002.

J. G. Ibrahim, M. H. Chen, and S. Kim. Bayesian variable selection for the Cox regression model with missing covariates. Life Time Data Analysis, 14:496–520, 2008.

N. H. G. R. Institute. Gene expression. http://www.genome.gov/

glossary, 2013. [Online; accessed 05-May-2013].

C. H. Jackson, S. G. Thompson, and L. D. Sharples. Accounting for uncertainty in health economic decision models by using model averaging. Journal of the royal statistical society Series A, 172(2):383–404, 2009.

A. Jasra, C. C. Holmes, and D. A. Stephens. Markov Chain Monte Carlo methods

and the label switching problem in Bayesian mixture modeling. Statistical

Science, 20(1):50–67, 2005.

N. L. Johnson, S. Kotz, and N. Balakrishnan. Continuous Univariate Distributions. John Wiley and Sons, 1995.

L. Kaderali, T. Zander, U. Faigle, J. Wolf, J. L. Schultze, and R. Schrader. CASPAR: A hierarchical Bayesian approach to predict survival times in cancer from gene expression data. Bioinformatics, 22(12):1495–1502, 2006.

J. D. Kalbfleisch and R. L. Prentice. The Statistical Analysis of Failure Time Data. Wiley & Sons, New Jersey, 2nd edition, 2002.

M. P. Kaminskiy and V. V. Krivtsov. A simple procedure for Bayesian estimation of the Weibull distribution. IEEE Transactions On Reliability, 54(4):612–616, 2005.

REFERENCES 151

E. L. Kaplan and P. Meier. Nonparametric estimation from incomplete observations. Journal of American Statistical Association, 53:457–481, 1958.

J. F. Kenney and E. S. Keeping. Root mean square. In J.F. Kenney and E.S. Keeping (eds.). Mathematics of Statistics (3rd ed., pp. 59-60), 1962.

S. Kim, M. H. Chen, D. K. Dey, and D. Gamerman. Bayesian dynamic models for survival data with a cure fraction. Lifetime Data Analysis, 13:17–35, 2007. S. W. Kim and J. G. Ibrahim. On Bayesian inference for proportional hazards

models using noninformative priors. Life Time Data Analysis, 6(4):331–341, 2000.

S. Kima and D. K. Dey. Modeling multilevel survival data using frailty models. Communications in Statistics - Theory and Methods., 37(11):1734–1741, 2008. D. G. Kleinbaum and M. Klein. Survival analysis: A Self-learning Text. Springer,

2005.

P. Kostoulas, S. S. Nielsen, W. J. Browne, and L. Leontides. A Bayesian Weibull

survival model for time to infection data measured with delay. Preventive

Veterinary Medicine, 94:191–201, 2010.

D. E. Krane and M. L. Raymer. Fundamental Concepts of Bioinformatics. Benjamin Cummings: San Fransisco, 2003.

D. Kundu. Bayesian inference and life testing plan for the Weibull distribution in presence of pregressive censoring. Technometrics, 50(2):144–154, 2008.

S. W. Lagakos. General right censoring and its impact on the analysis of survival data. Biometrics, 35(1):139–156, 1979.

J. F. Lawless. Statistical Models and Methods for Lifetime Data. Wiley & Sons, New Jersey, 2nd edition, 2002.

K. E. Lee. Bayesian Models for DNA Microarray Data Analysis. PhD thesis, Texas A and M University, USA, 2004.

K. E. Lee and B. K. Mallick. Bayesian methods for variable selection in survival models with application to DNA microarray data. The Indian Journal of statistics, 66(4):756–778, 2004.

G. Lenza, G. W. Wrightb, N. T. Emrea, H. Kohlhammera, S. S. Davea, R. E. Davisa, S. Cartya, L. T. Lama, A. L. Shaffera, W. Xiaoc, J. Powellc, A. Rosenwald, G. Ott, H. K. Muller-Hermelink, R. D. Gascoynef, J. M. Connorsf, E. Campog, E. S. Jaffeh, J. Delabiei, E. B. Smelandj, L. M. Rimsza, R. I. Fisherm, D. D. Weisenburgero, W. C. Chano, and L. M. Staudta. Molecular subtypes of diffuse large B-cell lymphoma arise by distinct genetic pathways. Proceedings of the National Academy of Sciences of the United States of America, 105(36):13520– 13525, 2008.

K. M. Leung and R. M. Elashoff. Censoring issues in survival analysis. Annual Review of Public Health, 18:83–104, 1997.

A. Lewin, S. Richardson, C. Marshall, A. Glazier, and T. Aitman. Bayesian

modelling of differential gene expression. Biometrics, 62:10–18, 2005.

L. Lexin. Survival prediction of diffuse large B-cell lymphoma based on both clinical and gene expression information. Bioinformatics., 22(4):466–471, 2006. H. Li and J. Gui. Partial cox regression analysis for high dimensional microarray

gene expression data. Bioinformatics, 20(1):208–215, 2004.

X. Liu. Survival Analysis Models and Applications. Wiley & Sons, New Jersey, 1nd edition, 2012.

I. S. Lossos, C. D. Jones, R. Warnke, Y. Natkunam, H. Kaizer, J. L. Zehnder, R. Tibshirani, and R. Levy. Expression of a single gene, bcl-6, strongly predicts

REFERENCES 153

survival in patients with diffuse large B-cell lymphoma. Blood., 98(4):945–951, 2001.

I. S. Lossos, D. K. Czerwinski, A. A. Alizadeh, M. A. Wechser, R. Tibshirani, D. Botstein, and R. Levy. Prediction of survival in diffuse large B-cell lymphoma based on the expression of six genes. The New England Journal of Medicine., 350(18):1828–1837, 2004.

Y. Lu, W. Lemon, P. Liu, Y. Yi, C. Morrison, P. Yang, Z. Sun, J. Szoke, W. Gerald, M. Watson, and R. G. aand M. You. A gene expression signature predicts survival of patients with stage i non-small cell lung cancer. PLOS Medicine, 3:2229–2243, 2006.

D. J. Lunn, A. Thomas, N. Best, and D. Spiegelhalter. WinBUGS – A Bayesian

modelling framework: Concepts, structure, and extensibility. Statistics and

Computing, 10:325–337, 2000.

D. J. Lunn, N. Best, and J.Whittaker. Generic reversible jump MCMC using

graphical models. Statistics and Computing, 19:395–408, 2009.

D. Madigan and A. E. Raftery. Model selection and accounting for model

uncertainty in graphical models using occam’s window. Journal of American Statistical Association, 89:1535–1546, 1994.

R. A. Maller and X. Zhou. Survival Analysis with Long-term Survivors. Wiley, New York, 1996.

J. M. Marin and C. P. Robert. Bayesian Core. Springer Science: New York, 2007. J. M. Marin, M. R. Bernal, and M. P. Wiper. Using Weibull mixture distributions

to model hererogeneous survival. Communication in Statistics Simulation and Computation, 34(3):673–684, 2005a.

J. M. Marin, K. Mengersen, and C. P. Robert. Bayesian Modelling and Inference on Mixtures of Distributions. Handbook of Statistics 25, D. Dey and C.R. Rao (eds). Elsevier-Sciences, 2005b.

C. A. McGrory and D. M. Titterington. Variational approximations in Bayesian model selection for finite mixture distributions. Computational Statistics and Data Analysis, 51(11):5352–5367, 2007.

K. Mengersen, C. P. Robert, and M. Titterington. Mixture: Estimation and

Applications. Wiley, 2011.

R. G. Miller. Survival Analysis. Wiley & Sons, New Jersey, 2nd edition, 1998. A. Mira and L. Tierney. Efficiency and convergence properties of slice samplers.