• No se han encontrado resultados

5. ESPECIFICACIONES Y DISEÑO DEL SOFTWARE DEL SISTEMA

5.1 PANTALLA

DNN3describes the neural network that maps a combination of 1,000 mRNA and 1,000

CNA features to PAM subtypes with five possible values: LumA, LumB, HER2-enriched, basal-like, and normal-like. The best average accuracy (75.99%) over five folds was achieved with the network’s structure 2000-256-32-2, 100 epochs, and batch size 64. Due to the poor time performance of the decompositional baseline, DeepRED, on the first modelDNN1, we present the results for modelDNN3only for our algorithm and

the pedagogical C5.0 baseline.

Table 18 shows the accuracy of the neural network, whose activation values on the training set were used for rule extraction with our algorithm, along with the accuracy and fidelity of the rule sets, generated with our algorithm and pedagogical C5.0. Note that due to the high number of rules in fold 5, we were not able to calculate their accu- racy and fidelity in a reasonable time (annotated with *). The neural network’s accuracy is 73.66% as this is a harder task than classifying for the two classes of ER expression. With the acquired results, pedagogical C5.0 generates more accurate rules, but our al- gorithm has slightly higher fidelity. Table 19 shows that the pedagogical C5.0 algorithm extracts very comprehensible rules. In the average rule set, there were twelve rules, each consisting of less than four conditions. Our algorithm extracts more than four million rules per rule set with the average of 16.11 conditions per rule, which does not seem very comprehensible, but it could be improved with further pruning of the rule set. As we reported in Section 4.1, with large rule sets there is a big proportion of rules that are never applicable, so these could be removed from the rule set. Further studies are needed on how to approach further processing of extracted rules to extract compre-

hensible, but still accurate rules with a high level of fidelity. Table 20 gives the required time (in minutes) and used memory (in megabytes). Pedagogical C5.0 does not seem very affected by more input features and outputs. Our rule extraction algorithm uses, on average, almost 55 minutes to extract a rule set. However, there is a high variance in time used across folds. The minimum time needed was about 23 minutes, while the maximum was a bit over 2.5 hours.

Accuracy [%] Fidelity [%]

Fold NN ped C5.0 our alg. ped C5.0 our alg.

1 78.73 65.57 60.76 70.89 65.57 2 73.42 62.28 65.32 64.81 71.65 3 72.66 63.04 60.51 65.82 65.57 4 68.86 61.01 54.94 62.03 61.77 5 74.62 59.64 * 62.94 * Average 73.66 62.31 60.38 65.30 66,14

Table 18: Accuracy and fidelity for modelDNN3.

Rule set size Average rule length

Fold ped C5.0 our alg. ped C5.0 our alg.

1 12 1,134,997 3.33 18.10 2 10 3,472,408 3.30 16.08 3 14 850,138 3.43 12.01 4 16 1,084,973 3.56 16.67 5 10 13,549,053 3.30 17.69 Average 12 4,018,313.8 3.38 16.11

Table 19: Comprehensibility for modelDNN3.

Time [min] Memory [MB]

Fold ped C5.0 our alg. ped C5.0 our alg.

1 0.34 28.78 0.20 22.75 2 0.30 41.95 0.20 21.54 3 0.32 24.90 0.20 24.04 4 0.32 22.77 0.20 18.39 5 0.31 154.82 0.20 24.65 Average 0.32 54.64 0.20 22.27

Table 20: Time and memory consumption for modelDNN3.

Our algorithm extracts rules from deep neural networks, which have conditions on the features in the input layer and map to the output layer. The algorithm takes into ac- count the whole structure of a neural network as it splits the network into neuron levels and aggregates the results obtained from each neuron to represent the neural network

as a whole. With these simple and understandable IF-THEN rules that approximate the behaviour of the neural network, the algorithm makes the network more interpretable and explains the network’s decision, which was the main goal of this thesis.

References

[1] D. Amodei et al., Deep speech 2: End-to-end speech recognition in English and Mandarin, in: Proceedings of the 33rd International Conference on International Conference on Machine Learning - Volume 48, ICML’16, JMLR.org, New York, NY, USA, 2016, p. 173–182.

[2] R. Andrews, J. Diederich and A. Tickle,Survey and critique of techniques for ex- tracting rules from trained artificial neural networks, Knowledge-Based Systems6 (1995) 373–389.

[3] M. Augasta and T. Kathirvalavakumar,Reverse engineering the neural networks for rule extraction in classification problems, Neural Processing Letters35(2011) 131– 150.

[4] M. Baudis,Genomic imbalances in 5918 malignant epithelial tumors: An explo- rative meta-analysis of chromosomal CGH data, BMC cancer7(2007).

[5] E. S. Berner,Clinical Decision Support Systems Theory and Practice, Springer, 3rd ed., 2016.

[6] R documentation for C5.0, [accessed 7. 12. 2019], available at https://www. rdocumentation.org/packages/C50.

[7] A. C. Calderon and S. Thorne,Neural Networks for Medical Applications, ITNOW 60(1) (2018) 46–49.

[8] C. Chi, L. Murphy and P. Hu,Recurrent copy number alterations in young women with breast cancer, Oncotarget9(2018) 11541–11558.

[9] K. Cho et al.,On the properties of neural machine translation: Encoder-decoder approaches, in: Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation, Association for Computational Linguis- tics, Doha, Qatar, 2014, pp. 103–111.

[10] P. Clark and T. Niblett, The CN2 induction algorithm, Machine Learning 3(4) (1989) 261–283.

[11] M. W. Craven and J. W. Shavlik, Extracting tree-structured representations of trained networks, in: Proceedings of the 8th International Conference on Neural Information Processing Systems, MIT Press, Cambridge, MA, USA, 1995, p. 24–30.

[12] C. Curtis et al.,The genomic and transcriptomic architecture of 2,000 breast tumors reveals novel subgroups, Nature486(7403) (2012) 346–352.

[13] DeepRED’s code, [accessed 9. 4. 2020], available at https://github.com/ camgbus/DeepRED.

[14] J. Fürnkranz, D. Gamberger and N. Lavraˇc, Foundations of Rule Learning, Springer, Berlin, 2012.

[15] A. Géron, Hands-On Machine Learning with Scikit-Learn and TensorFlow: Con- cepts, Tools, and Techniques to Build Intelligent Systems, O’Reilly Media, Se- bastopol, CA, 1st ed., 2017.

[16] L. H. Gilpin et al.,Explaining explanations: An overview of interpretability of ma- chine learning, in: 5th IEEE International Conference on Data Science and Ad- vanced Analytics, DSAA, IEEE, 2018, pp. 80–89.

[17] C. González, E. Loza Mencía and J. Fürnkranz,Re-training deep neural networks to facilitate boolean concept extraction, in: Discovery Science (eds. A. Yamamoto et al.), Springer International Publishing, Cham, 2017, pp. 127–143.

[18] I. Goodfellow, Y. Bengio and A. Courville,Deep Learning, MIT Press, 2016,http: //www.deeplearningbook.org.

[19] Python library Guppy 3, [accessed 9. 7. 2020], available athttps://pypi.org/ project/guppy3.

[20] T. Hailesilassie,Rule extraction algorithm for deep neural networks: A review, IJC- SIS14(7) (2016) 376–381.

[21] J. Han, M. Kamber and J. Pei,Data mining concepts and techniques, third edition, Morgan Kaufmann Publishers, Waltham, USA, 2012.

[22] G. Hinton et al., Deep neural networks for acoustic modeling in speech recogni- tion: The shared views of four research groups, Signal Processing Magazine, IEEE 29(2012) 82–97.

[23] Keras, [accessed 2. 6. 2020], available athttps://keras.io.

[24] M. Kuhn and K. Johnson, Classification Trees and Rule-Based Models, Springer, New York, NY, chap. 14, 2013, pp. 369–413.

[25] G. Litjens et al.,A survey on deep learning in medical image analysis, Medical Im- age Analysis42(2017) 60–88.

[26] S. V. Liu et al.,Neoadjuvant therapy for breast cancer, Journal of Surgical Oncology 101(4) (2010) 283–291.

[27] F. Lumachi et al., Treatment of estrogen receptor-positive breast cancer, Current medicinal chemistry20(5) (2013) 596–604.

[28] I. Lungu and A. Pîrjan,Research issues concerning algorithms used for optimizing the data mining process, Romanian Economic Business Review4(2010) 108–125. [29] Metabric, [accessed 17. 2. 2020], available at https://molonc.bccrc.ca/

aparicio-lab/research/metabric.

[30] S.-E. Olsson et al., Neural networks - a diagnostic tool in acute myocardial in- farction with concomitant left bundle branch block, Clinical physiology and func- tional imaging22(2002) 295–299.

[31] Adam optimizer, [accessed 2. 6. 2020], available at https://keras.io/api/ optimizers.

[32] J. Parker et al., Supervised risk predictor of breast cancer based on intrinsic sub- types, Journal of Clinical Oncology27(8) (2009) 1160–1167.

[33] J. R. Quinlan,Induction of decision trees, Machine Learning1(1) (1986) 81–106. [34] J. R. Quinlan,C4.5: Programs for machine learning, Morgan Kaufmann Publishers

Inc., San Francisco, CA, USA, 1993.

[35] R. Rathinasamy and L. Raj,Comparative analysis of C4.5 and C5.0 algorithms on crop pest data, International Journal of Innovative Research in Computer and Communication Engineering5(1) (2017) 50–58.

[36] M. Ruano and A. Ruano, On the use of artificial neural networks for biomedical applications, Advances in Intelligent Systems and Computing195(2012) 433–451. [37] S. Russell and P. Norvig,Artificial Intelligence: A Modern Approach, Prentice Hall

Press, USA, 3rd ed., 2009.

[38] E. W. Saad and D. C. Wunsch,Neural network explanation using inversion, Neural Netw.20(1) (2007) 78–93.

[39] T. N. Sainath et al.,Deep convolutional neural networks for LVCSR, in: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013, pp. 8614–8618.

[40] M. Sato and H. Tsukimoto,Rule extraction from neural networks via decision tree induction, in: IJCNN’01. International Joint Conference on Neural Networks. Pro- ceedings (Cat. No.01CH37222),3, 2001, pp. 1870–1875.

[41] R. Setiono and W. K. Leow,Fernn: An algorithm for fast extraction of rules from neural networks, Applied Intelligence12(1–2) (2000) 15–25.

[42] D. Silver et al.,Mastering the game of Go with deep neural networks and tree search, Nature529(2016) 484–489.

[43] N. Spielberg et al., Neural network vehicle models for high-performance auto- mated driving, Science Robotics4(28) (2019).

[44] D. Stuart and W. R. Sellers,Linking somatic genetic alterations in cancer to thera- peutics, Current opinion in cell biology21(2) (2009) 304—310.

[45] D. Tang, B. Qin and T. Liu,Document modeling with gated recurrent neural net- work for sentiment classification, in: EMNLP, The Association for Computational Linguistics, 2015, pp. 1422–1432.

[46] B. Taylor and M. Darrah,Rule extraction as a formal method for the verification and validation of neural networks,5, 2005, pp. 2915 – 2920 vol. 5.

[47] Y. Tian et al., Deeptest: Automated testing of deep-neural-network-driven au- tonomous cars, in: 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE), 2018, pp. 303–314.

[48] H. Tsukimoto,Extracting rules from trained neural networks, IEEE Transactions on Neural Networks11(2) (2000) 377–389.

[49] R. Verhaak and G. Mills,Regulation of mRNA expression in breast cancer - a cis- tematic trans-action, Breast cancer research : BCR14(5) (2012) 322.

[50] B. Weir et al.,Characterizing the cancer genome in lung adenocarcinoma, Nature 450(7171) (2007) 893–898.

[51] C. Xiao, E. Choi and J. Sun,Opportunities and challenges in developing deep learn- ing models using electronic health records data: A systematic review, Journal of the American Medical Informatics Association25(10) (2018) 1419–1428.

[52] J. Zilke, E. Mencía and F. Janssen,DeepRED – rule extraction from deep neural net- works, in: International Conference on Discovery Science, Springer, Berlin, 2016, pp. 457–473.

[53] J. R. Zilke,Extracting rules from deep neural networks, Master’s thesis, TU Darm- stadt, Knowledge Engineering Group, 2015.

Documento similar