Subcategoría – Interacciones en el aula y participación de los estudiantes

5. Análisis de resultados

5.2. Categoría Competencia argumentativa

5.2.2 Subcategoría – Interacciones en el aula y participación de los estudiantes

1: Initialize M=0d×d and H(i)=0n1×mfor i=1, . . . ,d.

2: %calculate and pre-store the univariate KDE 3: for k=1, . . . ,d do 4: for k′=1, . . . ,m do 5: bp(x(_kk′))← 1 n1_s_∈D1

∑

1 h1 K X (s) k −x (k′) k h1 ! 6: for k′=1, . . . ,m do

7: %calculate the components used for the bivariate KDE 8: for i′=1, . . . ,n1do 9: for i=1, . . . ,d do 10: H(i)(i′,k′)← 1 h2 K X i′ i −x (k′) i h2 !

11: %calculate the mutual information matrix 12: forℓ′=1, . . . ,m do 13: for i=1, . . . ,d−1 do 14: for j=i+1, . . . ,d do 15: bp(x_i(k′),x(_jℓ′))←0 16: for i′=1, . . . ,n1do 17: pb(x(_ik′),x(_jℓ′))←p_b(x(_ik′),x(_jℓ′)) +H(i)(i′,k′₎_·_H(j)₍_i′_{, ℓ}′₎ 18: pb(x_i(k′),x(_jℓ′))←_bp(x(_ik′),x(_jℓ′))/n1 19: M(i,j)←M(i,j) + 1 m2pb(x (k′) i ,x (ℓ′₎ j )·log b p(x_i(k′),x(_jℓ′))/(p_b(x(_ik′))·p_b(x(_jℓ′)))

evaluate the kernel density estimates on

D

₁. The space complexity for the Chow-Liu algorithm is

O(d2), and thus the total space complexity is O(d2+m2n1).

The quadratic time and space complexity in the number of variables d is acceptable for many practical applications but can be prohibitive when the dimension d is large. The main bottleneck is to calculate the empirical mutual information matrix M. Due to the use of the kernel density estimate, the time complexity is O(d2m2n1). The straightforward implementation in Algorithm 1 is conceptually easy but computationally inefficient, due to many redundant operations. For example, in the nested for loop, many components of the bivariate and univariate kernel density estimates are repeatedly evaluated. In Algorithm 6, we suggest an alternative method which can significantly reduce such redundancy at the price of increased but still affordable space complexity.

The main technique used in Algorithm 6 is to change the order of the multiple nested for loops, combined with some pre-calculation. This algorithm can significantly boost the empirical perfor- mance, although the worst case time complexity remains the same. An alternative suggested by Bach and Jordan (2003) is to approximate the mutual information, although this would require fur- ther analysis and justification.

References

Martin Aigner and G¨nter Ziegler. Proofs from THE BOOK. Springer-Verlag, 1998.

Francis R. Bach and Michael I. Jordan. Beyond independent components: Trees and clusters.

Journal of Machine Learning Research, 4:1205–1233, 2003.

Onureena Banerjee, Laurent El Ghaoui, and Alexandre d’Aspremont. Model selection through sparse maximum likelihood estimation. Journal of Machine Learning Research, 9:485–516, March 2008.

Arthur Cayley. A theorem on trees. Quart. J. Math., 23:376–378, 1889.

Anton Chechetka and Carlos Guestrin. Efficient principled learning of thin junction trees. In In Ad-

vances in Neural Information Processing Systems (NIPS), Vancouver, Canada, December 2007.

C. K. Chow. and C. N. Liu. Approximating discrete probability distributions with dependence trees.

IEEE Transactions on Information Theory, 14(3):462–467, 1968.

Jianqing Fan and Ir`ene Gijbels. Local Polynomial Modelling and Its Applications. Chapman and Hall, 1996.

Jerome H. Friedman, Trevor Hastie, and Robert Tibshirani. Sparse inverse covariance estimation with the graphical lasso. Biostatistics, 9(3):432–441, 2007.

Michael Garey and David Johnson. Computers and Intractability: A Guide to the Theory of NP-

Completeness. W. H. Freeman, 1979. ISBN 0-7167-1044-7.

Evarist Giné and Armell Guillou. Rates of strong uniform consistency for multivariate kernel density estimators. Annales de l’institut Henri Poincaré (B), Probabilités et Statistiques, 38:907–921, 2002.

Dirk Hausmann, Bernhard Korte, and Tom Jenkyns. Worst case analysis of greedy type algorithms for independence systems. Math. Programming Studies, 12:120–131, 1980.

Joseph B. Kruskal. On the shortest spanning subtree of a graph and the traveling salesman problem.

Proceedings of the American Mathematical Society, 7(1):48–50, 1956.

Steffen L. Lauritzen. Graphical Models. Clarendon Press, 1996.

Han Liu, John Lafferty, and Larry Wasserman. The nonparanormal: Semiparametric estimation of high dimensional undirected graphs. Journal of Machine Learning Research, 10:2295–2328, October 2009.

Joseph A. Lukes. Efficient algorithm for the partitioning of trees. IBM Jour. of Res. and Dev., 18 (3):274, 1974.

Nicolai Meinshausen and Peter B¨uhlmann. High dimensional graphs and variable selection with the Lasso. The Annals of Statistics, 34:1436–1462, 2006.

Renuka Nayak, Michael Kearns, Richard Spielman, and Vivian Cheung. Coexpression network based on natural variation in human gene expression reveals gene interactions and functions.

Genome Research, 19:1953–1962, 2009.

Deborah Nolan and David Pollard. U-processes: Rates of convergence. The Annals of Statistics, 15 (2):780 – 799, 1987.

Philippe Rigollet and R´egis Vert. Optimal rates for plug-in estimators of density level sets.

Bernoulli, 15(4):1154–1178, 2009.

Alessandro Rinaldo and Larry Wasserman. Generalized density clustering. Ann. Statist., 38(5): 2678–2722, 2010.

Mohit Singh and Lap Chi Lau. Approximating minimum bounded degree spanning trees to within one of optimal. In STOC’07—Proceedings of the 39th Annual ACM Symposium on Theory of

Computing, pages 661–670. ACM, New York, 2007.

Vincent Tan, Animashree Anandkumar, and A. Willsky. Learning Gaussian tree models: Analysis of error exponents and extremal structures. IEEE Trans. on Signal Processing, 58(5):2701–2714, 2010.

Vincent Tan, Animashree Anandkumar, Lang Tong, and Alan Willsky. A large-deviation analysis for the maximum likelihood learning of tree structures. IEEE Trans. on Info. Theory, 57(3): 1714–1735, 2011.

Alexandre B. Tsybakov. Introduction to Nonparametric Estimation. Springer Publishing Company, Incorporated, 2008.

Anja Wille, Philip Zimmermann, Eva Vranová, Andreas Fürholz, Oliver Laule, Stefan Bleuler, Lars Hennig, Amela Prelić, Peter von Rohr, Lothar Thiele, Eckart Zitzler, Wilhelm Gruissem, and Peter Bühlmann. Sparse Gaussian graphical modelling of the isoprenoid gene network in

In document Secuencias didácticas como elemento de transformación de las prácticas pedagógicas para fortalecer la competencia argumentativa (página 129-137)