B) OBLIGACIÓN TRIBUTARIA
B.1 EL HECHO GENERADOR.
Since there have been research showing that the face images taken under different lighting conditions and postures may reside on a nonlinear submanifold Chang et al. (2003b) Lee et al. (2003b) Roweis and Saul (2000b) Roweis et al. (2002b), a model that can capture such char- acteristic is necessary for more accurate face recognition. Although recognitions by Eigenfaces or Fisherfaces provide sufficient results on different databases, these two methods focus on the global structures of training sets by exploring the data structures in terms of Euclidean dis- tances between images, which could be misleading if a more accurate distance defined on the nonlinear manifold is available. A model that attempts to preserve the local information and incorporates it into the global view may provide us a better face recognition algorithm without knowing the exact nonlinear submanifold. The face recognition algorithm by Laplacianfaces proposed in He et al. (2005) utilizes the local data structures of a given training set, extracts features of the latter by an optimization that preserves local similarities, then projects the testing images onto a lower dimensional subspace. This process is then followed by clustering or classification methods depending on the users’ preferences or goals. The detailed procedures are described as follows.
Recognition by Laplacianfaces is based on the Locality Preserving Projection (LPP) He and Niyogi (2002) which learns a lower dimensional subspace while preserving the intrinsic geometry of the given dataset. Let the given dataset be X = {xi}ni=1, then the LPP procedure
is realized through the optimization of the following: minX
ij
(yi− yj)2Sij (5.39)
a high dimensional space. Sij is the similarity measure between xi and xj, and should be one
that preserves local data structure. One possibility is to define Sij as follows:
Sij = exp (−||xi− xj||2/t), if ||xi− xj|| < 0 else (5.40)
where is chosen by the user and should sufficiently small to capture the local properties.
Or Sij can be defined using k nearest neighborhood:
Sij = exp (−||xi− xj||2/t), if xi ∈ Nk(xj) or xj ∈ Nk(xi) 0 else (5.41)
where Nk(xi) = {x ∈ X | x is among the k nearest neighbors of xi}. k is decided by the user
and usually the nearest neighbors are found by using Euclidean distances.
Since the minimization given in (5.39) searches for new representation yi for xi (i =
1, 2, · · · , n), it penalizes yi and yj for being far apart if the similarity Sij is large. That is,
the similarity between xi and xj leads to the similarity between yi and yj. Thus, the local
structure is preserved through an appropriately defined similarity matrix S.
Assume that yi is obtained by performing a linear operation on xi, that is , yi = wTxi.
The following computation shows the relationship between (5.39) and the Graph Laplacian: 1 2 X ij (yi− yj)Sij = 1 2 X ij (wTxi− wTxj)2Sij =X ij wTxiSijxTi w − wTxiSijxTjw =X i wTxiDiixTiw − wTXSXTw = wTX(D − S)XTw = wTXLXTw
where the degree matrix D is a diagonal matrix with Dii=PjSij, i = 1, 2, · · · , n. L = D − S
is the Graph Laplacian matrix. Since Dii can be considered as a measure of “importance” of
the data point xi, a constraint can be posed to the minimization as follows:
yTDy = 1 ⇒ wTXDXTw = 1 (5.42) Thus the minimization can be rewritten as
arg min
wTXDXTww
TXLXTw (5.43)
And the minimizer w can be found as the eigenvector that corresponds to the smallest eigen- value of the following generalized eigenvalue problem
XLXTw = λXDXTw (5.44) The Graph Laplacian can be considered as the discrete approximation of the Laplace- Beltrami operator that is defined on the nonlinear submanifold, and the eigenvectors of the former are approximations of the eigenfunctions of the latter under some conditions as dis- cussed in the previous chapters. Since the eigenfunctions of the Laplace Beltrami operator can capture the structure of the manifold that the dataset resides on, so can the eigenvectors of the Graph Laplacian in the approximation sense.
The above described LPP is a general method for manifold learning by approximating the eigenfunctions of the Laplace Beltrami operator. To derive a lower dimensional representation (of dimensionality k) of the given dataset, we can use the first k eigenvectors of (5.44) that correspond to the smallest k eigenvalues. Then these eigenvectors form the basis for the desired lower dimensional subspace. Although LPP gives a linear subspace as the result of dimension- ality reduction, it is a locally topology-preserving mapping and is able to encode the local data structure on the nonlinear manifold in the linear subspace.
LPP can be used in the dimensionality reduction process of the face recognition algorithm. But one problem of solving (5.44) directly is that the matrix XDXT is usually singular He et al.
(2005). Thus, a PCA process is applied first to the training set in order to solve this problem, and at the same time, reduce the noise. We can denote such a procedure using the mapping x → WP CAT x, where WP CA is the projection matrix with the first several significant compo-
nents of XTX as its columns. Then, both XTLX and XDTX are symmetric and positive semi-definite, and the solutions of (5.44) can be denoted as w0, w1, · · · , wk−1 corresponding to
eigenvalues 0 ≤ λ0 ≤ λ1≤ · · · ≤ λk−1.
In all, the mapping between the original training set X to the lower dimensional subspace that is derived from PCA and LPP is:
x → y = WTx (5.45)
W = WP CAWLP P (5.46)
where WLP P = [w0, w1, · · · , wk−1]. The columns of W are called Laplacianfaces.
The recognition algorithm by Laplacianfaces are applied on the Yale dataset and the CMU PIE face database respectively, compared with the algorithms associated with Eigenfaces and Fisherfaces He et al. (2005). Both of the datasets consist of images taken for different individual under varying pose, illumination conditions, and facial expressions. Part of the dataset is used as the training set for both datasets, and Laplacianfaces, Eigenfaces, Fisherfaces are used in the dimensionality reduction procedures, respectively. For each dataset, the best recognition results obtained from these three procedures are compared. It is shown in He et al. (2005) that the Laplacianfaces achieves better recognition result compared to the other two methods when they all reach their best performances on a range of values of the dimension k.