2 ESPECIFICACIONES GENERALES PARA REDES ELECTRICAS
2.1. Cámaras y Ducterías
3.1.4. Construcción de las Canalizaciones Telefónicas
Gradient-Based Feature Distributions
Ojala et al. [1996] compared the joint distribution histogram of the gradient magnitudes and directions computed using the Sobel operators [Sobel, 1990] from a texture image, namely, GMAG/GDIR termed as “GMAGGDIRSOBEL” in this thesis , with other feature sets. Local derivatives are computed firstly. Gradient magnitudes and directions are then calculated from these data. Finally, a joint distribution histogram is extracted from the gradient magnitudes and directions.
65
Stage 1 The computation of gradient magnitudes and directions is regarded as the first stage. Both gradient magnitude and direction matrices are higher order statistics. The maximal spatial extents exploited by the Sobel operators are 3×3 pixels.
Stage 2 The histogram accumulation is considered as the second stage. The joint distri- bution histogram extracted in this stage is a 2nd-order statistic.
Local Binary Patterns (LBP)
Wang and He [1990] originally introduced the concept of “texture unit” and used the co-occurrence of the distribution of texture units computed in neighbourhoods as the texture spectrum. Ojala et al. [1996] further proposed its two-stage version (referred to as “LBPBASIC”), i.e. local binary patterns (LBP). In nature, it uses a mask-based filter- ing scheme firstly and then generates a histogram by thresholding response matrices. LBP with a circular neighbourhood is defined as:
∑ ( )
, (3.33)
where corresponds to the grey value of the central pixel in the neighbourhood and stands for the grey values of equally spaced pixels on a circle of the radius R (R > 0) . In addition, { is applied. Furthermore, the idea of “uniform” was suggested [Ojala et al., 2002b] and the grey-scale and rotation invariant description (“LBPRIU2”) was proposed as
{∑ ( ) ( )
, (3.34)
where ( ) | | ∑ | ( ) ( )|.
However, does not exploit other local texture characteristics, e.g. contrast. The
performance of can be further enhanced via combining it with one rotation in-
variant variance measure (“VAR”, see Equation 3.35). Hence, the joint distribu- tion, i.e.
⁄ (“LBPRIU2 & VAR”), was introduced. In addition, multi- resolution ,
and ⁄ were also proposed.
66
Recently, Ahonen et al. [2009] extracted another set of LBP-based features, i.e. LBPHF, using discrete Fourier transform (DFT). The LBPHF features are invariant to the rota- tion of one image. Besides, Ahonen and Pietikäinen [2009] also developed a local de- rivative filters based LBP, i.e. LBPDF.
Stage 1 The computation of various local binary patterns is conducted in the first stage. The response matrices obtained using LBPDF, the LBP maps derived using LBPBASIC, LBPRIU2 and LBPHF, and the local variance matrix calculated using VAR are consid- ered as higher order statistics. The maximal spatial extents employed by LBPBASIC, LBPDF, LBPRIU2, LBPHF and VAR in this stage are 3×3, 3×3, 5×5, 5×5 and 5×5 (cir- cular neighbourhood with a radius of 2) pixels, in sequence.
Stage 2 The accumulation of histograms is regarded as the second stage. The 1D histo- gram and 2D joint distribution histogram are 1st- and 2nd-order statistics respectively.
Varma and Zisserman Textons (VZ-Textons)
Varma and Zisserman [2005] improved the 3D texton-based features proposed by Leung and Malik [2001]. The similar filter bank (see Figure 3.7) was used but only the maximal of the responses obtained using Gaussian derivative filters at each direction and the responses of two isotropic filters, i.e. maximal response sets (MR8), were used. Then K-means was applied on a number of images of each texture. The centroids ob- tained from the images of each texture were concatenated into a global textons diction- ary which is different from the local textons dictionaries constructed by Leung and Ma- lik [2001]. However, histograms are obtained in the similar way for both approaches. The texton-based method proposed by Varma and Zisserman [2005] is well-known as “VZ-MR8”. Furthermore, when image patches were used to extract textons instead of filter responses, three sets of features referred to as “VZ-NEIGHBOURHOOD”, “VZ- JOINT” and “VZ-MRF” were introduced [Varma and Zisserman, 2009] respectively. Stage 1 The extraction of filtering responses or local image exemplars is performed in the first stage. Both the filtering responses and image exemplars can be regarded as higher order statistics. The maximal spatial extents exploited by VZ-MR8, VZ- NEIGHBOURHOOD, VZ-JOINT and VZ-MRF are 49×49, 19×19, 19×19 and 19×19 pixels, respectively.
Stage 2 The accumulation of texton histograms is conducted in the second stage. The 1D histogram used by VZ-MR8, VZ-NEIGHBOURHOOD and VZ-JOINT is a 1st-
67
order statistic. Regarding the 2D joint distribution histogram utilised by VZ-MRF, it is the co-occurrence matrix of the central pixel and texton labels of neighbouring pixels in each local neighbourhood in essence. As a result, it is a 2nd-order statistic.
Local Phase Quantisation
In the original local phase quantisation (LPQ) [Ojansivu and Heikkila, 2008], the phase is considered in neighbourhoods centred at of one image . By using a short-term Fourier transform, local phase spectra are obtained ( denotes the frequency). Furthermore, local Fourier coefficients are calculated at four frequency val- ues: , , , and , and a vector
(3.36) is obtained at each pixel position. The phase information in the Fourier coefficients is derived via { where is the i-th component of { } { } . Finally, eight binary coefficients are converted into inte- ger values in the range of [0, 255] by the quantisation ∑
and a
256-bin histogram is accumulated from these values. In addition, a rotation invariant local phase quantisation (RI-LPQ) method [Ojansivu et al., 2008] is also developed in order to obtain rotation invariance.
Stage 1 The local phase is computed in the first stage. The phase information is a higher order statistic. The maximal spatial extent used in this stage is 9×9 pixels.
Stage 2 The generation of the histogram is regarded as the second stage. The histogram is a 1st-order statistic.
Local Derivative Patterns
The nth-order local derivative pattern (LDP) [Zhang et al., 2010] is used to encode gra- dient changes in local neighbourhoods of and is defined as
{ ( ) ( ) ( )} ,
(3.37) where is the (n-1)th-order derivative images at the direction of (0°, 45°, 90°
and 135°) with . Meantime, ( ) {
68
represents the (n-1)th-order gradient transitions with binary patterns. The nth- order LDP is defined as { | }. At each
direction , is encoded into an 8-bit binary string at each pixel position and then a 256-bin histogram is generated. Finally, four histograms are concatenated into one feature histogram. Furthermore, in order to capture the spatial pattern in larger spa- tial extent, the input image is first divided into a series of sub-regions. An LDP histo- gram is first extracted from each sub-region at each direction independently. Four histo- grams are concatenated into one histogram for each sub-region. All concatenated histo- grams are then combined into a spatially enhanced histogram (referred to as “LDPSE”). Stage 1 The extract of LDP maps is conducted in the first stage. The LDP maps are higher order statistics. The maximal spatial extent exploited in this stage is 3×3 pixels. Stage 2 The second stage accumulates a histogram at each direction and concatenated these into one histogram. The histogram obtained in this stage is a 1st-order statistic. In addition, LDPSE is the concatenation of multiple 1st-order histograms. As a result, it is still a 1st-order statistic.