IV. RESULTADOS Y DISCUSIÓN
4.1. RESULTADOS
4.1.1. ANÁLISIS DESCRIPTIVO DE LAS VARIABLES
A probabilistic segment model is defined using the concept of a trajectory, combined with the idea of modelling variability in the trajectory separately from variability in the realisation of any one trajectory. For notational simplicity, in the following description it will be assumed that the observation sequence y = yo,...,yT is one-dimensional and corresponds to a single segment!** Suppose that P^ = {fa)a&A is & parameterised family of functions /^ :{ 0 ,l,...,r} ^ 7 . In principle F can be used as a set of trajectories to define segment probability distributions 6, over {0,...,T}. Let the intra-segment variance around the trajectory, x>0 be fixed. Given that f a , the conditional probability of y given is defined to be the product over t of the probabilities of the individual elements yt given a normal distribution
Ff^(^t),x with mean /^ ( O and variance i:
P(y\fa)
=\[ P(yt\fa (0) = n
^/„(r),T(yi )
■t=0 t=0
' As continuous distributions are used, all likelihoods computed with segmental HMMs are in fact probability densities, but they are loosely referred to throughout this thesis as “probabilities”.
^ In the segmental-HMM description given in this chapter, the observations have been numbered from 0 to T. In consequence, the segment duration is equal to T+1 and the mid-point of the segment occurs at time 772.
^fa(t),x is the intra-segment distribution. Hence, the alternative to the HMM independence assumption is that, although individual observations yt are mutually independent, they depend on the trajectory . Assuming that this trajectory is a good general representation of the observations it is intended to describe, the intra-segment variance will be much smaller than the total variance. Thus P ( y \ f a ) will be small unless the sequence y is well-represented by the trajectory , and so there is a fairly tight continuity constraint This is a major advantage over conventional HMMs, which treat all observations for all examples represented by any one state in the same way. Furthermore, the impact of the independence assumption is reduced to a much greater extent than is possible if all the variance is represented (time-independently) around a single trajectory (as in the model described by Deng et al., 1994), while avoiding the need for modelling error-correlations between successive observations (as in the approach of Goldenthal and Glass, 1994).
To obtain the probability of y and the trajectory given a model state, it is necessary to include the extra-segment probability P(a), which is the probability that a defines a plausible trajectory for that state. This leads to the joint probability P ( y , f a ) = P l y \ f a ) P(^) ■ The probability P(a) may include a segment duration probability component, or this may appear separately as in the general specification of a segment model given in Chapter 3.
A relationship with both segmental-feature models and constrained mean trajectory models can be identified by treating these alternative models as different special cases of the segmental HMM. Firstly, it could be assumed that the trajectory is an exact description of the observations, such that the trajectory is defined by the observation sequence with
P { y \ f a ) = l, and hence P { y , f a ) = P{<^) ■ This model uses a segmental feature which is a
parametric description of a segment of feature-vectors, and has similarities with the model suggested by Krishnan and Rao (1994). An alternative approach would be to assume that there is a single trajectory fa which is specified by the model, so that P{a) =1 and P i y ^ f a ) - P {y \ f a ) • This model represents a constrained mean trajectory with no sampling of the trajectory, which was the approach adopted by Deng et al. (1994).
In a segmental HMM, the trajectory parameters can vary and are “hidden” from the observer, and so the uncertainty in the trajectory realisation needs to be accommodated in the probability calculation. The trajectory parameters A are thus unknown, and so from a formal mathematical
Theory o f segmental HMMs 91
viewpoint the correct solution is to obtain the probability 6 , (y) of the sequence of observations
in the segment y given the z* model state by integrating out over all possible values o f ^ , thus:
bi(y) = P ( y ) = j P ( y , f J . a e A
This form of segmental HMM has been studied by Gales and Young (1993a, 1993b), for the simplest case of a “static” model in which the underlying trajectory is assumed to be constant over time. A conceptually-simpler alternative, which was proposed by Russell (1992, 1993), is only to consider P ( y ,/ ^ ) for one specified trajectory . Considering only one trajectory is advantageous for studying and evaluating the model representation of particular speech segments. When using a model based on a single trajectory, a useful definition is to specify:
bi (y) = P{y) = max P { y , f a ) and â{y) = argmax P(y, ).
a ç A aeA
a(y) is the optimal trajectory, whose parameters are those that maximise the joint probability of the observations and trajectory, given the model parameters. The optimal trajectory a{y) is not the same as a '(y), the trajectory which is the best fit to the data. The optimal trajectory is a maximum a posteriori estimate which takes into account prior information that y represents the particular model state, and is thus model-dependent. The studies which are described in this thesis have focused on investigating modelling of extra- and intra-segmental variability within an optimal trajectory approach. A discussion and evaluation of the theoretical and practical implications of using the optimal trajectory is included in subsequent chapters.
The exact form of the optimal trajectory depends on the trajectory parameterisation which is adopted. Two particular parameterisations have been studied for Gaussian segmental HMMs (GSHMMs). The simplest case is a static GSHMM (Russell, 1993), where the underlying trajectory is assumed to be constant over time and is thus represented by a single “target” vector. By assuming that the underlying trajectory changes linearly, a linear dynamic GSHMM can be formulated (Holmes and Russell, 1995b; Russell and Holmes, 1997). These two types of segmental HMM are described in more detail in the following two sections, including specifications for the optimal trajectories and Baum-Welch-type parameter re estimation formulae. These re-estimation formulae have been derived by Russell (1992; 1996a), for static and linear GSHMMs respectively, using the same principles as had been applied for conventional HMMs (Liporace, 1982).