de comunicación apropiados (4.1.6.)
3. Nombre Sistema de Calidad
This is where the output variables of the wind power model, which include prediction of 6- hour, three-hour or 72-hour ahead as the case may be, are presented for performance measure comparison especially with other algorithms.
Pre-Processing Neural Network Post-Processing
Sets of inputs
77 | P a g e
4.6.1 Descriptive Statistics
To identify or eliminate a candidate distribution, descriptive statistics are applied. For instance, the sample mean and median times will be close for a symmetrical or nearly symmetrical distribution, such as the normal or Weibull with a shape parameter between 3 and 4. If the mean is considerably larger than the median, then the exponential or lognormal distribution will provide a better fit.
The mean, 𝜇 or wind variability, Median, 𝑡𝑚𝑒𝑑, Mode,𝑡𝑚𝑜𝑑𝑒, and variance, 𝜎2,of the
lognormal distribution are discussed in [62], hence, realisation of Eq. (4.23) μ = tmedexp1
2𝜎
2 (4.23)
The mean of the wind speed data over the wind farm 𝜇′ is described in terms of 𝑡𝑚𝑒𝑑, and 𝜎
is given by Eq. (4.24) μ′= ln(tmed) − 1 2ln ( σ2 tmed+ 1) (4.24)
Furthermore, the median of this distribution is given by Eq. (4.25):
t
med= e
μ′ (4.25)The mode is estimated using Eq. (4.26) while the standard deviation is obtained using Eq. (4.27):
tmode= tmed
exp(S2) (4.26)
σ2 = tmed2 exp(S2)[exp(S2) − 1] (4.27)
4.6.2 Probability Plots
A probability plot is utilized to provide a better visual test of a distribution than comparison of a histogram with a probability density function. Initial estimates of the parameters of the
78 | P a g e
distribution fitted are made possible with probability plots, such as exponential, normal distributions, and lognormal plots.
4.6.2.1 Exponential Plots
The cumulative distribution function plot for the exponential distribution is expressed by the computation of Eq. (4.28)
F(t) = 1 − e−λt (4.28)
Taking the natural logarithm of both sides ln[1 − F(t)] = −λt
− ln[1 − F(t)] = ln [ 1
1−F(t)] = λt
An accurate fit to the observed times data may be obtained by performing a least-squares fit of Eq. (4.29) λ̂ = b = ∑ni=1xiyi ∑ni=1xi2 (4.29) where yi = ln 1 [1−F(t)] and xi = ti
The estimate for wind power from adjacent turbines is expressed by =1
𝑏
4.6.2.2 Normal Distribution Plots
For the normal distribution, which represents random wind speed variable selection from different turbines is estimated with Eq. (4.30) to form the tradition bell-shaped graph shown in Figure 5.7.
F(t) = ϕ (t−μ
σ ) = ϕ(z) (4.30)
The inverse function is rewritten as Eq. (4.31), which is linear in time 𝑡.
zi = ϕ−1[F(t)] =ti−μ
79 | P a g e
The points (ti, ℱ̂(ti)) are plotted with appropriate transformation of the vertical scale. A least
squares fit is obtained by setting
𝑥𝑖 = 𝑡𝑖 and 𝑦𝑖 = 𝑧𝑖 (4.32)
The values of 𝑧𝑖 are obtained from a Table of standardized normal probabilities, based on the
corresponding values of ℱ̂ (𝑡𝑖). From the least-squares fit and Eq. (4.33),
𝜎̂ = 1
𝑏 and 𝜇̂ = −𝑎𝜎̂ = −
𝑎
𝑏 (4.33)
4.6.2.3 Lognormal Plots.
The lognormal plot is made with the relationship of the distribution with the normal distribution. Since F(t) = ϕ (1 Sln t tmed) = ϕ(z) and Z = ϕ −1[F(t)] =1 Slnt − 1
Slntmed, The points (𝑙𝑛𝑡𝑖, 𝑍𝑖)
are plotted. For the least squares fit, Eq. (4.35) is presented, where;
xi = lnti and yi = Zi (4.35)
The shape parameter 𝑆 is the reciprocal of the slope of the plotted line and 𝑡𝑚𝑒𝑑, the median,is
obtained from the intercept of the fitted line. That is
Ŝ = 1
b and tmed = e
−Ŝa (4.36)
4.6.2.4 Parameter Estimation
Probability plots and least squares data fitting only provides estimates of the distribution parameters and does not provide best results. For this reason, a maximum likelihood
estimator (MLE) 𝑃 is defined in Eq. (4.37).
P ̂ = n n+∑ni=1(xi−1) =∑ n xi n i=1 (4.37)
80 | P a g e
If the probability of a failure remains a constant 𝑃 and each trial is independent, then
Pr{X = x} = f(x) = (1 − P)x−1P x = 1,2, … (4.38)
Where 𝑋 is the variable representing the number of trials necessary to obtain the first failure
and 𝑥 represents the sample size, f(x) is the likelihood function and represents the probability
of obtaining the observed sample.
4.6.3 Stationarity Test.
Prior to building predictive models for training time series algorithms, a stationarity test is one of the required steps. The research employed the Dickey-Fuller Test (DFT) to study stationarity considering daily wind variations within five days of historical data. From the test, the series is not stationary as in Figure3. The non-stationarity is attributed to trend and issues of normal distribution.
To obtain stationarity, the first level DFT is computed (d = 1) where d = differencing which is
the difference between current series (𝛾𝑡) and previous series (γt−1) as in Δγt = γt− γt−1.
From our differenced data, Figure 3.6 is generated. This figure implies that maximum wind speed is experienced from 4AM to noontime leaving the afternoon time with low wind speeds.
The insight gained in the figure led to data split; 06-14-2003 to 06-17-2003, equivalent to 80% of the data for training while the rest in 6-17-2003 to 06-18-2003 equivalent to 20%, within the high wind time is set for testing the regularised models.
The non-linear nature of RNN and improvements in LSTM as demonstrated by [70, 91] shows the dynamic nature of wind in relation to geography, climate, landforms, seasonality can be addressed although in [34], statistical wind power transform appears strenuous.
81 | P a g e