Capítulo III: Propuesta de metodología para diseño de programa estratégico para enfrentar el delito y conductas desajustadas en el municipio de Viñales y
3.3 Elementos estructurales del Programa Estratégico para la prevención de conductas antisociales.
3.3.4 Acciones Estratégicas Generales.
A final dataset of 1,551 min was compiled, consisting of all the minute data from each of the 47 cases from the 16 min before lightning cessation through the 16 min following lightning cessation. Each minute/observation corresponds to a probability value for lightning cessation. Results test how the method performed by treating each 1-min interval as an independent observation. Similar to JP17, the 95.0%, 97.5%, and 99.0% probability thresholds were tested.
Hits, misses, false alarms, and correct nulls were determined in accordance with Table 4. It is important to note that with respect to lightning cessation these terms should not be interpreted in the same manner as traditional forecast metrics. This
Hits False Alarms Misses Correct Nulls
95.0% 247 6 505 793
97.5% 186 2 566 797
99.0% 126 1 626 798
Table 5. Minute by minute forecast verification metrics for the four possible outcomes using JP17’s lightning cessation GLM in Washington, D.C.
is primarily due to the nature of the forecast; the goal is to correctly predict the end of a weather phenomenon, not its onset. This point is best demonstrated by the definition of a false alarm in terms of lightning cessation. A false alarm represents the prediction of lightning cessation when lightning is still ongoing. Indeed, this is the most dangerous outcome in terms of safety.
The forecast metrics for the Washington, D.C. area are listed in Table 5. By examining the distribution of the metrics, the greatest weight for all three probabil- ity thresholds was in the correct nulls. In Washington, D.C., the GLM performed well correctly delaying the prediction of lightning cessation before the event actu- ally occurred. This also contributed to the low number of false alarms; the false alarms only comprise .40%, .13%, and .06% of the total metrics for the 95.0%, 97.5%, and 99.0% thresholds, respectively. This is encouraging since false alarms pose the greatest threat to public safety and resource protection.
The second most prevalent metric was miss. If the model did not forecast lightning cessation, yet lightning cessation had indeed occurred, it was recorded as a miss. Although this is not a dangerous result, it is not desired since it impedes operations by imposing unnecessary and costly delays. Furthermore, the number of misses is relatively high in relation to the number of hits. There are roughly twice as many misses than hits. Simply looking at the ratio of misses to hits gives the impression that the model forecasted lightning cessation incorrectly more often than correctly. However, the large number of correct nulls, which is a favorable forecast
outcome, should also be accounted for. Additionally, in the context of how this study was executed, the misses to hits ratio can be misleading.
During analysis, a time constraint was applied which limited the observation time to 33 min. This means that observations were not necessarily taken until the storm dissipated and lightning cessation was guaranteed. In consequence, the dis- proportion of misses to hits could be due to the fact that the model was not tested long enough for all the possible number of hits to be identified. This disproportion of metrics will also be self-evident in a few of the performance statistics for Washington, D.C.
Once the forecast metrics for the data were determined, statistical performance measures defined by Equations 2-7 and presented in Table 6 were calculated. These measures were used to determine the skill of the GLM in the Washington, D.C. area. The low FAR is congruent with the low number of false alarms in the dataset. The FARs for each of the three probability thresholds fall below 2.5%. A lower value is ideal since it indicates that a very small number of observations using the GLM forecasted lightning cessation too early. As discussed in section 3.6, the FAR and POD are best interpreted together. The POD values range from ∼ .10 to ∼ .30 (Table 6); these values are rather low since the desired POD score is a value near 1.0. Analyzing the POD and FAR conjointly gives conflicting results in terms of model performance. While the model correctly refrains from prematurely forecasting lightning cessation, it is also slow to identify lightning cessation when it is occurring.
DC Statistics POD FAR HSS TSS CSI
95.0% 0.3285 0.0237 0.6422 0.3209 0.3259
97.5% 0.2473 0.0106 0.6109 0.2448 0.2467
99.0% 0.1676 0.0079 0.5790 0.1663 0.1673
Table 6. Minute by minute performance statistics for the Washington, D.C. area using JP17’s lightning cessation GLM.
Thus, the number of hits in relation to misses is low.
The TSS and CSI reflect values similar to the POD, which has an approximate range from ∼ .10 to ∼ .30 (Table 6). A perfect score for both TSS and CSI would be 1.0. Thus, the calculated TSS and CSI values indicate that the model has relatively low skill. In contrast, the HSS indicates satisfactory performance with values ranging from ∼ 0.55 to ∼ .65. As mentioned in section 3.6, the HSS combines all forecast metrics in Table 5 to measure the fractional improvement of the forecast over the standard forecast. Thus, these HSS values mean that the model is performing much better than the standard forecast, or chance. However, it is important to note that these performance results do not necessarily imply that the model will perform in the same manner in real-time.
Next, the verification measures of the model’s performance in the Washington, D.C. area were compared to the model’s performance in central Florida. Table 7 displays the performance measures calculated in JP17 for central Florida. Figures 18 through 19 illustrate the similarities and differences in the GLM’s performance statistics in the Washington, D.C. area versus central Florida. The error bars for each performance measure were calculated by bootstrapping the data using 10,000 resamples over a 95% confidence interval. These error bars reveal the variability, or range of values, of each performance statistic for the Washington, D.C. dataset. JP17’s results for central Florida are overlaid, showing the relationship of the per- formance statistics for the two locations. If the performance statistics for central
FL Statistics POD FAR HSS TSS CSI
95.0% 0.6989 0.0044 0.6650 0.6949 0.6968
97.5% 0.6318 0.0013 0.5976 0.6307 0.6312
99.0% 0.5451 0.0005 0.5098 0.5448 0.5448
Table 7. Minute by minute performance statistics from JP17 for central Florida using the lightning cessation GLM.
Figure 18. POD, FAR, and HSS performance statistics for JP17’s GLM in both Wash- ington, D.C. and central Florida. Washington, D.C. performance statistics and boot- strapped results utilizing the 95th percentile are depicted by the diamonds and error bars, respectively. Central Florida performance statistics are overlaid in circles.
Florida are contained within the Washington, D.C. error bars for the corresponding performance statistic, then it can be concluded with confidence that the performance statistics are statistically similar. Likewise, if the performance statistics for central Florida are not contained within the Washington, D.C. error bars then it can be concluded that the performance statistics are statistically different.
Figure 18 illustrates the comparison of POD, FAR, and HSS values for the three probability thresholds. The FARs, which are a measure of the reliability of the model, are relatively low for both locations and all probability thresholds. The central Florida FARs are contained within the Washington, D.C. FAR error bar ranges, thus they are statistically similar. In contrast, the POD values for both locations vary significantly. In fact, the Florida POD values double those of Washington, D.C. This contrast can possibly be construed by a key difference in methodologies. Theoretically, there should be a minimum of one hit for each probability threshold for each case (at
Figure 19. (a) TSS (left) and (b) CSI (right) performance statistics for JP17’s GLM in Washington, D.C. and central Florida. Washington, D.C. performance statistics and bootstrapped results using the 95th percentile are depicted by the diamonds and error bars, respectively. Central Florida performance statistics are overlaid in blue circles.
least 47 hits per probability threshold). This is true because, for all lightning events, lightning cessation will eventually take place. However, in this study, time restrictions were set, limiting the analysis observation time to 33 min, 16 min before and after lightning cessation. JP17 did not utilize time restrictions in his study. He observed each thunderstorm case until the storm dissipated (or composite reflectivity dropped below 25 dBZ) and the model forecasted lightning cessation. His higher values for POD (Figure 18), TSS (Figure 19a), and CSI (Figure 19b) reflect this. This difference in methodology will be accounted for in section 4.2 with the alternative performance evaluation approach.
As discussed in section 3.6, the HSS and TSS are both truly equitable mea- sures; therefore, they are more robust indicators of skill than CSI, POD, and FAR used as stand-alone measures. One distinction between the TSS and HSS is that for models with positive skill, TSS has a proclivity to treat overpredicting models more generously than HSS, and underpredicting models more harshly. In this study, this distinction is evidenced by the significant contrast between the TSS and HSS (Tables 6 and 7). Indeed, the model in the Washington, D.C. area tends to underpredict
lightning cessation as indicated by the disproportionate ratio between misses and hits in Table 5. This could explain why the HSS values are higher and more congruent with JP17’s Florida results, while the TSS values are much lower in comparison.