Dimensiones de sostenibilidad a) Dimensión ambiental

The corpora collecting procedure of section 5.4 resulted in a corpus of 376, 007 sentences, each one containing one or more multiword expressions. In 85, 073 sentences (22.75%), the shallow parsing output before the replacement is different to the shallow parsing output after the replacement. The corresponding percentage is larger for non- compositional than compositional multiword expressions; 25.33% and 18.17%, respectively. Likewise, more changes happen for adjective-noun than for noun-noun multiword expressions; 28.76% and 16.06%, respectively. All change percentages are presented in the third and fourth column of table 5.5.

7.20% of all 85, 073 change instances are due to one or more parts of speech changes, and are classified to change class PoS. In other words, in 7.20% of cases where there is a difference between the shallow parses before and after replacing the multiword expression tokens there is one or more tokens that were assigned a different part of speech. However, excluding parts of speech from the comparison, there is no other difference between the two parses. The focus of this study is to quantify the effect of unifying multiword expression tokens in shallow parsing. Part of speech tagging is a component

included them in the examples for presentational reasons, mostly. For this reason, we chose to ignore part of speech changes, the changes of change class PoS. Below, we present our results for all other change classes.

Let kXk be the function that returns the number of instances assigned to change class X. With respect to the discussion of subsection 5.3.1 about how the instances of each class should be counted towards the final results, the number of sentences whose parsing was corrected after the replacement is the sum of sentences of the change classes that contribute positively, according to the evaluation hypotheses:

positive = kL2PMwk+kL2Pk+kPL2Pk+kP2Pk+

kPL2P+MwAk+kP2P+MwAk (5.1)

In contrast, the sum of sentences of change classes that contribute negatively towards to result according to the evaluation hypotheses is:

negative =kP2LMwk+kP2Lk+kP2PLk

To compute the minimum improvement in shallow parsing, positively contributions classes are computed positively, negatively contributing classes are computed negatively and the undecidable class PM negatively as well:

min = positive − negative−kPNk (5.2) For the maximum improvement, the undecidable class PM is computed as contributing positively:

max = positive − negative+kPNk (5.3)

Table 5.5 summarises experimental results for adjective-noun and noun-noun sequences, either compositional or non-compositional. The table also presents average statistics for multiword expressions of the same parts of speech independently of their compositionality and of the same compositionality independently of parts of speech. The third column shows the number of sentences of each class in the corpus. For each one

Multiword Shallow Parsing

expressions improvement

class PoS sentences changed minimum maximum

Compositional N N 143,229 15.74% 4.48% 5.47% Non-Compositional N N 77,812 16.67% 4.39% 5.84% Compositional J N 68,707 23.24% 7.34% 9.21% Non-Compositional J N 86,259 33.15% 15.32% 19.67% Any N N 221,041 16.06% 4.45% 5.60% Any J N 154,966 28.76% 11.78% 15.03% Compositional Any 211,936 18.17% 5.41% 6.68% Non-Compositional Any 164,071 25.33% 10.14% 13.11% Any Any 376,007 21.30% 7.47% 9.49%

Table 5.5: Summary of experimental results. “PoS” stands for parts of speech, “N N” for noun noun sequences and “J N” for adjective noun sequences.

of the classes of table 5.5, the fifth and sixth columns show the minimum and maximum improvement in shallow parsing, respectively, caused by unifying multiword expression tokens. It should be noted that this improvement are computed on corpora whose all sentences contain at least one known multiword expression. To project this improvement on any general text, one needs to know the percentage of sentences that contain known multiword expressions. Then the projected improvement can be computed by multiplying these two percentages.

On average of all multiword expressions, unifying multiword expression tokens contributes from 7.47% to 9.49% in shallow parsing accuracy. Both for noun-noun and adjective-noun multiword expressions, non-compositional ones improve accuracy more than compositional ones do, due to the idiosyncratic nature of non-compositional multiword expressions. However, the improvement is much larger for non-compositional adjective-noun multiword expressions.

Recall that the shallow parser accuracy is reported to decrease as the length of phrases increases. For this reason, we expect some increase in parsing accuracy due to the fact that we reduce by one the length of the phrases which contain the multiword expressions. To assess the level of accuracy improvement due to this phrase length decrease, we introduce the following baseline. Using the corpora already collected, we

with same PoS improvement PoS sentences changed minimum maximum

N N 221,041 10.83% 3.21% 3.96%

J N 154,966 16.49% 5.26% 5.76%

Any 376,007 13.17% 4.05% 4.70%

Table 5.6: Summary of experimental results - Baseline of random sequences. “PoS” stands for parts of speech, “N N” for noun-noun sequences and “J N” for adjective-noun sequences.

assess the improvement in shallow parsing when we replace a random two-word sequence of specific parts of speech with a special made-up token. In noun-noun multiword expressions corpora we replace a random noun-noun sequence in each sentence. In adjective-noun corpora we replace a random adjective-noun sequence in each sentence.

Table 5.6 presents the results of the baseline. We observe that the improvements are as expected smaller than the improvements when multiword expressions are replaced. The differences among the improvements for random noun-noun sequences, compositional noun-noun multiword expressions and non-compositional noun-noun expressions are not large, thus integrating noun-noun multiword expressions does not improve shallow parsing significantly. The difference is also small between random adjective-noun sequences and compositional adjective-noun multiword expressions. In contrast, there is large increase in accuracy between compositional and non-compositional adjective-noun multiword expressions, thus adjective-noun non-compositional multiword expressions are clearly worth considering.

Figure 5.11 shows the percentage of each change class over the sum of sentences whose parse output before unifying multiword expression tokens is different for the parsing output after the replacement. The ravdogram shows percentages for compositional and non-compositional noun-noun sequences, adjective-noun sequences and on average. The most common change class for all multiword expression categories is PL2P, accounting for 34.03% of the changes. It contains instances where after replacing the multiword expression with a single made-up token a number of phrases and leaves are assigned to a single phrase. This class contributes positively to the result according to the evaluation hypothesis. Classes of medium frequency, around 10%, are P2LMw, P2PL,

Figure 5.11: Change percentages per change class on average and per multiword expressions category. “N N” stands for noun-noun sequences and “J N” for adjective-noun sequences.

PL2P+MwAand P2P. In contrast, P2L, L2PMw L2P, PN and P2P+MwA are infrequent, around 5%.

In figure 5.11 we observe that compositional and non-compositional noun-noun sequences do not differ much in any change class. This explains why there is no significant increase on the overall improvement between these multiword expressions. The distribution of changes is very different for adjective-noun sequences; P2LMw and PL2P account for 10% and 20% more changes for compositional against non-compositional adjective-noun multiword expressions, respectively. In opposition, PL2P+MwA and P2P+MwAaccount for 20% and 10% more changes for non-compositional against compositional adjective-noun multiword expressions, respectively. This shows that non- compositional adjective-noun multiword expressions behave differently than compositional ones.

multiword expressions (table 5.3), the change distribution differences can explain why this class of multiword expressions leads to the largest improvement. The components of non-compositional adjective-noun multiword expressions are commonly assigned into different phrases (change class MWA), and this mistake is corrected after the replacement of the multiword expression with a single token.

Recall that Munoz et al. (1999) reported that the shallow parser is commonly mis- taken when the phrases contain conjunctions, gerunds and adverbial noun phrases. Our results agree since there is one gerund among compositional adjective-noun multiword expressions, i.e. parking brake against 4 among non-compositional adjective-noun multiword expressions, i.e. holding pattern, living rock, missing link and stocking stuffer. Moreover, error analysis revealed that conjunctions between two noun phrases are very common among instances of change classes PL2P and PL2P+MwA.

In document UNIVERSIDAD NACIONAL DEL CENTRO DEL PERÚ (página 23-37)