De conformidad con la Norma Técnica Complementaria por colonia establecida en el citado instrumento de planeación urbana, y con el fin de conservar y mejorar la calidad de vida de los residentes de las

GOBIERNO MUNICIPAL

8. De conformidad con la Norma Técnica Complementaria por colonia establecida en el citado instrumento de planeación urbana, y con el fin de conservar y mejorar la calidad de vida de los residentes de las

Looking at Figure 5.1, we see that if cache sizes are small, both attacks results in very high cache miss rate, including for ρ = 0.01, when the attack has a rate of only 644 packets per second. In this range, miss rate is almost independent of attack overlap, only slightly higher for δ = 0 due to the informed selection of attack address space. However, when δ = 0 the cache performance is much improved after a certain threshold is passed whereas for δ = 1 it barely changes up to when cache size becomes BGPϕ and miss rate drops to 0. In other words, the non overlapping

attack may be absorbed with larger cache sizes while the overlapping one cannot. Perhaps counter-intuitively, overlapping attacks are more damaging against a map- cache. They are easier to generate, as they do not require prior knowledge about the attacked network, but they are also harder to defend against since, after a certain point and for a wide ranges of values, increasing the cache sizes does not yield much improved performance, if less than BGPϕ.

Arguably, the most worrisome result we observe is the rather high miss rate which barely drops under 0.01, even for ρ = 0.01 and only when the normalized cache size is higher than 0.2. As a comparison, under normal operation this miss rate would be obtained with a normalized cache size of about 0.02, an order of magnitude less. Therefore, considering the high packet throughput of border routers, some more complex cache management strategies should be set in place to avoid hundreds to

5.4 Chapter Summary 57

thousands of packet drops per second.

One possible first step to circumventing the effect of cache polluting attacks would be to detect them prior to taking action. This may be achieved with s(u), if a ground truth estimation of its shape exists. So, if s(u) is known beforehand, an esti- mate of the average miss rate for the cache size used can be computed. Then, if the instantaneous miss rate surpasses this estimate by more than configured threshold, a cache protecting action could be taken. For instance, the top, or most recently used, part of the cache could be “protected” against eviction. Other measures could include the implementation of a cache hierarchy or the limiting of user request rate for new destinations. In the former case, evicted entries should be stored in a larger but higher access time cache while in the latter some of the network elements should monitor per user traffic and filter out attack attempts. Since it is not within the scope of our work we do not explore other aspects related to the implementation of such tools.

5.4 Chapter Summary

To prove the versatility of our cache model but also to better understand the vul- nerability of the map-cache in the face of malicious users, in this chapter we develop an extension capable to account for polluting attacks. If previously we observed that, in normal conditions, increasing cache size quickly diminishes miss rates we now found that it has little to no effect under simple cache pollution attacks. As a result, we advise that more complex management strategies be devised and set in place.

Chapter 6

LISP Control Plane Scalability

In this chapter we investigate the main source of temporal locality of reference in network traffic. We show that destination popularity is enough to entirely characterize the average working-set, independent of the much harder to model short term correlations. Leveraging the result from Chapter 4, we obtain a cache model that together with a small set of assumptions allows us to reason about asymptotic cache performance scalability.

6.1 Sources of Temporal Locality in Network Traffic

Consider the following formalization of traffic, either at Web page or packet level, we previously presented in Section 4.1 and which we repeat here for clarity. Let D be a set of objects (Web pages, destination IP-prefix, program page etc.). Then, we define traffic to be a strings of references r1, r2, . . . , ri. . . where ri = o ∈ D is

a reference at the ith unit of time that has as destination, or requests, object o. Generally, we consider the length of the reference string to be N. Also, note that we use object and destination interchangeably.

Two of the defining properties of reference strings, important in characterizing cache performance, are the heavy tailed popularity distribution of destinations and the temporal locality exhibited by the requests pattern. We discuss both in what follows.

6.1.1 Popularity Distribution

Copious amounts of studies in fields varied as linguistics [105, 146], Web traffic [23, 94], video-on-demand [29], p2p overlays [40] and flow level traffic [119] found

the probability distribution of objects to have a positive skew. Generally, such dis- tributions are coined Zipf-like, i.e., they follow a power law; whereby the probability of reference is inversely proportional to the rank of an object. Typically, the relation is surmised as:

ν(k) = Ω

kα (6.1)

where ν is the frequency, or number of requests observed for an object, k is the rank, Ω = 1/H(n, α) is a normalizing constant and H(n, α) is the nth _generalized

harmonic number.

It is interesting to note that although Zipf’s law has its origins in linguistics, it was found to be a poor fit for the statistical behavior of words frequencies with low or mid-to-high values of the rank variable. That is, it does not fit the head and tail of the distribution. Furthermore, it’s extension due to Mandelbrot (often called the Zipf-Mandelbrot law) only improves the fitting for the head of the distribution. Such discrepancies were also observed for Web based and p2p reference strings. Often the head of the distribution is flattened, i.e., frequency is less than the one predicted by the law, or the tail has an exponential cutoff or a faster power law decay [40, 105]. But these differences are usually dismissed on the basis of poor statistics in the high ranks region corresponding to objects with a very low frequency.

Nevertheless, Montemurro solved recently the problem in linguistics by extending the Zipf-Mandelbrot law such that for high ranks the tail undergoes a crossover to an exponential or larger exponent power-law decay. Surprisingly, he found this features, i.e. deviations from the Zipf-like behavior, to hold especially well when very large corpora [105] are considered. We further refer to this model as the Generalized Zipf law or GZipf and, in light of these observations, we assume the following:

Assumption 1. The popularity distribution of destination IP-prefix reference strings

can be approximated by a GZipf distribution.

6.1.2 Temporal locality

Temporal locality can be informally defined as the property that a recently referenced object has an increased probability of being re-referenced. One of the well established ways of measuring the degree of locality of reference strings is the inter-reference distance distribution.

Breslau et al. found in [23] that strings generated according to the Indepen- dent Reference Model (IRM), that is, assuming that references are independent

6.1 Sources of Temporal Locality in Network Traffic 61

and identically distributed random variables, from a popularity distribution have an inter-reference distribution similar to that of the original string. Additionally, they inferred that the probability of an object being re-referenced after t units of time is proportional to 1/t. Later, Jin and Bestavros proved that in fact temporal locality emerges from both long-term popularity and short-term correlations. However, they found that the inter-reference distance distribution is mainly induced through long- term popularity and therefore is insensitive to the latter. Additionally, they showed that by ignoring temporal correlations and assuming a Zipf-like popularity distribu- tion then an object’s re-reference probability after t units of time is proportional to 1/t(2−1/α)_{. These observations then lead to our second assumption:}

Assumption 2. Temporal locality in destination IP-prefix reference strings is mainly

due to the prefix popularity distribution.

We contrast the two assumptions with the properties of several packet-level traces in 4.4. In what follows we are interested in characterizing the inter-reference distribution of a GZipf distribution and further on the cache miss rate using the two statements as support.

6.1.3 GZipf generated inter-reference distribution

In this section we compute the inter-reference distance distribution for a GZipf popularity. The result is an extension of the one due to Jin and Bestavros for a Zipf-like popularity. As a first step we compute the inter-reference distribution for a single object and then by integration obtain the average for the whole reference string, which we denote by f(t).

If ν is the normalized frequency, namely, the number of reference to an object divided by the length of the reference string N, then, as shown in [105] the probability of observing objects with frequency ν in the reference string is:

pν(ν) ∝ 1

µνr_{+ (λ − µ)ν}q (6.2)

where 1 ≤ r < q are the exponents that control the slope of the power laws in the two regimes and µ and λ are two constants that control the frequency for which the tail undergoes the crossover.

From Assumption 2 it follows that references to an object are independent whereby the normalized inter-reference distance t is distributed exponentially with

expected value of 1/(Nν) and 0 < t < 1. Then, if we denote by d(t, ν) the number of times the inter-reference distance for an object with frequency ν is t, we can write:

d(t, ν) ∼ (νN − 1)νNe−νN t (6.3)

If νmin and νmax are the minimum and respectively the maximum normalized

frequency observed for the reference string, we can compute the inter-reference distance for the whole string as:

f(t) ∼ Z νmax νmin pν(ν) d(t, ν)dν ∼ Z 1 0 (νN − 1)νNe−νN t µνr+ (λ − µ)νq dν (6.4)

Unfortunately, the integral is unsolvable, nevertheless, we can still characterize the properties of f(t) in the two regimes of the GZipf distribution. In the high frequency region, where term having q as exponent dominates the denominator we can write: fq(t) ∼ Z 1 νk ν2 e−νt νq dν = Γ(3 − q, νkt) − Γ(3 − q, t) t3−q (6.5) where, Γ(n, z) =R∞

z xn−1e−xdx is the incomplete Gamma function. νk = (µ/(λ −

µ))1/(q−r) is the frequency for which the two terms that make up the denominator

are equal. It is useful to note that for low t values that correspond to high frequencies the nominator presents a constant plateau that quickly decreases, or bends, at the edges as t → 0 and t → 1/νk. Therefore, we can approximate:

fq(t) ∼ 1

t3−q (6.6)

Similarly, it may be shown that for low frequencies, that is, in the region where term with r as exponent dominates:

fr(t) ∼ 1

t3−r (6.7)

In document TOMO CXLII Santiago de Querétaro, Qro., 21 de agosto de 2009 No. 59 (página 61-71)