Multi-Tenant Radio Access Network Slicing: Statistical Multiplexing of Spatial Loads

(1)

Multi-Tenant Radio Access Network Slicing:

Statistical Multiplexing of Spatial Loads

Pablo Caballero, Albert Banchs, Senior Member, IEEE, Gustavo de Veciana, Fellow, IEEE, and Xavier Costa-Pérez, Member, IEEE

Abstract— This paper addresses the slicing of radio access network resources by multiple tenants, e.g., virtual wireless operators and service providers. We consider a criterion for dynamic resource allocation amongst tenants, based on a weighted proportionally fair objective, which achieves desirable fairness/protection across the network slices of the different tenants and their associated users. Several key properties are established, including: the Pareto-optimality of user association to base stations, the fair allocation of base stations’ resources, and the gains resulting from dynamic resource sharing across slices, both in terms of utility gains and capacity savings. We then address algorithmic and practical challenges in realizing the proposed criterion. We show that the objective is NP-hard, making an exact solution impractical, and design a distributed semi-online algorithm, which meets performance guarantees in equilibrium and can be shown to quickly converge to a region around the equilibrium point. Building on this algorithm, we devise a practical approach with limited computational information and handoff overheads. We use detailed simulations to show that our approach is indeed near-optimal and provides substantial gains both to tenants (in terms of capacity savings) and end users (in terms of improved performance).

Index Terms— Wireless networks, multi-tenant networks, RAN-sharing, network slicing, resource allocation.

I. INTRODUCTION

D

RIVEN by the capacity requirements forecasted for future mobile networks as well as the decreasing mar- gins obtained by operators, infrastructure sharing has established itself as a key business model for mobile operators to reduce the deployment and operational costs of their networks (e.g., [1] reports a 280% increase in deals within the last 5 years). While passive and active sharing solutions, ranging from exclusive allocation of resources to roaming agreements, are used and have been standardized, these sharing approaches are based on fixed contractual agreements with Mobile Virtual Network Operators (MVNO) over long time

Manuscript received May 8, 2016; revised December 23, 2016 and May 14, 2017; accepted June 12, 2017; approved by IEEE/ACM TRANS- ACTIONS ONNETWORKINGEditor K. Psounis. Date of publication July 17, 2017; date of current version October 13, 2017. The work of P. Caballero and G. de Veciana was supported by the NSF Award under Grant CNS-1343383.

The work of A. Banchs was supported in part by the H2020-ICT-2014-2 5G NORMA project under Grant 671584 and in part by the Spanish project DRONEXT under Grant TEC2014-58964-C2-1-R. The work of X. Costa- Pérez was supported by the H2020-ICT-2014-2 5G NORMA project under Grant 671584. (Corresponding author: Pablo Caballero.)

P. Caballero and G. de Veciana are with The University of Texas at Austin, Austin, TX 78712 USA (e-mail: [email protected]).

A. Banchs is with the University Carlos III of Madrid, 28911 Leganés, Spain, and also with the IMDEA Networks Institute, 28918 Leganés, Spain.

X. Costa-Pérez is with NEC Europe Ltd., 69115 Heidelberg, Germany.

Digital Object Identifier 10.1109/TNET.2017.2720668

periods (typically on a monthly/yearly basis). In this paper, we focus on a structured dynamic slicing approach which enables a much more efficient sharing of network resources, as envisioned by the 3GPP Network Sharing Enhancements for future mobile networks which the authors contributed to [2].

Following [3], our approach divides the infrastructure into network slices, assigning a different slice to each operator, and implements the sharing of network resources among operators by dynamically allocating resources to slices. Such a novel network slicing approach is expected to result in new business models and revenue sources for infrastructure providers (see, e.g., [3]). Indeed, this approach supports not only classical players (mobile operators) but also new ones such as Over-The-Top (OTT) service providers that may buy a slice of the network to ensure satisfactory service to their users (e.g., Amazon Kindle’s support for downloading con- tent or a pay TV channel including a premium subscription).

In the literature, the term tenants is often used to refer to the different types of players, and multi-tenancy refers to approaches enabling dynamic network slicing and resource sharing for multiple tenants. For simplicity, hereafter we use the term operator in a broad sense to refer to classical (virtual) operators as well as the new players enabled by this approach.

In designing a practical solution for dynamic resource sharing among slices we face multiple challenges. To start with, we need a sharing criterion that not only allocates resources to operators (and their corresponding slices) fairly, but also, shares the resources of each operator fairly among its users. Furthermore, the criterion should allow for flexible sharing “levels” to meet operators’ heterogeneous requirements; for practical purposes, these levels should be coarse- grained, rather than based on instantaneous resource needs.

When allocating resources to an operator, one should take into account the numbers and locations of active users on the network–indeed some locations may see higher demand and (consequently) the associated resources might be scarce.

Beyond the criterion itself, designing an algorithm to imple- ment it, while realizing timely adaptation to network changes, is also very challenging. Given the amount of information involved (including the channel quality of each user) and its dynamic nature, the algorithm should be as distributed as possible. Also, since the algorithm may be triggered fre- quently (whenever a user joins, leaves or changes its location), it should be computationally efficient. When adapting to net- work changes, the algorithm should control the number of handoffs triggered, as those may represent a high overhead.

See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

(2)

Key Contributions

This paper proposes a criterion for slicing the network infrastructure amongst operators and an algorithm to allocate resources accordingly. The key contributions are as follows.

In Section II, we introduce a criterion for dynamic resource sharing among operators; while the criterion has been proposed before, we provide a characterization supporting its use in a multi-tenant network setting. These properties are developed in Section II-C, providing insights on the optimality and fairness of the resulting allocations, and the benefits are studied in Section II-D, by characterizing the capacity savings by means of a closed-formula. We show that the criterion not only improves overall network utility but also that of each individual operator, thus guaranteeing that operators are not harmed by the sharing of resources amongst slices.

In Section III-A, we show the criterion corresponds to an NP-hard problem, motivating the need to devise an efficient approximation algorithm which is introduced in Section III-B.

The proposed algorithm is semi-online, distributed, incurs low computational complexity, and has been specifically designed to control overheads associated with handoffs and/or mobile user reassociations; we rely on several intermediate analytical results to drive the key design choices underlying our algorithm. Section IV provides a comprehensive performance evaluation based on detailed simulations, showing that (i) operators can save up to 80% capacity while providing the same quality to their users, and (ii) for a fixed capacity, we improve user performance in terms of file download times by up to 30%, among other results.

Related Work

We next review and contrast our work to the state-of-the- art in (i) resource allocation based on proportional fairness, and (ii) resource sharing among operators.

Considerable research effort has been devoted to address the problem of fair resource allocation in networks. In wireline networks, fair resource allocation based on utility function maximization has been extensively studied following the sem- inal work of [4]. Building on this work, further algorithms for congestion control in multi-path environments have been proposed [5], [6]. Not unlike our work, these algorithms are distributed. However, they allow users to decide among multiple routes while we focus on a wireless setting where each user can only use one resource (her base station).

In the specific context of wireless networks, several approaches have been proposed [7]–[9] to the problem of resource allocation and user association based on weighted and unweighted proportional fairness, respectively. The unweighted case has been largely studied in the literature in different contexts (e.g., power control [10], interference avoidance [11]). Bu et al. [7] and Son et al. [11] analyzed the complexity of the problem and proved the existence of polynomial time algorithms which provide an exact solution, and [9] designed a distributed algorithm with convergence guarantees. In contrast to the above, the resource allocation criterion proposed in this paper relies on weighted proportional fairness, with operator-specific weights; this is a more difficult

problem as it is NP-hard [7] and the convergence of distributed greedy algorithms cannot be guaranteed [12].

Weighted proportional fair resource allocation in the context of wireless networks has also been studied from different angles. In [8], an algorithm with tight worst-case performance bounds is proposed, while [13] proposes an heuristic algorithm. In contrast to the distributed approach proposed in this paper, both algorithms are centralized and require the availabil- ity of the full network state information, which may be very challenging to gather in a timely manner. Hou and Chen [14]

and Hou and Gupta [15] propose a Gibbs-sampling mechanism based on simulated annealing that converges to an optimal solution. However, the convergence of such mechanisms is known to be very slow and for this reason the authors resort to a more practical greedy solution. For the proposed greedy solution, the authors neither provide performance bounds nor analyze convergence; additionally, the overhead is not controlled, which limits their practical deployment. All the approaches mentioned above address the problem of a single- operator network, in contrast to our work which focuses on the slicing and sharing of resources among multiple operators.

Multi-operator network sharing has been studied from many different angles, including planning, economics, coverage, performance, etc. (see e.g. [16]–[18]). This paper focuses specifically on the design of algorithms for resource sharing among operators, which has been previously addressed by [19]–[23];

however, all these works differ substantially from ours in terms of scope, criterion or approach. In [19] and [20], the optimization of the network utility follows a different criterion from the one in this paper, weighted proportional fairness, which (as we show) provides many desirable properties. The works of [21] and [22] present a proportional fair formulation similar to ours; however, they do not provide a rationale for their choice, in contrast to the solid analytical arguments provided here. Furthermore, [21] does not address the algorithm design, while [22] uses a general non-linear solver that incurs a very high computational complexity (as confirmed by our results of Section IV-D). Finally, [23] follows a game theoretic approach where operators bid for resources, which results in a fundamentally different problem from the one addressed here.

In summary: (i) while there has been substantial research on proportional fair resource allocation, its application to multi- operator settings and the associated problems have not been studied, and (ii) in spite of the substantial work devoted to proportional fairness in general settings, there is a gap in the systematic study of distributed mechanisms for joint resource allocation and user association that build on analytical results.

II. RESOURCEALLOCATIONCRITERION

In this section, we formulate the optimization problem that will drive (i) the association of users to base stations, and (ii) the allocation of base stations’ resources to users.

Hereafter, we refer to this optimization as the multi-operator resource allocation (MORA) criterion. We show analytically that the criterion satisfies desirable properties in terms of optimality and fairness, and develop a simple model to evaluate the potential sharing gains of our network slicing approach.

(3)

A. System Model

We start by presenting our system model which was developed with LTE/LTE-A systems in mind, but is generally applicable to cellular systems. Consider a network consisting of a set B of base stations (or sectors in case of sector antennas) that are shared by a set of operators O. At any given time, we let U denote the set of users sharing the network and Uo, o ∈ O the subsets of users belonging to each operator. An allocation of resources involves two sets of variables: (i) the association of users to base stations, denoted by x = (xub : u ∈ U, b ∈ B), where each user u is associated with a single base station, i.e., x_ub = 1 for one of the base stations and 0 otherwise, and (ii) the allocation of the resources of each base station among its associated users, denoted by f = (fub : u ∈ U, b ∈ B), where fub is the fraction of the base station b’s resources which are allocated to user u.¹ Note that in our model we ignoring the discrete nature of such resources, and assume that f_u,b can take any value in the continuous range [0,1].

We let ˜c_ubdenote the average rate per resource unit seen by user u at base station b under current radio conditions,²and let Cb be the base station’s total amount resources. Given that the user is allocated a fraction f_ub of the base station’s resources, her rate is given by f_ubC_b˜c_ub. For notational convenience, we define the achievable rate of the user as c_ub := ˜c_ubC_b, which yields the following rate allocation:

r_u(x, f) :=

b∈B

x_ubf_ubc_ub.

Note that the definition of c_ubactually represents an abstrac- tion of the underlying physical resources, accounting for the various physical layer techniques (such as, e.g., power control or MU-MIMO) as well as the interference from different sources (including that of neighboring base stations).

In line with similar analyses in the literature [19], [21], [22], [24]–[26], we shall assume that c_ub is fixed for each user and base station pair.

B. MORA Criterion

In line with previous approaches [19], [21], [22], the underlying assumption behind our criterion is that operators share the cost of deploying and/or maintaining the infrastructure, and the resources received by each operator should be based on the level of its (financial) contribution to the shared network:

if an operator contributes twice as much as another, it should roughly get twice the resources. To this end, each operator is assigned a network share s_o ∈ [0, 1], to represent its level of contribution to the network. Without loss of generality, these shares are normalized so that

o∈Os_o= 1.

The proposed criterion allocates resources across operators dynamically, tracking changes in the numbers and locations of operators’ mobile users and the associated transmission rates c_ub. When doing this, we need to make sure that

1For instance, in LTE/LTE-Afubdenotes the fraction of physical Resource Blocks, in FDM the fraction of bandwidth and in TDM the fraction of time

2Note that such average rates depend on the choice of modulation and coding scheme(s) selected for the user, after averaging out short-term fluctuations.

(i) network resources are fairly shared among the various operators according to their share, and (ii) at the same time, the resources allocated to a given operator are fairly shared among the users of that operator. To achieve this, we follow an approach akin to that in [27]³: we maximize the overall network utility resulting from aggregating operator utilities, where the utility of an operator is in turn the aggregation of its users’ utilities. To this end, we define the overall network utility as the sum of operators’ utilities weighted by the shares,

W(x, f) =

o∈O

s_oU_o(x, f),

and the operator utility as the sum utility of the operator’s users normalized by the number of users (where a user’s utility is logarithmic in its rate),

U_o(x, f) = 1

|Uo|

u∈Uo

log(r_u(x, f)),

By weighting the operator utilities with the shares, we give higher priority to operators with larger shares, and by nor- malizing with the number of users, we avoid that operators with more users are better off. For instance, with this choice, under uniformly loaded base stations an operator with twice the share of another one will get twice as many resources, independent of the number of users of each. Combining the above equations, one can rewrite the network utility as follows:

W(x, f) =

o∈O

u∈Uo

w_ulog(r_u(x, f)), (1) where the user weights w_uare defined as the operator network share divided by the current number of users of the operator, i.e., w_u = s_o/|Uo| (in simple terms, the network share of an operator is divided equally amongst its current users).⁴

With the above, we can now formulate the MORA optimization problem as follows. Such optimization corresponds to the weighted proportional fair criterion (see e.g. [4]) extended to a multi-operator setting that considers utilities of the operators, rather than the ones of the individual users⁵:

maxx,f W(x, f), (2a)

subject to: r_u(x, f) =

b∈B

x_ubf_ubc_ub, ∀u (2b)

b∈B

x_ub= 1 and x_ub∈ {0, 1}, ∀b, u (2c)

u∈U

f_ubx_ub≤ 1 and fub≥ 0, ∀b, u. (2d)

In the sequel we shall letx^MORA,f^MORAdenote a (possibly not unique) optimal solution to this optimization problem.

3Reference [27] addresses a similar problem to ours in the context of users and flows, as it aims at allocating resources fairly to users while preserving fairness among the flows of each user.

4While our definition of network utility coincides with that for weighted proportional fairness, the criterion proposed here is fundamentally different:

we consider resource allocation across time and vary the weights with the number of users, while weighted proportional fairness typically focuses on a static scenario and relies on fixed weights.

5Note that (2c) ensures that a user is associated with one (and only one) base station.

(4)

This formulation provides the optimal resource allocation at a given time under the current c_ub values (given by the selected modulation-coding schemes); in a dynamic setting, such allocations would be re-evaluated when any of the c_ub values change, due to changes in the (average) channel quality.

Note that, once MORA returns the user association x and resource allocation f, physical layer techniques (such as MU-MIMO or power control) are employed to optimize performance, under the constraint that users are provided with rates proportional to the r_u values given by MORA.

C. Properties of MORA Resource Allocation

Next, we show that the MORA criterion satisfies some desirable properties both in the way base stations’ resources are allocated to associated users, and the way users are associated with base stations.

1) Per-Base Station Resource Allocation: Let us first con- sider a general setting, where user associations to base sta- tions are fixed, to see how MORA allocates base station resources. Let x^∗ be the fixed (not necessarily optimal) user to base station association. If we optimize the resource allocation f for this user association, i.e., max_fW(x^∗,f) subject to (2b) and (2c), it can be seen from [8, Lemma 5.1]

that the resulting resource allocation is unique and given by f^M(x^∗) = (f_ub^M(x^∗) : u ∈ U, b ∈ B), where

f_ub^M(x^∗) =w_ux^∗_ub

v∈Uw_vx^∗_vb. (3) Further ifx^∗ = x^MORA, then f^M(x^∗) = f^MORA, i.e., we have MORA optimal allocation of network resources.

The above result is fairly intuitive. Users associated with a given base station are allocated resources proportionally to their weights w_u. This can be viewed as follows. The share of an operator represents the total budget of the operator.

When assigning a weight w_u= s_o/|Uo| to users, this share is distributed among the operator’s users, and hence the user’s weight represents the budget of a user. As the resources allocated to a user are inversely proportional to the sum of weights at her base station, the sum of weights can be viewed as the cost of a unit of resource at the base station.

Thus, operators with users associated with heavily loaded base stations will have to pay a higher cost (e.g, increase their network share or limit their overall number of users) or receive fewer resources.

The above shows that the number of active users that operators have on the network and their spatial distribution will impact the resources allocated under MORA. Indeed, allocations across base stations are coupled together through|U_o|, i.e., an operator with a large number of active users will have lower weights and likely lower per-user allocations. At the same time, the resources obtained by an operator heavily depend on the load at base stations to which its users will be associated with.

2) User Association: Next we study the MORA user associations. Building on the optimality of our formulation, we can show that the resource allocation resulting from MORA is Pareto-optimal, which means that for

any alternative allocation (x,f) for which r_u(x,f) >

r_u(x^MORA,f^MORA) for some u, we necessarily have r_v(x,f) < r_v(x^MORA,f^MORA) for some v = u. Indeed, if this was not the case then W (x,f) would be larger than W (x^MORA,f^MORA), which contradicts the fact that the optimal MORA allocation (x^MORA,f^MORA) maximizes W (x, f).

Thus, Pareto optimality in this context means that if under some other user association choice, a user sees a higher throughput than that under MORA then there must be another user which sees a lower throughput allocation. Note that this need not always be the case. Consider, for instance, a network with |U| users, such that the largest cub of each user corresponds to a different base station. While the optimal allocation would associate each user to the base station with largest c_ub, a criterion based on local decisions that looks at users one by one may lead to a different association. The above result guarantees that this will not happen under MORA.

D. Gains and Savings

In the following we evaluate the benefits of MORA. To that end, we introduce a simple baseline – static slicing (SS), a proxy for not sharing resources at all.⁶

1) Static Slicing (SS) Baseline: Suppose each operator contracts for a fixed slice/fraction s_oof the network resources at each base station for its exclusive use. The operator can of course still optimize its users associations, x^o = (x_ub : u ∈ Uo, b ∈ B), and allocation of resources f^o = (f_ub : u∈ Uo, b∈ B), so as to maximize its utility. Specifically each operator o∈ O can determine its user association and resource allocations based on:

maxxô,fô U_o(xô,fô)

subject to r_u(x^o,f^o) =

b∈B

x_ubf_ubc_ub, ∀u ∈ Uo,

b∈B

x_ub= 1, ∀u ∈ Uo,

u∈Uo

f_ubx_ub ≤ so, ∀b ∈ B,

x_ub ∈ {0, 1}, fub≥ 0, ∀b ∈ B, ∀u ∈ Uo. (4) This is similar to MORA except limited to the operator o’s current usersUo and the resource constraint corresponds only to the fixed slice s_o allocated to the operator at each base station. Although the user associations and resource allocations under static slicing are independently optimized by each operator, we shall let x^SS,f^SS be a (possibly not unique) optimal choice across all operators under static slicing. Also paralleling our discussion of MORA, it is easy to show that if one fixes a feasible user association x^∗, (4) is convex and yields resource allocations given by

f^S(x^∗) := (f_ub^∗ (x^∗) : ∀u ∈ U, ∀b ∈ B),

6By slicing we refer to the way resources are shared (or sliced) among operators (while resource allocation refers to the allocation of resources to specific users). In contrast to the dynamic nature of MORA-based slicing, static slicing divides the infrastructure in fixed fractions.

(5)

where

f_ub^∗(x^∗) =x^∗_ubs_o

v∈Uox^∗_vb1{u ∈ Uo}, (5) i.e., this is again a weighted proportionally fair allocation of the operators’ slice of the base station resources.

2) Operator Utility Gains and Protection: The overall net- work utility under MORA is clearly larger than that under the more constrained allocations possible under SS. This however does not guarantee that a given operator’s utility under MORA is greater than that under SS. Below we show that for the same user association an operator utility under MORA exceeds that under SS, indicating that beyond the overall network utility, we have that each operator is indeed better off. This shows that MORA effectively protects operators when sharing their resources with other operators, which is very important to ensure that operators accept this criterion. Note that the result is completely general and holds for any possible scenario.⁷

Theorem 1: For a given user association x, MORA’s resource allocationf^M(x) (see Eq. 3) achieves a higher utility than that of SS given byf^S(x) (see Eq. 5), i.e., for all o ∈ O

U_o(x, f^M(x)) ≥ Uo(x, f^S(x)).

3) Capacity Savings: Next we consider the capacity savings resulting from operators sharing infrastructure. Specifically we compare the spectrum capacities, i.e., total amount of resource, required to achieve the same average utility per operator under MORA and SS. The aim is to give some intuition on the typical savings one might expect and its dependence on the network load, number of operators and their shares. For tractability we will examine a scenario where traffic is spatially homogenous and operators’ network shares are proportional to their load.

We consider a network model in which there is a fixed total number of users|U| of which each operator contributes a fixed number of users proportional its network share s_o, i.e., n_o= s_o|U| which are assumed to be integer valued. Each operator’s users are randomly (uniformly) distributed amongst the |B|

base stations, so the number of users of operator o associated with base station b, is given by a random variable N_o,b, such that N_o,b ∼ Binomial(no,_|B|¹ ). The total number of users at base station b is denoted by a random variable N_b =

o∈ON_o,b ∼ Binomial(|U|,_|B|¹ ). We also assume for simplicity that users have the same capacity c_ub = c to the base stations with which they associate.

Note that under the above traffic model all users u have the same weight w_u = _n^s^o

o = 1/|U|. Thus expected overall network utility under MORA is given by:

W¯ = E

o∈O

b∈B

N_obw_ulog

c N_b

= E

b∈B

o∈O

N_ob

|U| log

c N_b

= E

b∈B

N_b

|U|log

c N_b

= |B|

|U|E

N_blog

c N_b

,

7The proofs of the theorems are provided in the Appendix.

where the last equality follows by using the uniformity of traffic across base stations. Moreover, under our model the network utility ¯W is the average utility across all users, which by symmetry is equal to the expected utility of a given operator o under MORA, i.e., ¯U_o^MORA= ¯W.

Now applying Taylor’s approximation to the function xlog(c/x) at E[Nb] we obtain

N_blog

c N_b

≈ E[Nb] log

c E[Nb]

+

log

c E[Nb]

− 1

· (Nb− E[Nb]) − 1

2E[Nb](N_b− E[Nb])², which in turn gives

E

N_blog

c N_b

≈ E[Nb] log

c E[Nb]

− 1

2E[Nb]Var(N_b).

Since N_b ∼ Binomial(|U|,_|B|¹ ) we have that Var(N_b) =

|U||B|(1 − _|B|¹ ) ≈^|U|_|B|, and so U¯_o^MORA≈ log

c E[Nb]

− |B|

2|U|. (6)

Let Δ_o denote the extra capacity that operator o would require under SS to achieve the above utility. The expected utility experienced by operator o under SS is given by

U¯_o^SS = E

b∈B

N_o,b n_o log

s_oc(1 + Δ_o) N_o,b

= |B|

n_oE

N_o,blog

s_oc N_o,b

+ log(1 + Δ_o).

Again using a Taylor expansion this can be approximated as

U¯_o^SS ≈ log

s_oc E[No,b]

−|B|

n_o

Var(N_o,b)

2E[No,b] + log(1 + Δ_o).

Noting that Var(N_o,b) ≈ so|U|

|B| = _|B|ⁿ^o we have that U¯_o^SS ≈ log

c E[Nb]

− |B|

2n_o + log(1 + Δ_o). (7) Finally equating the expected utilities, i.e., (6) and (7), we obtain the following estimate of the necessary extra capacity Δ_orequired when static slicing rather than MORA is used:

log(1 + Δ_o) ≈ |B|

2n_o × (1 − so). (8) where under our traffic load model n_o= s_o|U|.

This result gives a clear intuition on the possible savings resulting from sharing the infrastructure with MORA dynamic slicing. In particular, the savings increase exponentially in the product of two terms. The first is inversely proportional to the average number of users operator o has per base station, i.e., n_o/|B|; indeed, if the operator has a large number of users, its multiplexing gain is already high without sharing the infrastructure, and hence there is little gain from sharing.

The second term is large when the operator has a small network share: if its share is high, the operator is using most of the network resources and there is little sharing.

(6)

In summary, capacity savings will be highest when infrastructure is shared by a large number of operators each with a small number of users per base station. With current trends towards small cells, the number of users per base station is expected to be small, suggesting that infrastructure sharing may be particularly beneficial.

III. APPROXIMATIONALGORITHM FORMORA The analysis in previous section and simulations to be presented in the sequel suggest that MORA resource allocation across operators not only has desirable characteristics but will make efficient use of resources while protecting operators from one another. Unfortunately, as we show below, the complexity and information overheads associated with doing so for are already high for a static system, and excessive when operators’

mobile users and associated channels are subject to constant change. In this section we further discuss the state-of-the-art algorithms to tackle MORA, and then propose an approximation algorithm based on a sequence of theoretical results and insights that support the design.

A. Complexity and State-of-the-Art Algorithms

The optimization problem underlying MORA is a non- linear integer programming problem, which can be shown to be NP-hard and hence there is no polynomial time algorithm unless P = N P .

Theorem 2: The MORA problem is NP-hard.

There have been a number of works in the literature devoted to solving problems similar to MORA. In particular, [8] proposes an approximation algorithm for the single operator case with guaranteed performance bounds. However, their approach is still computationally demanding; indeed, the results in Section IV-D, show that for a network with only 100 users, the algorithm takes 20 seconds on a dual- core 2.8GHz processor. Given that this would need to be executed every time c_ubvalues change or new users enter/leave the network,this seems computationally impractical. Moreover the proposed approach is centralized, so there would be a substantial information overhead to gather the c_ubof each user to each potential base stations, given the amount of data and dynamic nature of mobile users.

In the multi-operator setting, [22] proposes an approach based on using a standard non-linear solver to address a problem similar to MORA. Unfortunately the approach is also very complex and centralized. Indeed, our evaluation of this proposal in Section IV-D, shows that the time required to execute this algorithm increases sharply with the number of users, making it impractical at about 50 users. Moreover, [22] does not provide any analytical performance bounds.

In summary, to make dynamic multi-operator resource sharing possible, a new radically simplified approach is required.

It should have low computational complexity and be based on distributed operation requiring only local information, to allow near real-time operation.

B. Algorithm Design

In the following, we devise an algorithm for MORA that can be used in practical deployments. In contrast to previous

approaches, our algorithm involves a low computational complexity and relies on data that can be gathered from neighboring base stations, allowing for a distributed implementation.⁸

Given the user dynamics, i.e., joining, moving and leaving the network, an offline algorithm that computes an optimal resource allocation for a fixed set of users is impractical.

Instead, we will pursue an approach that tracks users dynamics, and occasionally adjusts resource allocations by modifying current or new users’ associations. Since reassociations of current users correspond to handoffs, their number should be kept to a minimum. To design such an algorithm, we need to answer

• Do we really need to reassociate users?

• Where should users be (re)associated to?

• In which order should users be reassociated?

• How many reassociations do we need?

For each of these questions, in the following we provide some theoretical analysis that eventually leads to our proposed algorithm. In all cases, once a user association x is set, resources at each base station are allocated according MORA’s resource allocationf^M(x).

1) Need for Reassociations: Following the standard termi- nology of online algorithms, we say that an algorithm is online if, upon a user joining the network, it only decides how to associate the new user, without triggering any reassociations of existing users. We say the algorithm is semi-online if it can further trigger reassociations of a limited number of users.

Thus our first question is whether an online algorithm would suffice. The following theorem suggests that the performance of an online algorithm can be arbitrarily bad, motivating us to consider semi-online approaches.

Theorem 3: Consider an online algorithm that triggers no reassociations of existing users. Let(x,f) denote the solution resulting from this algorithm and(x^MORA,f^MORA) a MORA optimal solution. Then, W(x^MORA,f^MORA)−W (x,f) can- not be bounded.

2) Criterion for (Re)associations: Next we address the question regarding how to associate, or reassociate, users to base stations. In particular, consider a Distributed Greedy algorithm wherein we iteratively examine (in arbitrary order) if there is a user which could change her association to increase her rate, and if this is the case, she chooses to re-associate with the base station providing the largest rate. The following result characterizes the performance of this algorithm if an equilibrium is reached.

Theorem 4: Let (x,f) be an equilibrium allocation for the Distributed Greedy algorithm, and (x^MORA,f^MORA) a MORA optimal solution, then⁹

W(x,f) ≥ W (x^MORA,f^MORA) − log(e).

There exists an instance of the problem for which it holds that W(x,f) = W (x^MORA,f^MORA) − log (2).

8Note that, while the algorithm implementation is distributed, the logic is centralized: i.e., we assume that the algorithm is run centrally by a single entity, without the intervention of the different operators.

9To gain some intuition on this bound, we note that a log(e) gap is equivalent to reducing the throughput of each user by a factor ofe.

(7)

Note that the above bound of log(e) is fairly close to the log(2) bound provided by [8]. This is quite remarkable, considering that the algorithm proposed in [8] is centralized and much more complex. Furthermore, the theorem shows that the bound is rather tight, as there exists a problem instance that provides a gap of log(2), which is quite close to the log(e) bound.

While the above theorem bounds network utility in equilibrium, we have not established the convergence of this algorithm to an equilibrium. The convergence of this type of algorithms has received substantial attention in the litera- ture [12], [28], [29]. Indeed, since the throughput of user u is an increasing function of c_ub/

v∈Uw_vx_vb, the Distributed Greedy algorithm can be viewed as a congestion game in which the load at a base station is given by the sum of weights of the users at the base station, l_b =

v∈Uw_vx_vb, and a user seeks to minimize a_ubl_b, where a_ub= 1/c_ub. This game falls in the category of a singleton weighted congestion game with player-specific multiplicative constants and linear variable cost. Based on the lack of a counter-example and the existence of polynomial-time algorithms for special cases, [28] conjectures that this type of games have an equilibrium (see [28, Conjecture 3.7]). Based on the simulations we have run for numerous instances of the game, we further conjecture that the Distributed Greedy algorithm (which implements a best response dynamics) converges to this equilibrium.

In particular, Distributed Greedy satisfies W (x,f) ≥ W(x^MORA, f^MORA) − log(e), while [8] proposes an algorithm that provides a throughput larger than r_u(x^MORA,f^MORA)/(2 + ) to all users, which translates into W (x,f) ≥ W (x^MORA,f^MORA) − log (2 + ); hence, the algorithm of [8] provides only a slightly tighter bound than Distributed Greedy.

3) Order of Reassociations: While our analysis of the Dis- tributed Greedy algorithm suggests a user should (re)associate to maximize her rate, it does not indicate in which order user reassociations should be considered to speed up convergence.

To address this, we consider the Greedy Largest Gain algo- rithm, which operates as the Distributed Greedy algorithm but at each iteration updates the association of the user achieving the highest gain, i.e., the one achieving the largest r^new_u /r^old_u , where r_u^old is the user’s current throughput and r^new_u is the throughput she would receive under the improved association.

The following theorem shows that the Greedy Largest Gain algorithm exhibits a desirable convergence property. In particular, one can guarantee that at each iteration the network utility increases until it reaches W (x^MORA,f^MORA)−2 log(e), and from then on it never decreases below W (x^MORA,f^MORA)−

(2 + max_uw_u) log(e). Note that Distributed Greedy does not exhibit this kind of behavior: if we select users in an arbitrary order, the network utility may decrease at any iteration (as the increase in utility of the reassociated user may be smaller than the decrease experienced by the other users).

Theorem 5: Let (xⁱ,fⁱ) be the solution at the i^th iteration of the Greedy Largest Gain algorithm and (x^MORA,f^MORA) a MORA optimal solution. Then W (xⁱ,fⁱ)

increases at each iteration until W(xⁱ,fⁱ) ≥ W(x^MORA,f^MORA) − 2 log(e), and thereafter it never decreases below W(x^MORA,f^MORA)−(2+maxuw_u) log(e).

4) Proposed Algorithm: Greedy Local Largest Gain. Based on the above considerations we now propose our algorithm for MORA, the Greedy Local Largest Gain algorithm. We shall first describe how it operates at a high level, and then provide a more detailed algorithmic description. When a user joins the network, she greedily joins the base station providing the largest throughput. However, as we have seen, we may need to consider triggering user reassociations. To limit their number and associated handoffs overheads we constrain these to at most m. For the first m − 1 reassociations, users choose the base station that provides the largest throughput, but in the m^th the user chooses the base station so as to maximize the network utility W (x, f). In each of these steps, we select which user to reassociate (if any) based on Greedy Largest Gain criterion, but instead of considering all users in the network, involving possibly a high overhead, we restrict the selection locally to users associated with only two base stations (see below).

In a dynamic and time-varying setting, the algorithm needs to consider the following cases: (i) a user joins the network, (ii) leaves, or (iii) changes her location. The algorithm for a joining user is detailed in the pseudocode of Algorithm 1.

The rationale is as follows. In the optimal allocation, users are somehow balanced among base stations, users’ weights playing a role in this balance. When a new user joins the network, the balance is broken and the base station with which the user associates may have too many users. Hence, in the first step we reassociate one of the users of this base station.

In the next step, the base station that received the reassociated user may have too many users; however, depending on the weights of the joining and reassociated users, the original base station may still have too many users as well. Hence, we consider the users from the two base stations as candidates for reassociation. We repeat this, considering users from two base stations, in the subsequent steps. Finally, in the last step, to avoid that the reassociation of a user harms the overall performance, we select the base station association that maximizes the overall network utility rather than the throughput of the reassociated user.

When a user leaves the network, the algorithm is quite similar (pseudocode omitted for space reasons). When she moves, her c_ub values to the neighboring base stations may change; if, as a result of these changes, at some point the user would receive a larger throughput in a new base station, we reassociate her to this base station. Then, the old base station executes the same algorithm as when a user leaves the network while the new base station executes the algorithm corresponding to a joining user.

5) Controlling the Number of Reassociations: The remain- ing question is how to set the limit on the number of reassociations m, which determines the trade-off between the performance of the algorithm and reassociation overhead. Such trade-offs have been analyzed for a similar setting in [30], which aims to distribute tasks among servers (where each task can only be associated to a restricted set of servers)

(8)

Algorithm 1 GLLG User Joining Definitions:

r_v,b: throughput of user v if she associates to b;

r_v: current throughput of user v;

Ub: set of users associated to b, (u ∈ U s.t. xu,b= 1);

U_{c∪p}: set of users associated to c or p;

W_u,q : network utility if user u associates to q;

Input: x

User v joins the network:

b= arg max

b∈B

r_v,b;

x_v,b = 1 ← Associate user v with base station b; [u^∗, p^∗] = arg max

(u,p)∈Ub×B ru,p

ru ; if r_u∗,p^∗/r_u>1 then

Associate user u^∗ with base station p^∗, x_u∗p^∗ = 1;

else stop

c= p^∗ (current base station);

p= b (previous base station);

for m− 1 times do [u^∗, q^∗] = arg max

(u,q)∈U{c∪p}×B ru,q

ru ; if r_u^∗_,q^∗/r_u>1 then

Associate user u^∗ with base station q^∗, x_u^∗_q^∗ = 1;

c← q^∗; p← previous base station of user u^∗; else

stop

W ← current network utility;

[u^∗, q^∗] = arg max

(u,q)∈U{c∪p}×B Wu,q

W ; if W_u∗,q^∗/W >1 then

Associate user u^∗ with base station q^∗, x_u∗q^∗ = 1;

in such a way that the maximum load across all servers is minimized. This problem is similar to ours, with tasks and servers corresponding to users and base stations respectively, in the particular case where all users have the same w_uand c_ub. Not unlike their setting, the performance in this case is optimized when base station loads are as balanced as possible (i.e., the highest load is minimized). According to the analysis of [30], the performance in terms of the highest load with our algorithm (which has a limit of m reassociations) over the highest load with the optimal algorithm (with no constraint m) is given by O(e¹⁻^ln|B|^m ). This shows that algorithm’s perfor- mance improves rapidly (exponentially) in m, and suggests a small m suffices to achieve near-optimal network utility.

To further explore the impact of m on network utility, we present the following simulation results (see Section IV for a description of the simulation setup). Here, W (m) is the network utility achieved for a given m value, W (∞) is the utility with unconstrained overhead, W (0) is the utility with no reassociations, and G_W(m) .= 1 −W (m)−W (∞)

W (0)−W (∞) represents the normalized utility gain with m reassociations, showing how close we get to the unconstrained overhead utility.

Fig. 1 depicts this gain as a function of m for different

Fig. 1. Normalized utility gain as a function ofm.

scenarios. As can be seen, utility gains increase very sharply.

Furthermore, for m = 3 the gains are already very close to their maximum value; based on this, we set m equal to 3 (this is indeed the value used in the experiments of Section IV).

With this setting, the proposed algorithm only introduces a small overhead, since our approach may trigger up to three handovers for every handover performed by a “traditional”

solution [31].

IV. PERFORMANCEEVALUATION

Next, we evaluate the performance of our proposed approach. The mobile network scenario considered is based on the IMT Advanced evaluation guidelines for dense ‘small cell’

deployments [32]. It consists of base stations with an intersite distance of 200 meters in a hexagonal cell layout with 3 sector antennas (thus in this setting users will associate with sectors rather than the base stations we used in our algorithm description). The Signal Interference to Noise Ratio (SINR) is computed as in [25], SINR_ub = P_bg_ub/(

k∈B,k=bP_kg_uk + σ²), where P_b is the transmit power and g_ub denotes the channel gain between user u and base station b, which includes path loss, shadowing, fast fading and antenna gain. Following [32], we set P_b = 41 dBm, σ² = −104dB, path loss equal to 36.7 log₁₀(dist) + 22.7 + 26 log₁₀(f_c) for carrier frequency f_c = 2.5GHz, and antenna gain of 17 dBi. The shadowing factor is given by a log-normal function with a standard deviation of 8dB (as in [25]) updated every second, and fast fading follows a Rayleigh distribution dependent of the user speed and the angle of incidence (as in [33]). Achievable rates are then computed with the Shannon formula, BW log₂(1 + SINR_ub), for the average SINR_ub given by fading and shadowing [24] and a channel bandwidth of BW = 10MHz [24].

Finally, the modulation-coding scheme is selected according to the SINR_ub thresholds reported in [34]. Unless otherwise stated (i) users move according to the Random Waypoint Model (RWP) with speeds uniformly distributed between 0.2 and 4 m/s and pause intervals between 0 and 10 seconds, (ii) network size|B| is 57 sectors, (iii) all operators have the same share, and (iv) the number of users of each operator is proportional to s_o, i.e.,|Uo| = |U| · so. Confidence intervals are below 1%.

(9)

Fig. 2. Utility gains for different approaches as a function of the network size.

A. Utility Gains

We start by evaluating the gains in terms of the overall network utility. We consider a scenario with a user density of 10 users/sector and 3 operators, and plot W (x, f) as a function of the network size |B|. In this setting, we com- pare the performance of our algorithm for dynamic sharing, Greedy Local Largest Gain (‘GLLG’), against the following approaches:

i) SINR-based Static Slicing (‘SINR SS’): the resources of each sector are statically divided among operators and users associate with the based station with highest SINR;

ii) Distributed Greedy Static Slicing (‘DG SS’): resources are also sliced statically and user associations follow the Distributed Greedy algorithm discussed in Section III-B.2;

iii) Distributed Greedy (‘DG’): this is the algorithm for dynamic sharing presented in Section III-B.2;

iv) Centralized (‘Centralized’): this is the centralized algo- rithm proposed in [8].

The results are exhibited in Fig. 2. We draw the following conclusions: (i) significant gains result from both improving user association (DG SS vs. SINR SS) and sharing resources dynamically (DG vs. DG SS); (ii) the Distributed Greedy approach of Section III-B.2 performs almost at the same level of the baseline approach of [8] (DG vs. Centralized);

and (iii) the proposed approach performs closely to these two approaches, although it pays a small price for reducing the handoff overheads (GLLG vs. DG).

In addition to the overall network gain, it is also interesting to look at the gains of the individual operators. Theorem 1 showed that the difference in operator’s utility under MORA and SS is positive as long as we have the same user association in both approaches; however, we would expect this to hold in general, i.e., even when we have different user associations.

To this end, we have evaluated the difference between the operator’s utility under MORA and SS over a large number of different scenarios and settings. We have observed that in all cases, MORA always provides better performance than SS to all individual operators, which confirms that MORA effectively protects all operators, ensuring gains to all of them.

Fig. 3. Capacity savings for different scenarios as a function of the number of operators.

Fig. 4. Validation of the theoretical results on capacity savings.

B. Capacity Savings

We next evaluate the benefits of our approach to operators based on the capacity savings they would achieve. Specifi- cally, consider a network operated under our algorithm for dynamic sharing, where the capacity (i.e., total amount of resource) of each base station is given by C_GLLG, and let C_baseline be the base stations’ capacity required to achieve the same network utility under two baselines: (a) static slic- ing with SINR-based user association, and (b) static slicing with enhanced user association (i.e., using our algorithm for user association). These two baselines allow us to study the potential gains earned due to a smarter user association and the gains achieved by dynamic resource sharing. Fig. 3 illustrates the corresponding capacity savings, computed as Δ = (Cbaseline− CGLLG)/CGLLG, for different numbers of operators,|O| ∈ {2, . . . , 6}, and three different user densities,

|U|/|B| = 5 (low density), |U|/|B| = 10 (medium) and

|U|/|B| = 15 (high). The results show that substantial gains can be realized, and that gains increase with the number of operators and decrease with per-sector user load. The latter is indeed rather intuitive, since under light user loads static slicing performs poorly while MORA obtains substantial benefits from statistical multiplexing.

In order to gain additional insight into the impact of the various factors, Fig. 4 displays the influence of the share of the operator (s_o) and the average load per base station