Multi-Tenant Radio Access Network Slicing:
Statistical Multiplexing of Spatial Loads
Pablo Caballero, Albert Banchs, Senior Member, IEEE, Gustavo de Veciana, Fellow, IEEE, and Xavier Costa-Pérez, Member, IEEE
Abstract— This paper addresses the slicing of radio access network resources by multiple tenants, e.g., virtual wireless operators and service providers. We consider a criterion for dynamic resource allocation amongst tenants, based on a weighted proportionally fair objective, which achieves desirable fairness/protection across the network slices of the different tenants and their associated users. Several key properties are established, including: the Pareto-optimality of user association to base stations, the fair allocation of base stations’ resources, and the gains resulting from dynamic resource sharing across slices, both in terms of utility gains and capacity savings. We then address algorithmic and practical challenges in realizing the proposed criterion. We show that the objective is NP-hard, making an exact solution impractical, and design a distributed semi-online algorithm, which meets performance guarantees in equilibrium and can be shown to quickly converge to a region around the equilibrium point. Building on this algorithm, we devise a practical approach with limited computational information and handoff overheads. We use detailed simulations to show that our approach is indeed near-optimal and provides substantial gains both to tenants (in terms of capacity savings) and end users (in terms of improved performance).
Index Terms— Wireless networks, multi-tenant networks, RAN-sharing, network slicing, resource allocation.
I. INTRODUCTION
D
RIVEN by the capacity requirements forecasted for future mobile networks as well as the decreasing mar- gins obtained by operators, infrastructure sharing has estab- lished itself as a key business model for mobile operators to reduce the deployment and operational costs of their networks (e.g., [1] reports a 280% increase in deals within the last 5 years). While passive and active sharing solutions, ranging from exclusive allocation of resources to roaming agreements, are used and have been standardized, these sharing approaches are based on fixed contractual agreements with Mobile Virtual Network Operators (MVNO) over long timeManuscript received May 8, 2016; revised December 23, 2016 and May 14, 2017; accepted June 12, 2017; approved by IEEE/ACM TRANS- ACTIONS ONNETWORKINGEditor K. Psounis. Date of publication July 17, 2017; date of current version October 13, 2017. The work of P. Caballero and G. de Veciana was supported by the NSF Award under Grant CNS-1343383.
The work of A. Banchs was supported in part by the H2020-ICT-2014-2 5G NORMA project under Grant 671584 and in part by the Spanish project DRONEXT under Grant TEC2014-58964-C2-1-R. The work of X. Costa- Pérez was supported by the H2020-ICT-2014-2 5G NORMA project under Grant 671584. (Corresponding author: Pablo Caballero.)
P. Caballero and G. de Veciana are with The University of Texas at Austin, Austin, TX 78712 USA (e-mail: [email protected]).
A. Banchs is with the University Carlos III of Madrid, 28911 Leganés, Spain, and also with the IMDEA Networks Institute, 28918 Leganés, Spain.
X. Costa-Pérez is with NEC Europe Ltd., 69115 Heidelberg, Germany.
Digital Object Identifier 10.1109/TNET.2017.2720668
periods (typically on a monthly/yearly basis). In this paper, we focus on a structured dynamic slicing approach which enables a much more efficient sharing of network resources, as envisioned by the 3GPP Network Sharing Enhancements for future mobile networks which the authors contributed to [2].
Following [3], our approach divides the infrastructure into network slices, assigning a different slice to each operator, and implements the sharing of network resources among operators by dynamically allocating resources to slices. Such a novel network slicing approach is expected to result in new business models and revenue sources for infrastructure providers (see, e.g., [3]). Indeed, this approach supports not only classical players (mobile operators) but also new ones such as Over-The-Top (OTT) service providers that may buy a slice of the network to ensure satisfactory service to their users (e.g., Amazon Kindle’s support for downloading con- tent or a pay TV channel including a premium subscription).
In the literature, the term tenants is often used to refer to the different types of players, and multi-tenancy refers to approaches enabling dynamic network slicing and resource sharing for multiple tenants. For simplicity, hereafter we use the term operator in a broad sense to refer to classical (virtual) operators as well as the new players enabled by this approach.
In designing a practical solution for dynamic resource sharing among slices we face multiple challenges. To start with, we need a sharing criterion that not only allocates resources to operators (and their corresponding slices) fairly, but also, shares the resources of each operator fairly among its users. Furthermore, the criterion should allow for flexible sharing “levels” to meet operators’ heterogeneous require- ments; for practical purposes, these levels should be coarse- grained, rather than based on instantaneous resource needs.
When allocating resources to an operator, one should take into account the numbers and locations of active users on the network–indeed some locations may see higher demand and (consequently) the associated resources might be scarce.
Beyond the criterion itself, designing an algorithm to imple- ment it, while realizing timely adaptation to network changes, is also very challenging. Given the amount of information involved (including the channel quality of each user) and its dynamic nature, the algorithm should be as distributed as possible. Also, since the algorithm may be triggered fre- quently (whenever a user joins, leaves or changes its location), it should be computationally efficient. When adapting to net- work changes, the algorithm should control the number of handoffs triggered, as those may represent a high overhead.
1063-6692 © 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Key Contributions
This paper proposes a criterion for slicing the network infrastructure amongst operators and an algorithm to allocate resources accordingly. The key contributions are as follows.
In Section II, we introduce a criterion for dynamic resource sharing among operators; while the criterion has been pro- posed before, we provide a characterization supporting its use in a multi-tenant network setting. These properties are developed in Section II-C, providing insights on the optimality and fairness of the resulting allocations, and the benefits are studied in Section II-D, by characterizing the capacity savings by means of a closed-formula. We show that the criterion not only improves overall network utility but also that of each individual operator, thus guaranteeing that operators are not harmed by the sharing of resources amongst slices.
In Section III-A, we show the criterion corresponds to an NP-hard problem, motivating the need to devise an efficient approximation algorithm which is introduced in Section III-B.
The proposed algorithm is semi-online, distributed, incurs low computational complexity, and has been specifically designed to control overheads associated with handoffs and/or mobile user reassociations; we rely on several intermediate ana- lytical results to drive the key design choices underlying our algorithm. Section IV provides a comprehensive perfor- mance evaluation based on detailed simulations, showing that (i) operators can save up to 80% capacity while providing the same quality to their users, and (ii) for a fixed capacity, we improve user performance in terms of file download times by up to 30%, among other results.
Related Work
We next review and contrast our work to the state-of-the- art in (i) resource allocation based on proportional fairness, and (ii) resource sharing among operators.
Considerable research effort has been devoted to address the problem of fair resource allocation in networks. In wireline networks, fair resource allocation based on utility function maximization has been extensively studied following the sem- inal work of [4]. Building on this work, further algorithms for congestion control in multi-path environments have been proposed [5], [6]. Not unlike our work, these algorithms are distributed. However, they allow users to decide among multiple routes while we focus on a wireless setting where each user can only use one resource (her base station).
In the specific context of wireless networks, several approaches have been proposed [7]–[9] to the problem of resource allocation and user association based on weighted and unweighted proportional fairness, respectively. The unweighted case has been largely studied in the literature in different contexts (e.g., power control [10], interference avoidance [11]). Bu et al. [7] and Son et al. [11] analyzed the complexity of the problem and proved the existence of polynomial time algorithms which provide an exact solution, and [9] designed a distributed algorithm with convergence guarantees. In contrast to the above, the resource allocation criterion proposed in this paper relies on weighted proportional fairness, with operator-specific weights; this is a more difficult
problem as it is NP-hard [7] and the convergence of distributed greedy algorithms cannot be guaranteed [12].
Weighted proportional fair resource allocation in the context of wireless networks has also been studied from different angles. In [8], an algorithm with tight worst-case performance bounds is proposed, while [13] proposes an heuristic algo- rithm. In contrast to the distributed approach proposed in this paper, both algorithms are centralized and require the availabil- ity of the full network state information, which may be very challenging to gather in a timely manner. Hou and Chen [14]
and Hou and Gupta [15] propose a Gibbs-sampling mechanism based on simulated annealing that converges to an optimal solution. However, the convergence of such mechanisms is known to be very slow and for this reason the authors resort to a more practical greedy solution. For the proposed greedy solution, the authors neither provide performance bounds nor analyze convergence; additionally, the overhead is not controlled, which limits their practical deployment. All the approaches mentioned above address the problem of a single- operator network, in contrast to our work which focuses on the slicing and sharing of resources among multiple operators.
Multi-operator network sharing has been studied from many different angles, including planning, economics, coverage, per- formance, etc. (see e.g. [16]–[18]). This paper focuses specif- ically on the design of algorithms for resource sharing among operators, which has been previously addressed by [19]–[23];
however, all these works differ substantially from ours in terms of scope, criterion or approach. In [19] and [20], the optimiza- tion of the network utility follows a different criterion from the one in this paper, weighted proportional fairness, which (as we show) provides many desirable properties. The works of [21] and [22] present a proportional fair formulation similar to ours; however, they do not provide a rationale for their choice, in contrast to the solid analytical arguments provided here. Furthermore, [21] does not address the algorithm design, while [22] uses a general non-linear solver that incurs a very high computational complexity (as confirmed by our results of Section IV-D). Finally, [23] follows a game theoretic approach where operators bid for resources, which results in a fundamentally different problem from the one addressed here.
In summary: (i) while there has been substantial research on proportional fair resource allocation, its application to multi- operator settings and the associated problems have not been studied, and (ii) in spite of the substantial work devoted to proportional fairness in general settings, there is a gap in the systematic study of distributed mechanisms for joint resource allocation and user association that build on analytical results.
II. RESOURCEALLOCATIONCRITERION
In this section, we formulate the optimization problem that will drive (i) the association of users to base stations, and (ii) the allocation of base stations’ resources to users.
Hereafter, we refer to this optimization as the multi-operator resource allocation (MORA) criterion. We show analytically that the criterion satisfies desirable properties in terms of opti- mality and fairness, and develop a simple model to evaluate the potential sharing gains of our network slicing approach.
A. System Model
We start by presenting our system model which was devel- oped with LTE/LTE-A systems in mind, but is generally applicable to cellular systems. Consider a network consisting of a set B of base stations (or sectors in case of sector antennas) that are shared by a set of operators O. At any given time, we let U denote the set of users sharing the network and Uo, o ∈ O the subsets of users belonging to each operator. An allocation of resources involves two sets of variables: (i) the association of users to base stations, denoted by x = (xub : u ∈ U, b ∈ B), where each user u is associated with a single base station, i.e., xub = 1 for one of the base stations and 0 otherwise, and (ii) the allocation of the resources of each base station among its associated users, denoted by f = (fub : u ∈ U, b ∈ B), where fub is the fraction of the base station b’s resources which are allocated to user u.1 Note that in our model we ignoring the discrete nature of such resources, and assume that fu,b can take any value in the continuous range [0,1].
We let ˜cubdenote the average rate per resource unit seen by user u at base station b under current radio conditions,2and let Cb be the base station’s total amount resources. Given that the user is allocated a fraction fub of the base station’s resources, her rate is given by fubCb˜cub. For notational convenience, we define the achievable rate of the user as cub := ˜cubCb, which yields the following rate allocation:
ru(x, f) :=
b∈B
xubfubcub.
Note that the definition of cubactually represents an abstrac- tion of the underlying physical resources, accounting for the various physical layer techniques (such as, e.g., power control or MU-MIMO) as well as the interference from different sources (including that of neighboring base stations).
In line with similar analyses in the literature [19], [21], [22], [24]–[26], we shall assume that cub is fixed for each user and base station pair.
B. MORA Criterion
In line with previous approaches [19], [21], [22], the under- lying assumption behind our criterion is that operators share the cost of deploying and/or maintaining the infrastructure, and the resources received by each operator should be based on the level of its (financial) contribution to the shared network:
if an operator contributes twice as much as another, it should roughly get twice the resources. To this end, each operator is assigned a network share so ∈ [0, 1], to represent its level of contribution to the network. Without loss of generality, these shares are normalized so that
o∈Oso= 1.
The proposed criterion allocates resources across operators dynamically, tracking changes in the numbers and locations of operators’ mobile users and the associated transmission rates cub. When doing this, we need to make sure that
1For instance, in LTE/LTE-Afubdenotes the fraction of physical Resource Blocks, in FDM the fraction of bandwidth and in TDM the fraction of time
2Note that such average rates depend on the choice of modulation and coding scheme(s) selected for the user, after averaging out short-term fluctuations.
(i) network resources are fairly shared among the various operators according to their share, and (ii) at the same time, the resources allocated to a given operator are fairly shared among the users of that operator. To achieve this, we follow an approach akin to that in [27]3: we maximize the overall network utility resulting from aggregating operator utilities, where the utility of an operator is in turn the aggregation of its users’ utilities. To this end, we define the overall network utility as the sum of operators’ utilities weighted by the shares,
W(x, f) =
o∈O
soUo(x, f),
and the operator utility as the sum utility of the operator’s users normalized by the number of users (where a user’s utility is logarithmic in its rate),
Uo(x, f) = 1
|Uo|
u∈Uo
log(ru(x, f)),
By weighting the operator utilities with the shares, we give higher priority to operators with larger shares, and by nor- malizing with the number of users, we avoid that operators with more users are better off. For instance, with this choice, under uniformly loaded base stations an operator with twice the share of another one will get twice as many resources, independent of the number of users of each. Combining the above equations, one can rewrite the network utility as follows:
W(x, f) =
o∈O
u∈Uo
wulog(ru(x, f)), (1) where the user weights wuare defined as the operator network share divided by the current number of users of the operator, i.e., wu = so/|Uo| (in simple terms, the network share of an operator is divided equally amongst its current users).4
With the above, we can now formulate the MORA optimiza- tion problem as follows. Such optimization corresponds to the weighted proportional fair criterion (see e.g. [4]) extended to a multi-operator setting that considers utilities of the operators, rather than the ones of the individual users5:
maxx,f W(x, f), (2a)
subject to: ru(x, f) =
b∈B
xubfubcub, ∀u (2b)
b∈B
xub= 1 and xub∈ {0, 1}, ∀b, u (2c)
u∈U
fubxub≤ 1 and fub≥ 0, ∀b, u. (2d)
In the sequel we shall letxMORA,fMORAdenote a (possibly not unique) optimal solution to this optimization problem.
3Reference [27] addresses a similar problem to ours in the context of users and flows, as it aims at allocating resources fairly to users while preserving fairness among the flows of each user.
4While our definition of network utility coincides with that for weighted proportional fairness, the criterion proposed here is fundamentally different:
we consider resource allocation across time and vary the weights with the number of users, while weighted proportional fairness typically focuses on a static scenario and relies on fixed weights.
5Note that (2c) ensures that a user is associated with one (and only one) base station.
This formulation provides the optimal resource allocation at a given time under the current cub values (given by the selected modulation-coding schemes); in a dynamic setting, such allocations would be re-evaluated when any of the cub values change, due to changes in the (average) channel quality.
Note that, once MORA returns the user association x and resource allocation f, physical layer techniques (such as MU-MIMO or power control) are employed to optimize per- formance, under the constraint that users are provided with rates proportional to the ru values given by MORA.
C. Properties of MORA Resource Allocation
Next, we show that the MORA criterion satisfies some desirable properties both in the way base stations’ resources are allocated to associated users, and the way users are associated with base stations.
1) Per-Base Station Resource Allocation: Let us first con- sider a general setting, where user associations to base sta- tions are fixed, to see how MORA allocates base station resources. Let x∗ be the fixed (not necessarily optimal) user to base station association. If we optimize the resource allocation f for this user association, i.e., maxfW(x∗,f) subject to (2b) and (2c), it can be seen from [8, Lemma 5.1]
that the resulting resource allocation is unique and given by fM(x∗) = (fubM(x∗) : u ∈ U, b ∈ B), where
fubM(x∗) =wux∗ub
v∈Uwvx∗vb. (3) Further ifx∗ = xMORA, then fM(x∗) = fMORA, i.e., we have MORA optimal allocation of network resources.
The above result is fairly intuitive. Users associated with a given base station are allocated resources proportionally to their weights wu. This can be viewed as follows. The share of an operator represents the total budget of the operator.
When assigning a weight wu= so/|Uo| to users, this share is distributed among the operator’s users, and hence the user’s weight represents the budget of a user. As the resources allocated to a user are inversely proportional to the sum of weights at her base station, the sum of weights can be viewed as the cost of a unit of resource at the base station.
Thus, operators with users associated with heavily loaded base stations will have to pay a higher cost (e.g, increase their network share or limit their overall number of users) or receive fewer resources.
The above shows that the number of active users that operators have on the network and their spatial distribution will impact the resources allocated under MORA. Indeed, allo- cations across base stations are coupled together through|Uo|, i.e., an operator with a large number of active users will have lower weights and likely lower per-user allocations. At the same time, the resources obtained by an operator heavily depend on the load at base stations to which its users will be associated with.
2) User Association: Next we study the MORA user associations. Building on the optimality of our formu- lation, we can show that the resource allocation result- ing from MORA is Pareto-optimal, which means that for
any alternative allocation (x,f) for which ru(x,f) >
ru(xMORA,fMORA) for some u, we necessarily have rv(x,f) < rv(xMORA,fMORA) for some v = u. Indeed, if this was not the case then W (x,f) would be larger than W (xMORA,fMORA), which contradicts the fact that the optimal MORA allocation (xMORA,fMORA) maxi- mizes W (x, f).
Thus, Pareto optimality in this context means that if under some other user association choice, a user sees a higher throughput than that under MORA then there must be another user which sees a lower throughput allocation. Note that this need not always be the case. Consider, for instance, a network with |U| users, such that the largest cub of each user corresponds to a different base station. While the optimal allocation would associate each user to the base station with largest cub, a criterion based on local decisions that looks at users one by one may lead to a different association. The above result guarantees that this will not happen under MORA.
D. Gains and Savings
In the following we evaluate the benefits of MORA. To that end, we introduce a simple baseline – static slicing (SS), a proxy for not sharing resources at all.6
1) Static Slicing (SS) Baseline: Suppose each operator contracts for a fixed slice/fraction soof the network resources at each base station for its exclusive use. The operator can of course still optimize its users associations, xo = (xub : u ∈ Uo, b ∈ B), and allocation of resources fo = (fub : u∈ Uo, b∈ B), so as to maximize its utility. Specifically each operator o∈ O can determine its user association and resource allocations based on:
maxxo,fo Uo(xo,fo)
subject to ru(xo,fo) =
b∈B
xubfubcub, ∀u ∈ Uo,
b∈B
xub= 1, ∀u ∈ Uo,
u∈Uo
fubxub ≤ so, ∀b ∈ B,
xub ∈ {0, 1}, fub≥ 0, ∀b ∈ B, ∀u ∈ Uo. (4) This is similar to MORA except limited to the operator o’s current usersUo and the resource constraint corresponds only to the fixed slice so allocated to the operator at each base station. Although the user associations and resource allocations under static slicing are independently optimized by each operator, we shall let xSS,fSS be a (possibly not unique) optimal choice across all operators under static slicing. Also paralleling our discussion of MORA, it is easy to show that if one fixes a feasible user association x∗, (4) is convex and yields resource allocations given by
fS(x∗) := (fub∗ (x∗) : ∀u ∈ U, ∀b ∈ B),
6By slicing we refer to the way resources are shared (or sliced) among operators (while resource allocation refers to the allocation of resources to specific users). In contrast to the dynamic nature of MORA-based slicing, static slicing divides the infrastructure in fixed fractions.
where
fub∗(x∗) =x∗ubso
v∈Uox∗vb1{u ∈ Uo}, (5) i.e., this is again a weighted proportionally fair allocation of the operators’ slice of the base station resources.
2) Operator Utility Gains and Protection: The overall net- work utility under MORA is clearly larger than that under the more constrained allocations possible under SS. This however does not guarantee that a given operator’s utility under MORA is greater than that under SS. Below we show that for the same user association an operator utility under MORA exceeds that under SS, indicating that beyond the overall network utility, we have that each operator is indeed better off. This shows that MORA effectively protects operators when sharing their resources with other operators, which is very important to ensure that operators accept this criterion. Note that the result is completely general and holds for any possible scenario.7
Theorem 1: For a given user association x, MORA’s resource allocationfM(x) (see Eq. 3) achieves a higher utility than that of SS given byfS(x) (see Eq. 5), i.e., for all o ∈ O
Uo(x, fM(x)) ≥ Uo(x, fS(x)).
3) Capacity Savings: Next we consider the capacity savings resulting from operators sharing infrastructure. Specifically we compare the spectrum capacities, i.e., total amount of resource, required to achieve the same average utility per operator under MORA and SS. The aim is to give some intuition on the typical savings one might expect and its dependence on the network load, number of operators and their shares. For tractability we will examine a scenario where traffic is spatially homogenous and operators’ network shares are proportional to their load.
We consider a network model in which there is a fixed total number of users|U| of which each operator contributes a fixed number of users proportional its network share so, i.e., no= so|U| which are assumed to be integer valued. Each operator’s users are randomly (uniformly) distributed amongst the |B|
base stations, so the number of users of operator o associated with base station b, is given by a random variable No,b, such that No,b ∼ Binomial(no,|B|1 ). The total number of users at base station b is denoted by a random variable Nb =
o∈ONo,b ∼ Binomial(|U|,|B|1 ). We also assume for simplicity that users have the same capacity cub = c to the base stations with which they associate.
Note that under the above traffic model all users u have the same weight wu = nso
o = 1/|U|. Thus expected overall network utility under MORA is given by:
W¯ = E
o∈O
b∈B
Nobwulog
c Nb
= E
b∈B
o∈O
Nob
|U| log
c Nb
= E
b∈B
Nb
|U|log
c Nb
= |B|
|U|E
Nblog
c Nb
,
7The proofs of the theorems are provided in the Appendix.
where the last equality follows by using the uniformity of traffic across base stations. Moreover, under our model the network utility ¯W is the average utility across all users, which by symmetry is equal to the expected utility of a given operator o under MORA, i.e., ¯UoMORA= ¯W.
Now applying Taylor’s approximation to the function xlog(c/x) at E[Nb] we obtain
Nblog
c Nb
≈ E[Nb] log
c E[Nb]
+
log
c E[Nb]
− 1
· (Nb− E[Nb]) − 1
2E[Nb](Nb− E[Nb])2, which in turn gives
E
Nblog
c Nb
≈ E[Nb] log
c E[Nb]
− 1
2E[Nb]Var(Nb).
Since Nb ∼ Binomial(|U|,|B|1 ) we have that Var(Nb) =
|U||B|(1 − |B|1 ) ≈|U||B|, and so U¯oMORA≈ log
c E[Nb]
− |B|
2|U|. (6)
Let Δo denote the extra capacity that operator o would require under SS to achieve the above utility. The expected utility experienced by operator o under SS is given by
U¯oSS = E
b∈B
No,b no log
soc(1 + Δo) No,b
= |B|
noE
No,blog
soc No,b
+ log(1 + Δo).
Again using a Taylor expansion this can be approximated as
U¯oSS ≈ log
soc E[No,b]
−|B|
no
Var(No,b)
2E[No,b] + log(1 + Δo).
Noting that Var(No,b) ≈ so|U|
|B| = |B|no we have that U¯oSS ≈ log
c E[Nb]
− |B|
2no + log(1 + Δo). (7) Finally equating the expected utilities, i.e., (6) and (7), we obtain the following estimate of the necessary extra capac- ity Δorequired when static slicing rather than MORA is used:
log(1 + Δo) ≈ |B|
2no × (1 − so). (8) where under our traffic load model no= so|U|.
This result gives a clear intuition on the possible savings resulting from sharing the infrastructure with MORA dynamic slicing. In particular, the savings increase exponentially in the product of two terms. The first is inversely proportional to the average number of users operator o has per base station, i.e., no/|B|; indeed, if the operator has a large number of users, its multiplexing gain is already high without sharing the infrastructure, and hence there is little gain from sharing.
The second term is large when the operator has a small network share: if its share is high, the operator is using most of the network resources and there is little sharing.
In summary, capacity savings will be highest when infrastructure is shared by a large number of operators each with a small number of users per base station. With current trends towards small cells, the number of users per base station is expected to be small, suggesting that infrastructure sharing may be particularly beneficial.
III. APPROXIMATIONALGORITHM FORMORA The analysis in previous section and simulations to be presented in the sequel suggest that MORA resource allocation across operators not only has desirable characteristics but will make efficient use of resources while protecting operators from one another. Unfortunately, as we show below, the complexity and information overheads associated with doing so for are already high for a static system, and excessive when operators’
mobile users and associated channels are subject to constant change. In this section we further discuss the state-of-the-art algorithms to tackle MORA, and then propose an approxima- tion algorithm based on a sequence of theoretical results and insights that support the design.
A. Complexity and State-of-the-Art Algorithms
The optimization problem underlying MORA is a non- linear integer programming problem, which can be shown to be NP-hard and hence there is no polynomial time algorithm unless P = N P .
Theorem 2: The MORA problem is NP-hard.
There have been a number of works in the literature devoted to solving problems similar to MORA. In particu- lar, [8] proposes an approximation algorithm for the single operator case with guaranteed performance bounds. However, their approach is still computationally demanding; indeed, the results in Section IV-D, show that for a network with only 100 users, the algorithm takes 20 seconds on a dual- core 2.8GHz processor. Given that this would need to be executed every time cubvalues change or new users enter/leave the network,this seems computationally impractical. Moreover the proposed approach is centralized, so there would be a substantial information overhead to gather the cubof each user to each potential base stations, given the amount of data and dynamic nature of mobile users.
In the multi-operator setting, [22] proposes an approach based on using a standard non-linear solver to address a problem similar to MORA. Unfortunately the approach is also very complex and centralized. Indeed, our evaluation of this proposal in Section IV-D, shows that the time required to execute this algorithm increases sharply with the number of users, making it impractical at about 50 users. Moreover, [22] does not provide any analytical performance bounds.
In summary, to make dynamic multi-operator resource shar- ing possible, a new radically simplified approach is required.
It should have low computational complexity and be based on distributed operation requiring only local information, to allow near real-time operation.
B. Algorithm Design
In the following, we devise an algorithm for MORA that can be used in practical deployments. In contrast to previous
approaches, our algorithm involves a low computational com- plexity and relies on data that can be gathered from neighbor- ing base stations, allowing for a distributed implementation.8
Given the user dynamics, i.e., joining, moving and leaving the network, an offline algorithm that computes an optimal resource allocation for a fixed set of users is impractical.
Instead, we will pursue an approach that tracks users dynam- ics, and occasionally adjusts resource allocations by modifying current or new users’ associations. Since reassociations of current users correspond to handoffs, their number should be kept to a minimum. To design such an algorithm, we need to answer
• Do we really need to reassociate users?
• Where should users be (re)associated to?
• In which order should users be reassociated?
• How many reassociations do we need?
For each of these questions, in the following we provide some theoretical analysis that eventually leads to our proposed algorithm. In all cases, once a user association x is set, resources at each base station are allocated according MORA’s resource allocationfM(x).
1) Need for Reassociations: Following the standard termi- nology of online algorithms, we say that an algorithm is online if, upon a user joining the network, it only decides how to associate the new user, without triggering any reassociations of existing users. We say the algorithm is semi-online if it can further trigger reassociations of a limited number of users.
Thus our first question is whether an online algorithm would suffice. The following theorem suggests that the performance of an online algorithm can be arbitrarily bad, motivating us to consider semi-online approaches.
Theorem 3: Consider an online algorithm that triggers no reassociations of existing users. Let(x,f) denote the solution resulting from this algorithm and(xMORA,fMORA) a MORA optimal solution. Then, W(xMORA,fMORA)−W (x,f) can- not be bounded.
2) Criterion for (Re)associations: Next we address the question regarding how to associate, or reassociate, users to base stations. In particular, consider a Distributed Greedy algorithm wherein we iteratively examine (in arbitrary order) if there is a user which could change her association to increase her rate, and if this is the case, she chooses to re-associate with the base station providing the largest rate. The following result characterizes the performance of this algorithm if an equilibrium is reached.
Theorem 4: Let (x,f) be an equilibrium allocation for the Distributed Greedy algorithm, and (xMORA,fMORA) a MORA optimal solution, then9
W(x,f) ≥ W (xMORA,fMORA) − log(e).
There exists an instance of the problem for which it holds that W(x,f) = W (xMORA,fMORA) − log (2).
8Note that, while the algorithm implementation is distributed, the logic is centralized: i.e., we assume that the algorithm is run centrally by a single entity, without the intervention of the different operators.
9To gain some intuition on this bound, we note that a log(e) gap is equivalent to reducing the throughput of each user by a factor ofe.
Note that the above bound of log(e) is fairly close to the log(2) bound provided by [8]. This is quite remarkable, considering that the algorithm proposed in [8] is centralized and much more complex. Furthermore, the theorem shows that the bound is rather tight, as there exists a problem instance that provides a gap of log(2), which is quite close to the log(e) bound.
While the above theorem bounds network utility in equilibrium, we have not established the convergence of this algorithm to an equilibrium. The convergence of this type of algorithms has received substantial attention in the litera- ture [12], [28], [29]. Indeed, since the throughput of user u is an increasing function of cub/
v∈Uwvxvb, the Distributed Greedy algorithm can be viewed as a congestion game in which the load at a base station is given by the sum of weights of the users at the base station, lb =
v∈Uwvxvb, and a user seeks to minimize aublb, where aub= 1/cub. This game falls in the category of a singleton weighted congestion game with player-specific multiplicative constants and linear variable cost. Based on the lack of a counter-example and the existence of polynomial-time algorithms for special cases, [28] conjectures that this type of games have an equilib- rium (see [28, Conjecture 3.7]). Based on the simulations we have run for numerous instances of the game, we fur- ther conjecture that the Distributed Greedy algorithm (which implements a best response dynamics) converges to this equilibrium.
In particular, Distributed Greedy satisfies W (x,f) ≥ W(xMORA, fMORA) − log(e), while [8] proposes an algorithm that provides a throughput larger than ru(xMORA,fMORA)/(2 + ) to all users, which translates into W (x,f) ≥ W (xMORA,fMORA) − log (2 + ); hence, the algorithm of [8] provides only a slightly tighter bound than Distributed Greedy.
3) Order of Reassociations: While our analysis of the Dis- tributed Greedy algorithm suggests a user should (re)associate to maximize her rate, it does not indicate in which order user reassociations should be considered to speed up convergence.
To address this, we consider the Greedy Largest Gain algo- rithm, which operates as the Distributed Greedy algorithm but at each iteration updates the association of the user achieving the highest gain, i.e., the one achieving the largest rnewu /roldu , where ruold is the user’s current throughput and rnewu is the throughput she would receive under the improved association.
The following theorem shows that the Greedy Largest Gain algorithm exhibits a desirable convergence property. In partic- ular, one can guarantee that at each iteration the network utility increases until it reaches W (xMORA,fMORA)−2 log(e), and from then on it never decreases below W (xMORA,fMORA)−
(2 + maxuwu) log(e). Note that Distributed Greedy does not exhibit this kind of behavior: if we select users in an arbitrary order, the network utility may decrease at any iteration (as the increase in utility of the reassociated user may be smaller than the decrease experienced by the other users).
Theorem 5: Let (xi,fi) be the solution at the ith iteration of the Greedy Largest Gain algorithm and (xMORA,fMORA) a MORA optimal solution. Then W (xi,fi)
increases at each iteration until W(xi,fi) ≥ W(xMORA,fMORA) − 2 log(e), and thereafter it never decreases below W(xMORA,fMORA)−(2+maxuwu) log(e).
4) Proposed Algorithm: Greedy Local Largest Gain. Based on the above considerations we now propose our algorithm for MORA, the Greedy Local Largest Gain algorithm. We shall first describe how it operates at a high level, and then provide a more detailed algorithmic description. When a user joins the network, she greedily joins the base station providing the largest throughput. However, as we have seen, we may need to consider triggering user reassociations. To limit their number and associated handoffs overheads we constrain these to at most m. For the first m − 1 reassociations, users choose the base station that provides the largest throughput, but in the mth the user chooses the base station so as to maximize the network utility W (x, f). In each of these steps, we select which user to reassociate (if any) based on Greedy Largest Gain criterion, but instead of considering all users in the network, involving possibly a high overhead, we restrict the selection locally to users associated with only two base stations (see below).
In a dynamic and time-varying setting, the algorithm needs to consider the following cases: (i) a user joins the network, (ii) leaves, or (iii) changes her location. The algorithm for a joining user is detailed in the pseudocode of Algorithm 1.
The rationale is as follows. In the optimal allocation, users are somehow balanced among base stations, users’ weights playing a role in this balance. When a new user joins the network, the balance is broken and the base station with which the user associates may have too many users. Hence, in the first step we reassociate one of the users of this base station.
In the next step, the base station that received the reassociated user may have too many users; however, depending on the weights of the joining and reassociated users, the original base station may still have too many users as well. Hence, we consider the users from the two base stations as candidates for reassociation. We repeat this, considering users from two base stations, in the subsequent steps. Finally, in the last step, to avoid that the reassociation of a user harms the overall performance, we select the base station association that maximizes the overall network utility rather than the throughput of the reassociated user.
When a user leaves the network, the algorithm is quite similar (pseudocode omitted for space reasons). When she moves, her cub values to the neighboring base stations may change; if, as a result of these changes, at some point the user would receive a larger throughput in a new base station, we reassociate her to this base station. Then, the old base station executes the same algorithm as when a user leaves the network while the new base station executes the algorithm corresponding to a joining user.
5) Controlling the Number of Reassociations: The remain- ing question is how to set the limit on the number of reassociations m, which determines the trade-off between the performance of the algorithm and reassociation overhead. Such trade-offs have been analyzed for a similar setting in [30], which aims to distribute tasks among servers (where each task can only be associated to a restricted set of servers)
Algorithm 1 GLLG User Joining Definitions:
rv,b: throughput of user v if she associates to b;
rv: current throughput of user v;
Ub: set of users associated to b, (u ∈ U s.t. xu,b= 1);
U{c∪p}: set of users associated to c or p;
Wu,q : network utility if user u associates to q;
Input: x
User v joins the network:
b= arg max
b∈B
rv,b;
xv,b = 1 ← Associate user v with base station b; [u∗, p∗] = arg max
(u,p)∈Ub×B ru,p
ru ; if ru∗,p∗/ru>1 then
Associate user u∗ with base station p∗, xu∗p∗ = 1;
else stop
c= p∗ (current base station);
p= b (previous base station);
for m− 1 times do [u∗, q∗] = arg max
(u,q)∈U{c∪p}×B ru,q
ru ; if ru∗,q∗/ru>1 then
Associate user u∗ with base station q∗, xu∗q∗ = 1;
c← q∗; p← previous base station of user u∗; else
stop
W ← current network utility;
[u∗, q∗] = arg max
(u,q)∈U{c∪p}×B Wu,q
W ; if Wu∗,q∗/W >1 then
Associate user u∗ with base station q∗, xu∗q∗ = 1;
in such a way that the maximum load across all servers is minimized. This problem is similar to ours, with tasks and servers corresponding to users and base stations respectively, in the particular case where all users have the same wuand cub. Not unlike their setting, the performance in this case is opti- mized when base station loads are as balanced as possible (i.e., the highest load is minimized). According to the analysis of [30], the performance in terms of the highest load with our algorithm (which has a limit of m reassociations) over the highest load with the optimal algorithm (with no constraint m) is given by O(e1−ln|B|m ). This shows that algorithm’s perfor- mance improves rapidly (exponentially) in m, and suggests a small m suffices to achieve near-optimal network utility.
To further explore the impact of m on network utility, we present the following simulation results (see Section IV for a description of the simulation setup). Here, W (m) is the network utility achieved for a given m value, W (∞) is the utility with unconstrained overhead, W (0) is the utility with no reassociations, and GW(m) .= 1 −W (m)−W (∞)
W (0)−W (∞) represents the normalized utility gain with m reassociations, showing how close we get to the unconstrained overhead utility.
Fig. 1 depicts this gain as a function of m for different
Fig. 1. Normalized utility gain as a function ofm.
scenarios. As can be seen, utility gains increase very sharply.
Furthermore, for m = 3 the gains are already very close to their maximum value; based on this, we set m equal to 3 (this is indeed the value used in the experiments of Section IV).
With this setting, the proposed algorithm only introduces a small overhead, since our approach may trigger up to three handovers for every handover performed by a “traditional”
solution [31].
IV. PERFORMANCEEVALUATION
Next, we evaluate the performance of our proposed approach. The mobile network scenario considered is based on the IMT Advanced evaluation guidelines for dense ‘small cell’
deployments [32]. It consists of base stations with an intersite distance of 200 meters in a hexagonal cell layout with 3 sector antennas (thus in this setting users will associate with sectors rather than the base stations we used in our algorithm descrip- tion). The Signal Interference to Noise Ratio (SINR) is com- puted as in [25], SINRub = Pbgub/(
k∈B,k=bPkguk + σ2), where Pb is the transmit power and gub denotes the channel gain between user u and base station b, which includes path loss, shadowing, fast fading and antenna gain. Following [32], we set Pb = 41 dBm, σ2 = −104dB, path loss equal to 36.7 log10(dist) + 22.7 + 26 log10(fc) for carrier frequency fc = 2.5GHz, and antenna gain of 17 dBi. The shadowing factor is given by a log-normal function with a standard deviation of 8dB (as in [25]) updated every second, and fast fading follows a Rayleigh distribution dependent of the user speed and the angle of incidence (as in [33]). Achievable rates are then computed with the Shannon formula, BW log2(1 + SINRub), for the average SINRub given by fading and shad- owing [24] and a channel bandwidth of BW = 10MHz [24].
Finally, the modulation-coding scheme is selected according to the SINRub thresholds reported in [34]. Unless otherwise stated (i) users move according to the Random Waypoint Model (RWP) with speeds uniformly distributed between 0.2 and 4 m/s and pause intervals between 0 and 10 seconds, (ii) network size|B| is 57 sectors, (iii) all operators have the same share, and (iv) the number of users of each operator is proportional to so, i.e.,|Uo| = |U| · so. Confidence intervals are below 1%.
Fig. 2. Utility gains for different approaches as a function of the network size.
A. Utility Gains
We start by evaluating the gains in terms of the overall network utility. We consider a scenario with a user density of 10 users/sector and 3 operators, and plot W (x, f) as a function of the network size |B|. In this setting, we com- pare the performance of our algorithm for dynamic sharing, Greedy Local Largest Gain (‘GLLG’), against the following approaches:
i) SINR-based Static Slicing (‘SINR SS’): the resources of each sector are statically divided among operators and users associate with the based station with highest SINR;
ii) Distributed Greedy Static Slicing (‘DG SS’): resources are also sliced statically and user associations fol- low the Distributed Greedy algorithm discussed in Section III-B.2;
iii) Distributed Greedy (‘DG’): this is the algorithm for dynamic sharing presented in Section III-B.2;
iv) Centralized (‘Centralized’): this is the centralized algo- rithm proposed in [8].
The results are exhibited in Fig. 2. We draw the following conclusions: (i) significant gains result from both improving user association (DG SS vs. SINR SS) and sharing resources dynamically (DG vs. DG SS); (ii) the Distributed Greedy approach of Section III-B.2 performs almost at the same level of the baseline approach of [8] (DG vs. Centralized);
and (iii) the proposed approach performs closely to these two approaches, although it pays a small price for reducing the handoff overheads (GLLG vs. DG).
In addition to the overall network gain, it is also interesting to look at the gains of the individual operators. Theorem 1 showed that the difference in operator’s utility under MORA and SS is positive as long as we have the same user association in both approaches; however, we would expect this to hold in general, i.e., even when we have different user associations.
To this end, we have evaluated the difference between the operator’s utility under MORA and SS over a large number of different scenarios and settings. We have observed that in all cases, MORA always provides better performance than SS to all individual operators, which confirms that MORA effectively protects all operators, ensuring gains to all of them.
Fig. 3. Capacity savings for different scenarios as a function of the number of operators.
Fig. 4. Validation of the theoretical results on capacity savings.
B. Capacity Savings
We next evaluate the benefits of our approach to operators based on the capacity savings they would achieve. Specifi- cally, consider a network operated under our algorithm for dynamic sharing, where the capacity (i.e., total amount of resource) of each base station is given by CGLLG, and let Cbaseline be the base stations’ capacity required to achieve the same network utility under two baselines: (a) static slic- ing with SINR-based user association, and (b) static slicing with enhanced user association (i.e., using our algorithm for user association). These two baselines allow us to study the potential gains earned due to a smarter user association and the gains achieved by dynamic resource sharing. Fig. 3 illustrates the corresponding capacity savings, computed as Δ = (Cbaseline− CGLLG)/CGLLG, for different numbers of operators,|O| ∈ {2, . . . , 6}, and three different user densities,
|U|/|B| = 5 (low density), |U|/|B| = 10 (medium) and
|U|/|B| = 15 (high). The results show that substantial gains can be realized, and that gains increase with the number of operators and decrease with per-sector user load. The latter is indeed rather intuitive, since under light user loads static slicing performs poorly while MORA obtains substantial benefits from statistical multiplexing.
In order to gain additional insight into the impact of the various factors, Fig. 4 displays the influence of the share of the operator (so) and the average load per base station