Choquet theorem and its applications

(1)

Choquet theorem

and its applications

Master’s final project

Author:

Antonio Zarauz Moreno

Advisor:

Dr. Juan Carlos Navarro Pascual

MSc in Mathematics

USOS DEL LOGO DE LA FACULTAD DE CIENCIAS EXPERIMENTALES

LOGOTIPO EN COLOR UNIVERSIDAD

DE ALMERÍA

July, 2017

(2)

(3)

Abstract

This work contains a clear introduction to integral representation theory, show-ing some classical results combined with a list of elaborated examples. The choice of this topic has not been made by chance; one of the main reasons is the fact that it comprises a lot of areas of pure mathematics, such as Functional Analysis, Topo-logy or Measure Theory. In addition, it is the natural continuation of the degree’s final project carried out with the same advisor.

In order to be more precise with the contents of these pages, we proceed to make a detailed exposition of the outline of every chapter:

The first chapter summarises briefly all the necessary prerequisites, which can be extended in the aforementioned dissertation. The first part contains the definition of Radon measure and topological vector space motivated by the definition of normed space, as well as other considerations concerning weak topologies, convex sets and the convex hull of a set. Secondly, a few classical theorems of Functional Analysis are introduced, so long as they will be taken into account throughout this theory.

The second chapter is intended to introduce the reader to integral representa-tion theory, revisiting Minkowski-Caratheodory and Krein-Milman theorems in this setting, and explaining in depth the example of the spaceC(K), which has enoughtoolsby itself to be distinguished as an almost trivial situation. We find out that the reformulation of Krein-Milman theorem does not meet our needs, as long as it relies on the closure of the extreme points of the given set, and in some cases compact convex sets in Banach spaces coincides with the closure of its extreme points.

The third chapter makes a concise review of Choquet theorem, both in met-rizable and nonmetmet-rizable cases, using the results shown in the previous chapters. It is particularly interesting to point out the examples carried out in the nonmetrizable setting to motivate the new construction of the support of a measure, as well as the provided applications to Rainwater and Haydon theorems. Among the main reasons to distinguish the metrizable setting, we may point out that, in those spaces, we can always find a Radon measure with total support, the equivalence of the metrizability with the existence of strictly convex functions onA, or the measurability of the set of extreme points of a compact convex subset. We also show a beautiful characterization of the extreme points of a compact convex set in terms of the upper envelope of a bounded function.

The last chapter is devoted to the study of the uniqueness of the integral rep-resentation, which leads us to motivate the definition of simplex in infinite dimensional spaces. Also, a new viewpoint of Riesz theorem is obtained in order to prove the cornerstone of this chapter Choquet-Meyer theorem.

(6)

(7)

1

Preliminaries

The reader will be assumed to have a strong background in topology, measure theory and functional analysis. However, given the relevance of some concepts and results, it would be useful to introduce them in order to know more precisely the applications we have given to them.

1.1 Topological vector spaces

Topological vector spaces are a generalisation of normed spaces in which we can obtain (under some conditions such as local convexity or metrizability) similar properties and results. Recall that a topological vector space is a pair (X, τ) where

Xis a vector space over the fieldK=R∨C, andτis a compatible topology with the vector structure inX; that is, the maps (x, y)7→_x₊_y_{and (}_{α, x}₎7→_αx_{are continuous}

from X×_X _onto_X _{and from} _K×_X _onto_X _{respectively, considering the product}

topology in each space.

Secondly, a normed space is a pair (X,k·k_{) where}_X _{is a vector space and} k·k_a

norm inX. Since the topology induced by the norm is compatible with the vector structure, normed space form a strongly relevant example of topological vector spaces. There also are other structures which are compatible with the norm, such as the weak topology of a normed spaceX, denoted byω, and the weak-star topology, written asω∗. As usual, we writeXinstead of (X, τ) or (X,k·k_{) when we are making}

reference to a topological vector space or a normed space, respectively.

Let n be a natural number and X a Hausdorff topological vector space with dim(X) =n. Then, every linear bijection fromKnontoX is bicontinuous, henceX is isomorphic as a vector space to Kn and homeomorphic as a topological vector space to the Euclidean space.

Now we define the convex hull of a setAin a vector spaceX.

Definition 1.1. LetXbe a vector space and considerA⊂_X_{. The convex hull of}_A_{is the}

intersection of all the convex subsets ofXcontainingA:

co(A) =

      

n

X

i=1

λixi:n∈N, xi∈A, λi∈R+0,∀i={1, . . . , n},

n

X

i=1

λi= 1

      

, (1.1)

Figure 1.1: Convex hull of a cow.

(8)

Proposition 1.1. LetC_{the family of all the convex sets of a vector space}_X_{. Then,}

1. Whatever{_C_i}_i_∈_I ⊂ C _is∩_i_∈_I_C_i∈ C_.

2. C_{satisfies that}_A₊_B∈ C_,_λA∈ C_{for all}_{A, B}∈ C_and_λ∈_R_{. In addition,}₍_λ₊_µ₎_A₌

λA+µAfor everyλ, µ∈_R_{such that}_λµ≥₀_.1

3. co(·_{) :}_X→ C_{is a monotone and additive operator.}

4. Ais convex iffA=co(A).

When we deal with topological vector spaces, we can go one step further. Proposition 1.2. Given a setA⊂_X_{, where}_X_{is a topological vector space, we have:}

1. int(A)andAare convex sets ifAis convex.

2. co(·₎_{maps open sets into open sets.}

3. co(·₎_{maps bounded sets into bounded sets when}_X_{is locally convex.}

4. co(·₎_{maps compact sets into compact sets.}2

5. co(·₎_{maps precompact sets into precompact sets if}_X_{is locally convex.}

6. IfAis convex and int(A),∅, then int(A) =int(A)and int(A) =A.

For more details about the proof of these results and related topics, one can consult [3] and [5].

1.2 Measure theory

Definition 1.2. LetXbe a set andΣaσ-algebra overX. A functionµ:Σ→_[−∞_,₊∞_]

is a measure if it satisfies the following conditions:

1. For everyE∈_Σ,_µ₍_E₎≥₀_.

2. µ(∅_{) = 0}_.

3. For every collection{_E_n}+∞

n=1⊂Σpairwise disjoint,µ(∪+

∞

n=1En) =P+

∞

n=1µ(En).

If only the second and third conditions of the definition of measure above are met, andµtakes on at most one of the values±∞_{, then}_µ_{is called a signed measure.}

The members ofΣare called measurable sets.

The stronger condition over X is required, the more enriching properties we can obtain from the measureX. In particular, if we assume thatX is a Hausdorff topological space, which is the setting we are concerned on, one can define the Borel σ-algebra overX to be the one generated by the open sets of the topology,

B₍_X_{). In order to combine measures and topology, the next definition is introduced.}

Definition 1.3. A Radon measure is a measure µ defined over B₍_X₎ _{of a Hausdor}ff

topological spaceXsatisfying the following properties:

1_{Indeed, one can characterise}C_{through this property, in an even easier way than the implication} we have already proved. In fact, ifAis not convex, we can findx∈_X_,_λ₌_µ₌1

2 satisfying thatx∈A

butx<1₂A+1₂A.

(9)

1. For every pointx∈_X_{there exists a neighbourhood which has finite measure.}

2. For every Borel setU,µ(U) = sup{_µ₍_K_{) :} _K⊂_{U , K} _compact}_.

The second property is calledinner regularity. WhenXis locally compact, every finite Radon measure is alsoouter regular; that is, for every Borel setU,

µ(U) = inf{_µ₍_K_{) :} _U ⊃_{K, K}_compact}

Definition 1.4. A topological spaceX is called a Radon space if every finite measure

defined onB₍_X₎_{is a Radon measure.}

For instance, Euclidean spaces are Radon spaces, and separable complete metric spaces are Radon spaces as well.

The emerged condition of regularity in Radon measures has something to do with continuous linear functionals with compact support. LetXbe a locally com-pact Hausdorff space andC_c₍_X,_K_{) the space of continuous, compactly supported}

functions fromXintoK. For any compact subsetK ⊂_X_{, denote by}C_c₍_{X, K}_;_{K) the}

space of those functions which vanishes outsideK. This space, endowed with the sup-norm, becomes a Banach space.

Definition 1.5. AK-linear formµonCc(X,K)is called a Radon measure whenever, for

every compact subsetK⊂_X_{, its restriction to}C_c₍_{X, K}_;_K₎_{is continuous.}

We can prove, with the help of Riesz representation theorem, that any non-negative and bounded Radon measure in this sense is the restriction to C_c₍_X_{) of}

the integral with respect to a unique (non-finite) Radon measure (in the sense of definition1.3). To find more information about this, one may check [2].

1.3 Functional analysis

In this section we will essentially introduce the required theorems to develop the integral representation theory. First of all, a couple of results concerning com-pact convex set on locally convex topological vector spaces are given.

Theorem 1.1 (Minkowski-Carathéodory). Let A⊂ _X _{a compact convex subset of a}

finite-dimensional spaceX(with dim(X) =n). Then,

A=co(ext(A)),

namely, everya∈_A_{is a convex combination of}_n_{+ 1}_{extreme points in}_A_{as much.}

Theorem 1.2(Krein-Milman). LetXbe a locally convex HausdorffTVS, and∅_,_A⊂_X

a compact convex set. Then, ext(A)is nonempty and

co(ext(A)) =A.

We will devote the following chapter to revisit both results and formulate them into the integral-representation setting. A few applications of Krein-Milman the-orem will be required. The first of them, Banach-Alaoglu thethe-orem brings lots of examples of compact sets in infinite-dimensional spaces with the appropiate topo-logy. In particular, whenXis a normed space andV =BX, which is also a

neigh-bourhood of 0, it follows that B_X∗ is ω∗-compact, hence every normed space in

(10)

Theorem 1.3. [Banach-Alaoglu] LetXbe a TVS andV a neighbourhood of0. Then the set

K={_Λ∈_X∗_:|_Λ_x| ≤₁_,∀_x∈_V}

isω∗-compact.

An example of description of extreme points is given by Arens-Kelley theorem, who studied the unit ball ofC(K)∗.

Theorem 1.4(Arens-Kelley). Let C(X) be the space of continuous functions on the

compact HausdorffspaceX. For eachx∈_X_let_δ

x∈C(X)

∗

defined by

δx(f) =f(x),∀f ∈C(X).

Then, EC(X)∗ =_T{δ

x: x∈X}.

Last but not least important is this application of Krein-Milman theorem, which concerns the subalgebras A of C_R(X), the real-valued functions on X, which are dense ink·k_∞_{. Just the existence of extreme points in compact convex sets is}

power-ful, as it would be appreciated in a carefully reading of the proof of the next the-orem.

Theorem 1.5 (Stone-Weierstrass). LetX be a compact Hausdorff space. Let Abe a subalgebra ofC_R(X)so that for anyx, y∈_X_and_{α, β}∈_R_{, there exists}_f ∈_A_so_f₍_x_{) =}_α

(11)

2

Introduction to integral representation

After having established Krein-Milman theorem, one can get more powerful results introducing some concepts about measure theory. To be more precise, the main purpose of this chapter is to reformulate the the most profitable results of the previous ones, such as Carathéodory and Krein-Milman theorem, in terms of measures supported by the set of extreme points of a given compact convex set, instead of employing convex linear combinations.

2.1 Krein-Milman theorem revisited

To begin with, let us rewrite Minkowski-Carathéodory theorem (1.1) to show with a first example the meaning of arepresentingmeasure. Given a compact con-vex subsetAof a finite-dimensional topological vector spaceX (dim(X) =n) and

a∈_A_{, then there exists}{_x_i}n+1

i=1 extreme points and {λi}ni=1+1 ⊂R+0 with

Pn+1

i=1λi = 1

such thata=Pn+1

i=1λixi. Ifδxiis thepoint massatxi, we will denote byεxithe Borel measure which equals to 1 on any Borel set which containsxi, and equals 0

oth-erwise. Letµ = Pn+1

i=1λiεxi; then µ is a Borel measure on A, µ ≥0 and µ(A) = 1. Furthermore, for any continuous functionalf ∈_X∗_,

f(a) =

n+1

X

i=1

λif(xi) =

Z

A

f dµ.

This last assertion is what we mean when we say thatµrepresentsa∈_A_.

Definition 2.1. Suppose thatAis a nonempty compact subset of a locally convex space

X, and thatµis a probability measure onA. A pointx∈_X_{is said to be represented by}_µ

if

f(x) =

Z

A

f dµ, ∀_f ∈_X∗_.

It will be denoted sometimes by µ(f) instead of R_Af dµ (other terminology: “x is the

barycenter ofµ”, “xis the resultant of ofµ”).

The restriction thatXbe locally convex is to ensure the existence of sufficiently many functionals in X∗ to separate points; this guarantees that there is at most one point represented by µ. Note that each x ∈ _X _{is trivially represented by} _ε_x_;

the most interesting factor brought out by the above example is that, for every compact convex subsetAof a finite-dimensional spaceX, every point ofAmay be represented by a probability measure which issupportedby the extreme points of

A.

Definition 2.2. Ifµis a nonnegative Borel measure on the compact HausdorffspaceX

andSis a Borel subset ofX,µis said to be supported bySifµ(X\_S_{) = 0}_.

This is one of the fundamental parts we are going to deal with in the devel-opment of this chapter. Indeed, the reformulation and extension of Krein-Milman theorem will be given by the existence of a family of Borel measures{_µ_a}_a_∈_A

suppor-ted by the extreme points of a compact convex setAwhich represent everya∈_A_.

It is also important to recall that the extreme points are characterised by the fact that they only have one representing measure.

(12)

Proposition 2.1. Suppose that Ais a compact convex subset of a locally convex space

X, and leta∈_A_{. Then}_a_{is an extreme point of}_A_{if, and only if, the point mass}_ε_a_{is the}

only probability measure onAwhich representsa.

Proof. ⇒_{) Suppose that}_a∈_ext(_A_{) and that the measure}_µ_represents_a_{. By}

regular-ity ofµ, to see thatµis supported by{_a} _{it su}ffi_{ces to show that}_µ₍_D_{) = 0 for each}

compact setD ⊂_A\{_a}_{. Suppose}_µ₍_D₎_>_{0 for some such}_D_{; from compactness of}

D it follows that there is some pointx∈_D _{such that}_µ₍_U∩_A₎_>_{0 for every}

neigh-bourhoodU of x. ChooseU to be a closed convex neighbourhood ofx such that

K:=U∩_A⊂_A\{_a}_.

A

a

D

x

U

K

The setKis compact and convex, and 0< r:=µ(K)<1, so long as ifµ(K) = 1 the resultantaofµwould be inK. Now defineµ1andµ2 Borel measures conAgiven by

µ1(B) =

1

rµ(B∩A), µ2(B) =

1

1−_rµ(B∩(A\K)).

Letai be the resultant ofµi; sinceµ1(K) = 1,a1∈K anda1,a. Furthermore,

µ=rµ1+ (1−r)µ2⇒a=ra1+ (1−r)a2,

contradiction.

⇐_{) Suppose that}_a_<_ext(_A_{); then there exists}_n∈_N,{_a_i}n

i=1⊂ext(A) and{λi}ni=1⊂ R+₀such thatPni=1λi= 1 and

a=

n

X

i=1

λ_ia_i.

Thenµ=Pn

i=1λiεai ,εaalso representsa.

Example 2.1. In order to become acquainted with the results we are considering, we dedicate a first (infinite-dimensional) example to the space of continuous functions on a

compact Hausdorffspace. LetY be a compact Hausdorffspace,C(Y)the Banach space of

all continuous real-valued functions onY andA⊂_X_:=_C₍_Y₎∗_{the set of all continuous}

linear functions onC(Y)such that L(1) = 1 =k_Lk_{. Then}_A_{is a compact convex subset}

of the locally convex space X in its w∗-topology. In fact, the first equality give us the

(13)

Convexity: Let L, L0 ∈_A _and_t∈_]0_,_1[_{. Then,} _L_t ₌_tL_{+ (1}−_t₎_L0 _{is also a linear}

functional onXsatisfyingLt(1) =tL(1) + (1−t)L0(1) = 1andkLtk ≤tkLk+ (1−

t)k_L0k_{= 1}_{, but since}k_L_t₍₁₎k_{= 1}_{, the equality holds.}

Compactness: A is a closed subset of SX, hence a closed subset of BX; then by

Banach-Alaoglu theorem (1.3) we conclude thatAis aw∗-compact subset.

The Riesz-Markov-Kakutani theorem assets that to each L in A there corresponds a

unique probability measureµonY such that

L(f) =

Z

Y

f dµ, ∀_f ∈_C₍_Y₎_.

Also, Y is homeomorphic with the set of extreme points ofAby Arens-Kelley theorem

(1.4), hence one may considerµas a probability measure onB₍_A₎_{which vanish on the}

open setA\_Y1_{, so}_µ_{is supported by the set of extreme points of}_A_.

One only need to recall thatX∗, the space of w∗-continuous linear functionals onC(Y)∗, consists precisely of those functionalsL7→_L₍_f_{) (}_f ∈_C₍_Y_{)) in order to see}

that this is a representation theorem of the type we are considering.

There are some considerations that it should be made under the previous ex-ample, as long as they are not necessarily affirmative in more general situations.

The compactness of the set of extreme points ofAlet us to use, in particular, that it is a Borel set.

The representation was unique; this is a more settle detail that will be studied in depth later.

It is clear that any probability measure µ on Y defines, by f 7→ R

Xf dµ, a linear

functional on C(Y) which is inX. This fact is true under fairly general circum-stances; first one should recall that it suffices for the intersection of an arbitrary family of compact sets to be nonempty that any finite intersection is nonempty as well, which let us to define a finite rank function in the following result.

Proposition 2.2. Suppose thatY is a compact subset of a locally convex spaceX, and

thatA:=co(Y)is compact. Ifµis a probability measure onY, then there exists a unique

pointa ∈_A _{which is represented by}_µ_{, and the function} _µ 7→R

Yf dµ is an affinew

∗

-continuous map fromC(Y)∗intoA.

Proof. The goal is showing that the compact convex setAcontains a pointasuch

thatf(a) =R_Yf dµfor eachf ∈_X∗_{. Let}_f ∈_X∗_{and consider the closed hyperplane}

Hf ={a∈A: f(a) =µ(f)}.

To check that ∩_f_∈_X∗H

f is nonempty, it will be used that Ais compact to define,

given a finite family{_f_i}n

i=1⊂X

∗

, the following function (see the previous comments to the statement of the proposition)

T : A −→ _Rn

a 7−→ ₍_f₁₍_a₎_{, . . . , f}_n₍_a₎₎_.

T is a continuous linear map, henceT Ais compact and convex.

1_{The set}_Y_{is now considered via the inclusion}_y_7→_δ

(14)

It suffices to verify that p = (µ(f1), . . . , µ(fn)) ∈ T A. If p < T A, there exists a linear functional onRnwhich separatespandT A; representing this functional by

b= (b1, . . . , bn) this means that

(b, p)>sup{₍_{b, T a}_{) :} _a∈_A}_.

If we defineg∈_X∗_{given by}_g₍_x_{) =}Pn

i=1bifi, the previous inequality becomes

Z

Y

g dµ >supg(A).

SinceY ⊂_A_and_µ₍_Y_{) = 1, this is impossible.}

To prove the second assertion, let{_µ_i}_i_∈_I _{be a net of probability measures on}_Y

which convergesw∗ inC(A)∗to the probability measureµ, and let{_x_i}_i_∈_I _and _x_be

their respective resultants. Thanks to the compactness ofX, to show that{_x_i}_i_∈_I →_x

it suffices to check that every subnet{_x_j}_j_∈_J _of{_x_i}_i_∈_I _{converges to}_x_{. But}{_x_j}_j_∈_J →_y_,

which means that

f(x_j) =µ_j(f)→_µ₍_f_{) =}_f₍_x₎_, ∀_f ∈_X∗_,

and since the latter separates points ofA,y=x.

Note that the compactness of Amay be avoided when the space meets some conditions, such as complete spaces. In order to rewrite Krein-Milman theorem, we describe the closed convex hull of a compact set in terms of barycenters of measures; it is just an extension of Proposition2.1

Proposition 2.3. LetY be a compact subset of a locally convex space X. A pointx ∈

X lies inA :=co(Y) if, and only if, there exists a probability measure µ on Y which

representsx.

Proof. ⇐_{) Let}_µ _{be a probability measure on}_Y _{which represents}_x∈_X_{. For every}

f ∈_C₍_Y_),

f(x) =µ(f)≤_sup_f₍_Y₎≤_sup_f₍_A₎_.

SinceAis closed and convex,x∈_A_.

⇒_{) For every} _x ∈ _Y _{there exists a net} {_x_i}_i_∈_I _{in the convex hull of} _Y _{such that} {_x_i}_i_∈_I →_x_{. Now we write}

xi=

ni

X

k=1

λkxik,

ni

X

k=1

λk=

ni

X

k=1

|_λ_k|_{= 1}_,{_xi

k} ni

k=1⊂Y

hence every xi can be represented by the measure µi =P_kni₌₁λkεxi_k. Since the set

of probability measures onY can be identified with aw∗-compact subset ofC(Y)∗ (Riesz theorem), there exists a subnet{_µ_α}_of{_µ_i}_{such that}

µα

w∗

−→_µ∈_C₍_Y₎∗

In particular, for everyf ∈_X∗_{we have that}_f|_Y ∈_C₍_Y_{), thus}

f(x) = limf(xi) = limf(xα) = lim

Z

Y

f dµα=

Z

Y

(15)

This proposition makes it easy to reformulate the Krein-Milman theorem. To be more precise, we will show the equivalence between Krein-Milman theorem and the statement:

Every point of a compact convex subsetAof a locally convex spaceXis the barycentre

of a probability measure onAwhich is supported by the closure of the extreme points of

A.

Suppose first that Krein-Milman holds and letx∈_A_{. If}_Y _{:= ext(}_A_{), then}_x∈_co_Y_.

Thanks to the above proposition,xis the barycentre of a probability measureµon

Y. Extending the measure (in the obvious way) toA, the result holds. Conversely, if

x∈_A_{and we define}_Y _{as in the previous implication, by proposition (2.3)}_x_{lies in}

the closed convex hull ofY, hence in the closed convex hull of the extreme points ofA.

It would be convenient to recall the Milman’s classical converseto the Krein-Milman theorem, which states that the closure of ext(A) is the smallest closed sub-set ofAgeneratingA.

Theorem 2.1(Milman). Suppose thatAis a compact convex subset of a locally convex spaceX, thatZ⊂_X_{and that}_A₌_coext₍_Z₎_{. Then ext}₍_A₎⊂_Z_.

Proof. Let Y = Z and suppose that x ∈ _ext(_A_{). By proposition} _2.3 _{there exists a}

measureµonY which representsx; by proposition2.1µ=εx, hencex∈Y.

It is now clear that Krein-Milman reformulation does not meet our needs, so long as it would be desirable for the measure to be supported by the extreme points of the compact convex set, instead of the closure of the mentioned set. In fact, V. L. Klee proved in 1957 that (with the appropiate considerations) that almost every compact convex set in a Banach spaceisthe closure of its extreme points.

The problem of finding such measures arises mainly from the measurability of the set of extreme points. When the set is metrizable, the next proposition brings an affirmative answer.

Proposition 2.4. IfAis metrizable, compact convex subset of a topological vector space, then ext(A)is aGδset.

Proof. Consider the sets

Fn=

x∈_A_:_x₌1

2(y+z), y, z∈A, d(y, z)≥

1

n

(16)

(17)

3

Choquet theorem

The chapter devoted to this theorem will be carried out in two different sections, according to the metrizability of the compact convex setA. It ensures, among other things, the measurability of the set of extreme points ofA, hence it makes easier the study of integral representation theorems. In the latter part, we study the general setting, for which we will have to redefine the concept of support of a measure.

3.1 Metrizable case

First of all, we will denote byA_{the set of a}ffi_{ne functions on a compact}

(met-rizable) spaceA, which is a subspace ofC(A) that contains the constant functions and separates points ofA. The following concept will be a cornerstone in the proof of Choquet’s theorem, since it will allow us to define a subadditive and positive homogeneous functional on the subspace A _{to extend it applying Hahn-Banach}

theorem.

Definition 3.1. Iff is a bounded function onAanda∈_A_{, the upper envelope of}_f _is

the function

f = inf{_h₍_a_{) :} _h∈ A ∧_h≥_f}

A few visual examples are shown now to introduce the idea intuitively.

0 1 2 3 4 5 6 7 8

0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5

sin(2t)+ln(1+t2₎ Upper envelope

0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

-1 -0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

Upper envelope

sin(:/t2₎

Figure 3.1: Upper envelope of some continuous functions.

The following properties hold for the upper envelope of a bounded function. Proposition 3.1. Letf , g be bounded functions onAandr≥₀_{. Then,}

1. f is concave, bounded and upper semicontinuous.

2. f ≤_f _{and the equality holds if}_f _{is concave and upper semicontinuous.}

3. f +g≤_f ₊_g_.

4. Ifg∈ A_,_f ₊_g₌_f ₊_g₌_f ₊_g_.

5. |_f −_g| ≤ k_f −_gk_.

6. rf =rf.

Proof.

(18)

1. To check thatf is concave, for everya, a0∈_A_and_t∈_[0_,_1],

f((1−_t₎_a₊_ta0_{) = inf}{_h₍₍₁−_t₎_a₊_ta0_{) :} _h∈ A ∧_h≥_f} ≥ _inf{₍₁−_t₎_h₍_a_{) +}_th₍_a0_{) :} _h∈ A ∧_h≥_f}

= (1−_t_{) inf}{_h₍_a_{) :} _h∈ A ∧_h≥_f}₊_t_inf{_h₍_a0_{) :} _h∈ A ∧_h≥_f}

= (1−_t₎_f₍_a_{) +}_tf₍_a0₎

To see thatf is bounded, recall that the functionh(x) = supf(A)<+∞ _{is a}

constant (hence affine) function on A and f(a) ≤_h₍_a_{) for every} _a∈ _A_{, thus}

f(a)≤_h₍_a₎_<₊∞_.

The function f is also upper semicontinuous. In fact, for everyλ ∈ _R _and

x∈_f−1_(]− ∞_{, λ}_{[), let}_g_{be the element of}_X∗_{such that}_g₍_x_{) =}f(x)

2 . Then the set (g+λ₂)−1(]− ∞_{, λ}_{[) is an open neighbourhood of}_x_in_f−1_(]− ∞_{, λ}_[).

2. The inequality f ≤ _f _{is straightforward by definition. To prove the other}

assertion, we have to notice that K = {₍_{a, r}_{) :} _f₍_a₎ ≥ _r} _{is closed and convex}

in the locally convex space X×_{R, so if} _f₍_a₀₎ _{< f}₍_a₀_{) at some point} _a₀ ∈_A_,

the separation theorem would provide the existence of a linear functional

L∈₍_X×_R₎∗_{such that}

supL(K)< λ < L(a0, f(a0)).

In particular, L(0,1) >0 and the affine function h(a) =r if L(a, r) =λ on A

exists andf < hwithh(a0)< f(a0), contradiction. 3. For everya∈_A_,

f +g(a) = inf{_h₍_a_{) :} _h∈ A ∧_h≥₍_f ₊_g₎}

≤ _inf{_h₍_a_{) :} _h∈ A ∧_h≥_f}_{+ inf}{_h₍_a_{) :} _h∈ A ∧_h≥_g}

= f(a) +g(a)

4. For everya∈_A_{, since the sum of two a}ffi_{ne functions is a}ffi_ne,

f +g(a) = inf{_h₍_a_{) :} _h∈ A ∧_h≥₍_f ₊_g₎}

= inf{₍_h₊_g₎₍_a_{) :} _h∈ A ∧_h≥_f}

= g(a) + inf{_h₍_a_{) :} _h∈ A ∧_h≥_f}

= g(a) +f(a)

5. Sincef ≤ k_fk_,

f = (f −_g_{) +}_g≤₍_f −_g_{) +}_g,

sof −_g≤_f −_g≤ k_f −_gk_{. Interchanging}_f _and_g_{yields the result.}

6. It is direct from the fact that multiplying a function by a non negative number preserves its condition of concave.

Remarks 3.1.

The first statement shows in particular that every functionf is Borel measurable.

To illustrate the first assertion of(3), one only has to considerg=−_f _and_f₍_a₎_,₀

(19)

An immediate conclusion of(3)and(6)is that, given a pointx0∈A, the functional

ponC(A)given by

p(g) =g(x₀)

is subadditive and positively homogeneous.

Now we are ready to introduce the metrizable version of Choquet theorem. Theorem 3.1. Suppose thatAis a metrizable compact convex subset of a locally convex

spaceX, and thatx0∈A. Then there is a probability measure onAwhich representsx0

and is supported by the extreme points ofA.

Proof. Since Ais metrizable,C(A) is separable, and we may choose a dense

unit-ary sequence in A_{, namely} {_h_n}_n_∈

N. In particular, it separates points of A. Let

f =P

n∈_N₂1nh2n; by CS-compactness of the unit ball of a Banach space, this element

belongs to BC(A). In addition,f is a strictly convex function: suppose thatx, y∈A,

x,y, thenhn(x),hn(y) for somen∈N, henceh2nis strictly convex and so isf.

Let B be the subspace A_{+ span}{_f}_{. The functional on} _B _{given by} _h₊_rf 7→

h(x0)+rf(x0) is dominated byp(see previous remark) onB, hence by Hahn-Banach theorem there exists a linear functionalmonC(A) such that

m(g)≤_g₍_x₀_{) for}_g _in_C₍_A_{) and}_m₍_h₊_rf_{) =}_h₍_x₀_{) +}_rf₍_x₀_{) on}_B.

In order to use the Riesz representation theorem, we need to prove thatmis con-tinuous, but that is clear from the fact that ifg∈_C₍_A₎_{, g} ≤_{0, then}_m₍_g₎≤_g₍_x₀₎≤_0.

Thus, there is a nonnegative regular Borel measureµ onX satisfyingm(g) =µ(g) forg∈_C₍_A_{). It only lefts to prove that}_µ_{is a probability measure and is supported}

by the extreme points ofA:

Probability measure: Sinceg≡_{1 on}_A_{belongs to}A ⊂_B_{, 1 =}_m_{(1) =}_µ_(1).

Supported by ext(A): We will show thatµvanishes on the complement of the set E ₌ {_a ∈_A_: _f₍_a_{) =}_f₍_a₎}_{, and then the fact that} E ⊂_ext(_A_{). On the one}

hand,f ≤_f_{, so}_µ₍_f₎≤_µ₍_f_{); on the other hand, if}_h∈ A_and_h≥_f_{, then}_h≥_f

and consequently

h(x₀) =m(h) =µ(h)≥_µ₍_f₎_,

which implies by definition thatµ(f)≤_f₍_x₀_{) =}_m₍_f_{) =}_µ₍_f_).

The setE _{is contained in ext(}_A_{), so long as if given}_x₌1

2(y+z) withy, z∈A, then the strict convexity off implies that

f(x)<1

2(f(y) +f(z))≤ 1

2(f(y) +f(z))≤f(x).

It is useful (and beautiful from a geometric viewpoint) to remark that the pre-vious setE _{actually coincides with ext(}_A_{). In addition, it will provide us a good}

motivation to introduce the nonmetrizable setting. To that end, it would be ap-propriate to identify the measures thatattain the same value inA_{in the following}

sense.

Definition 3.2. If µ and λ are probability measures such that µ(f) = λ(f) for each

(20)

We already know thanks to proposition (2.1) that εx is the only measure that

represent any extreme pointx, hence the following result will bring us the desired equalityE_{= ext(}_A_).

Proposition 3.2. Iff is a continuous function on the compact convex set A, then for eachx∈_A_,

f(x) = sup

µ(f) : µ∼_ε

x .

Proof. Letf0(x) = sup

µ(f) :µ∼_ε_x _{; we must show that}_f ₌_f0_{. First of all, it is clear}

from the definition thatf0 is concave. To prove thatf0 is upper semicontinuous,

fixr >0 and let{_x_i}_i_∈_I ⊂_A_{be a net converging to}_x _{with each}_f0₍_x_i₎≥_r_{. To check}

thatf0(x)≥_r_{, suppose that}_{ε >}_{0 and for each}_i∈_I _choose_µ_i∼_ε_x

i withµi(f)> r−ε.

Byw∗-compactness, there exists a probability measureµand a subnet{_µ_j}_j_∈_J _which

convergesw∗toµ. Ifg∈ A_{, then}

g(xj) =µj(g)→µ(g),

and sinceg(xj)→g(x) we see thatµ(g) =g(x) for everyx∈Aand thereforeµ∼εx.

Finally,

r−_ε≤_lim_µ_j₍_f_{) =}_µ₍_f₎≤_f0₍_x₎_.

Since the choice ofεwas arbitrary, we conclude thatf0(x)≥_r_.

Sincef0 is upper semicontinuous,{₍_{x, r}_{) :} _f0₍_x₎≥_r}_{is a closed convex subset of}

X×_{R; using a similar argument as in proposition (3.1), we get that}_f ≤_f0_{. On the}

other hand, ifh∈ A_,_x∈_A_and_h≥_f_{, then for any}_µ∼_ε_x_,

h(x) =µ(h)≥_µ₍_f₎_.

It follows thatf0(x)≤_h₍_x_{), so}_f0≤_f_.

3.2 Non-metrizable case

Suppose that Ais a nonmetrizable compact convex subset of a locally convex spaceX. Examples shown by Beshop-de Leeuw [7] reveal that the set of extreme points can be, far from a Gδ set, as bad as one may desire, and hence it does not

make any sense considering measures supported in a nonmeasurable set. It turns out that the best adaptation for a reformulation of a similar result relays on an adaptation of the definition of supported by for Borel measures. An alternative definition may require thatµ vanish on every Borel set which is disjoint from the set of extreme points, but (again) Bishop and de Leeuw have shown that it is not always possible to obtain representing measures with this property. If, however, one demands only thatµvanish on the Baire subsets ofAwhich contain no extreme points, then representation results can be obtained.

This step leaves unanswered the question of approaching the set of extreme points using Baire sets; in other words, how to how to find a net of measures such that their supports are every time closer to the extreme points. To that purpose, it will be defined an order in the class of regular Radon measures in the following way: givenA⊂_X_{a compact convex set and}_{µ, λ}_{Radon measures on}_A_,_µ_λ_{if and}

only ifµ(f)≥_λ₍_f_{) for every convex function}_f _on_A_{. The agreement of this}

(21)

Consider f a convex function and A ⊂ _R2 _{the square of vertexes} ₍₁_,₀₎_,₍₀_,₁₎_,

(−₁_,₀₎_,₍₀_,−₁₎_{. For a fixed}₍_{x, y}₎∈_A\{₍₀_,₀₎}_define

µ(x,y)=

1

4[δ(x,0)+δ(−x,0)+δ(0,y)+δ(0,−y)].

X Y

-1.0 -0.5 0.5 1.0

Now we check that iftx ≤x andty ≤y, thenµ(x,y) µ(tx,ty), hence for every

in-creasing sequences{_x_n}_n_∈

N,{yn}n∈Nwhich converge to1, the sequence{µ(xn,yn)}n∈N

is increasing as well (with the order_{previously defined).}

Z

A

f dµ(x,y) =

1

4[f(x,0) +f(−x,0) +f(0, y) +f(0,−y)] = 1

4

_x₊_t

x

2x f(x,0) +

x−_t

x

2x f(−x,0)

+1 4

_x₋_t

x

2x f(x,0) +

x+t_x

2x f(−x,0)

+ 1 4

"_y₊_t

y

2y f(0, y) +

y−_t

y

2y f(0,−y)

#

+1 4

"_y₋_t

y

2y f(0, y) +

y+t_y

2y f(0,−y)

#

≥ 1

4f

_x₊_t

x

2x x+

x−_t_x

2x (−x),0

+1 4f

_x₋_t

x

2x x+

x+tx

2x (−x),0

+ 1 4f 0,

y+ty

2y y+

y−_t_y

2y (−y)

!

+1 4f 0,

y−_t_y

2y y+

y+ty

2y (−y)

!

= 1 4

h

f(tx,0) +f(−tx,0) +f(0, ty) +f(0,−ty)

i

=

Z

A

f dµ(tx,ty)

Letnbe a natural number,A=B_Rn and

Sn={_x∈_Rn_:k_xk_{= 1}}_.

For any 0< r ≤₁_{, define the measures}_µ_r ₌ 1

|_rSn|χrSndx1. . . dxn. Once again, it

can be shown that given0< r₁≤_r₂≤₁_{, then} _µ_r

1Sn(f)≤µr2Sn(f)for any convex

(22)

For that purpose note that, for0< r≤₁_{, using the generalised spherical}

coordin-ates and Fubini’s theorem:

µrSn(f) =

Z

A

f dµ= 1

|_rSn|

Z

rSn

f dx1. . . dxn=C

Z

rSn

f dθ1. . . dθn−₁

= C Zπ 0 . . . Zπ 0

Z 2π

0

f dθn−₁

!

dθn−₂

!

. . .

!

dθ1

The next lemma will provide us a useful inequality:

Lemma 3.1. Given0< r₁≤_r

2≤1and a convex functionf,

f(r1, θ1, . . . , θn−₁)

| {z }

f_r+ 1

+f(−_r₁_{, θ}₁_{, . . . , θ}_n₋₁₎

| {z }

fr−₁

≤_f₍_r₂_{, θ}₁_{, . . . , θ}_n₋₁_{) +}_f₍−_r₂_{, θ}₁_{, . . . , θ}_n₋₁₎_.

Proof. Using the decompositionr1= r2₂+_r₂r1r2+r2

−_r₁

2r2 (

−_r₂_{) and the convexity of}

f,

fr₂++fr₂− =

r₂+r₁

2r2

fr₂++

r₂−_r₁

2r2

fr₂−

!

+ r2−r1 2r2

fr₂++

r₂+r₁

2r2

fr₂−

!

≥_f_r+

1 +fr

−

1.

Integrating the previous quantity forθ_n−₁∈[0, π]and any given0< r≤1,

I=

Zπ

0

fr+dθ_n−₁+

Zπ

0

fr−dθ

(23)

If we use the change of variables(r, θ1, . . . , θn−1)7→(−r, θ1, . . . , θn−1)in the second

integral,

I=

Z π

0

fr+dθ_n−₁−

Z−π

0

fr−dθ

n−₁=

Z π

0

fr+dθ_n−₁+

Z 0

−_π

fr−dθ

n−₁=

Z 2π

0

f dθn−₁.

Hence integrating the equation in (3.1) inθn−₁∈[0, π]we conclude that

Z2π

0

f dθn−₁≤

Z 2π

0

f dθn−₁

The Choquet-Bishop-de Leeuw theorem is shown now to devote the rest of this section to its proof.

Theorem 3.2. Suppose thatAis a compact convex set of a locally convex spaceXand

thatx0 ∈ A. Then there exists a probability measure µ onA which representsx0 and

which vanishes on every Baire subset ofAwhich is disjoint from the set of extreme points

ofA.

The necessary tools for proving the theorem version will be exposed below. Let

Cdenote the set of all convex functions on a compact convex set in a locally convex spaceA⊂_X_{. The subspace} _C−_C _{is a lattice under the usual partial ordering in}

C(A). Since it containsA_{, the set of all a}ffi_{ne functions on}_A_,_C−_C_{separates points}

of A and is dense in C(A) (in the norm topology) by Stone-Weierstrass theorem. Now we formally introduce the order relation introduced at the beginning of the section.

Definition 3.3. Ifλandµare nonnegative Borel measures onA, it is said thatλ_µ_if

λ(f)≥_µ₍_f₎_{for every}_f ∈_C_.

This relation is readily seen to be reflexive and transitive; the antisymmetry comes from the fact thatC−_C_{is dense in}_C₍_A_{). Note that if}_f ∈ A_{, then both}_f _and −_f _{lie in}_C_{, so that}_λ_µ_implies_λ₍_f_{) =}_µ₍_f_{); i.e.,}_λ_and_µ_{represent the same linear}

functional onA_{. It is also well worth noting that if}_µ∼_ε_x_{, then}_µ_ε_x_{: indeed, if}

f ∈ −_C_{, then}_f ₌_f _{and hence}

f(x) = inf{_h₍_x_{) :} _h∈ A_{, h}≥_f}_{= inf}{_µ₍_h_{) :} _h∈ A_{, h}≥_f} ≥_µ₍_f₎_.

According to the previous examples, one may expect that the maximal measures

(maximal with respect to the order “_{”) have their support every time}_closer_{to the}

set of extreme points ofA.

An application of Zorn’s lemma leads us to prove that for any given nonnegative measureλ, there exists a maximal measureµso thatµ_λ_.

Lemma 3.2. If λ is a nonnegative measure, there exists a maximal measure µ with

µ_λ_.

Proof. Suppose thatλ≥_{0 and let}_Z₌{_µ_: _µ≥₀∧_µ_λ}_{. To find a maximal element}

ofZ, letW be a chain inZ. One may regardW as a net (the directed “index set” being the elements of W themselves) which is contained in the w∗-compact set

{_µ≥_{0 :}_µ_{(1) =}_λ₍₁₎}_{. Thus, there exists}_µ₀≥_{0 and a subnet}{_µ_i}_i_∈_I ⊂_W _satisfying {_µ_i}_i_∈_I w

∗

(24)

Ifµ1∈ W, it follows from the definition of a subset that eventuallyµi µ1, and henceµ0µ1. We have proved thatµ0 is an upper bound ofW that belongs toZ; by Zorn’s lemma,Zcontains a maximal element.

It is only left to see that this measure is maximal, but that comes from the fact that ifη_µ_{, then}_η_λ_and_η∈_Z_{, so}_µ₌_η_.

The idea of the proof is simple: Ifx0∈A, choose a maximal measureµso that

µ_ε_x

0. As noted above, µ representsx0; it remains to show that the maximality

of the measure implies thatµvanishes on the Baire sets which contain no extreme points. The first step in that direction is contained in the following result.

Proposition 3.3. Ifµis a maximal measure onA, thenµ(f) =µ(f), for each continuous

functionf onA.

Proof. Letf ∈_C₍_A_{) be a continuous function and define the functional}_L_{on span(}_f₎

by

L(rf) =rµ(f).

Define the sublinear functionalponC(A) byp(g) =µ(g). Ifr≥_{0, then}_L₍_rf_{) =}_p₍_rf_),

while ifrz0, then

0 =rf −_rf ≤_rf _{+ (}−_rf_{) =}_rf −_{rf ,}

hence L(rf) = µ(rf) ≤_µ₍_rf_{) =}_p₍_rf_{). Thus,} _L≤ _p _{on span(}_f_{) and there exists an}

extensionL0ofLtoC(A) such thatL0≤_p_.

Ifg≤_{0, then}_g≤_{0, so}

L0(g)≤_p₍_g_{) =}_µ₍_g₎≤₀_.

It follows thatL0 ≥ _{0 and hence there exists a nonnegative measure}_ν _on _A _such

thatL0(g) =ν(g) for eachg∈_C₍_A_{). If}_g_{is convex, then}−_g_{is concave and}−_g₌−_g_{, si}

ν(−_f₎≤_p₍−_g_{) =}_µ₍−_g_{) =}_µ₍−_g₎_,

i.e., ν _µ_{. Since}_µ _{is maximal, we must have} _µ₌_ν_{, and therefore} _µ₍_f_{) =}_ν₍_f_{) =}

L(f) =µ(f).

It will be seen in the next chapter that the converse to this result is also true (lemma4.3); furthermore, this last proposition implies that the maximal measure

µis supported by{_x_: _f₍_x_{) =}_f₍_x₎}_{. As shown by proposition (3.2), each of these sets}

contains the extreme points ofA. IfC contained a stricly convex functionf0, we would have that

ext(A) ={_x_: _f₍_x_{) =}_f₍_x₎}_,

and the proof will be complete. However, the existence of a strictly convex function onAimplies its metrizability, see [10]. Instead, we prove that ext(A) is the inter-section of all the sets of the form{_x_: _f₍_x_{) =}_f₍_x₎} _for_f ∈_C_{. Indeed, if}_f₍_x_{) =}_f₍_x₎

for eachf ∈_C_and_x₌1

2(y+z) withy, z∈A, then

f(y) +f(z)≥₂_f₍_x_{) = 2}_f₍_x₎≥_f₍_y_{) +}_f₍_z₎≥_f₍_y_{) +}_f₍_z₎_,

i.e.,f(x) = 1₂[f(y) +f(z)] for eachf ∈_C_{. The same equality holds for}_f ∈ −_C_{, hence}

for each element ofC−_C_{. Since the latter subspace is dense in}_C₍_A_{), we must have}

x=y=z.

Choquet theorem and its applications

Choquet theorem

and its applications

Master’s final project

Antonio Zarauz Moreno

MSc in Mathematics

Contents

Abstract

1

Preliminaries

1.1

Topological vector spaces

1.2

Measure theory

1.3

Functional analysis

2

Introduction to integral representation

2.1

Krein-Milman theorem revisited

3

Choquet theorem

3.1

Metrizable case

3.2

Non-metrizable case