La revista digital - Candela María Belluschi

The weight matrix T of a discrete Hopfield network is symmetric. Therefore, the eigenvalues of T are all real and T can be completely characterized by its eigenvalues and corresponding orthogonal eigenvectors. Let these be denoted

λ1, . . . , λN e1, . . . , eN

→ For large p, the eigenvalues are ordinarily degenerate, which means that there are several eigenvectors with the same eigenvalue. The eigenvectors associated with a degenerate eigenvalue form a subspace.

→ The weight matrix T has a degenerate eigenvalue with a value of zero, whose relative subspace is called the null space. The null space exists by virtue of the fact that the number of fundamental memories, p, is smaller than the number of neurons N in the network. The presence of this null space is an intrinsic characteristics of the Hopfield network.

22_{If in Eq. (2.42) we assume I}

i6= 0 and Ui6= 0 this result is not true and we will have only local Lyapunov

functions, one for each ξµ. A detailed proof for a case similar to this has been done for the continuous time energy and can be found in Prop. 2.3.1.

23_{An equilibrium point for a differential equation ˙}_{x = f (t, x) is a point x}∗_{such that f (t, x}∗_{) = 0, ∀ t or}

equivalent is an equilibrium point for x(t + 1) = f (t, x(t)) if f (t, x∗_{) = x}∗_{for t = 0, 1, 2, . . .}

24_{As discussed above the same result holds also in the general case with an analogous proof. Indeed, we}

have used the hypothesis of orthogonality only in the analysis of the vector in ker(T ) (we have already seen how we can overcome this) and in the evaluation of the energy function in the stored pattern. In the general case we don’t know these values exactly, because they depend on the scalar product ξµ_{· ξ}β _{and so for this}

To understand better, we make an eigenanalysis of T , as done in Aiyer et al. [3]. Let e1_{, . . . , e}p1 _{be the eigenvectors with non zero eigenvalue λ}

1, . . . , λp1 and e

p1+1_{, . . . , e}N

those with null eigenvalue. A vector V can be written in terms of its component Vk _{in the}

direction of the eigenvector ek_{, k = 1, . . . , p}

1, plus its component q in the null subspace as

follows: V = p1 X k=1 Vk+ q (2.56) where V · ek= Vk= γkek

Similarly, the connection matrix, energy function and dynamic update equation can be expressed as follows: T = p1 X k=1 λkek(ek)T E = −1 2 p1 X k=1 λk|Vk|2= − 1 2 p1 X k=1 λkγ2k T V = p1 X k=1 λkvk (2.57)

It can be seen that to minimize E the network must move V so as to: • reduce to zero magnitude all Vk_{’s where λ}

k< 0

• increase the magnitude of all Vk_{’s where λ} k> 0

Let V be a memory vector and thus a network stable state. The condition of network stability requires Vi = sgn((T V )i) for i = 1, . . . , N = sgn p1 X k=1 λkVik using (2.56) =⇒ p1 X k=1 Vik+ qi = sgn p1 X k=1 λkVik (2.58)

The only way that (2.58) can be guaranteed for any set of memory vectors is if

qi= 0 for i = 1, . . . , N and λk = λ for k = 1, . . . , p1 with λ > 0

In other words:

1. To ensure q = 0, the null subspace must be orthogonal to all the memory vectors.

2. As a result, all the memory patterns must be completely specified byPp1

k=1V

k_{, hence}

the eigenvectors of T must at least span the subspace formed by the memory vectors. This will automatically ensure that the previous point is true.

3. So that λk = λ, ∀ k with k = 1, . . . , p1, the connection matrix must have a single

Let M denote the subspace spanned by the memory vectors. From 1. the complementary subspace to this will be the null subspace of T , denoted with Q. Thus a vector V can be written in terms of its component in M, denoted by ˜m, and its component in Q, denoted by q

V = ˜m + q, where ˜m · q = 0 (2.59) Remark. We note that if the memorized patterns formed an orthogonal basis, they correspond to the eigenvectors ek _{of T , as seen in section 2.2.4, and thus p}

1= p.

This eigenanalysis of the weight matrix T leads us to take the following viewpoint of the discrete Hopfield network used as a CAM:

→ The discrete Hopfield network acts as a vector projector in the sense that it projects a probe vector onto a subspace M spanned by the fundamental memory vectors. Then the underlying dynamics of the network drives the resulting projected vector to one of the corners of a unit hypercube where the energy function is minimized.

→ The p fundamental memories, spanning the subspace M, constitute a set of fixed points represented by certain corners of the unit hypercube. The other corners of this hypercube that lie in or near M are potential locations for spurious states, also referred to as spurious attractors. Spurious states represent network stable states of the Hopfield network that are different from the fundamental memories of the network and are characterized by a higher energy. Furthermore, their number increases very rapidly with the number of stored memory vectors p (Ayer et al. [3]).

Following Amit [4], the spurious states may be grouped as follows:

• Reversed fundamental memories. The spurious states are reversed (i.e. negative) versions of the fundamental memories of the network. To explain this kind of spurious state, we note that the energy function E is symmetric in the sense that its value remains unchanged if the states of the neurons are reversed (i.e. the state Vi is replaced

by −Vifor all i). Accordingly, if the fundamental memory ξµcorresponds to a particular

local minimum of the energy landscape, that same local minimum also corresponds to −ξµ_{. This sign reversal does not pose a problem in the retrieval of information if it is}

agreed to reverse all the information bits of a retrieved pattern if it is found that a particular bit designed as the “sign” bits is −1 instead of +1.

• Mixture states A mixture spurious state is a linear combination of an odd number of stored patterns. For example, consider the state

Vi= sgn(ξiµ1+ ξ µ2

i + ξ µ3

i )

which is a three-mixture spurious state. It is a state formed out of three fundamental memories ξµ1

i , ξ µ2

i , ξ µ3

i by a majority rule. The network stability condition y = sgn(T y)

is satisfied by such a state for a large network.

• Spin-glass states. This kind of spurious state is so named by analogy with spin-glass models of statistical mechanics. Spin-glass states are defined by local minima of the energy landscape that are not correlated with any of the fundamental memories of the network.

In the design of a Hopfield network as a content-addressable memory we are therefore faced with a tradeoff between two conflicting requirements:

1. the need to preserve the fundamental memory vectors as fixed points in the state space 2. the desire to have few spurious states.

In document Candela María Belluschi (página 25-29)