Restricted Boltzmann Machine and Deep Belief Net

Various applications of restricted Boltzmann machines for bad quality training data. Maciej Zięba, Wroclaw University of Technology, 20.06.2014.

Assuming we know the connection weights in our RBM (we'll explain how to learn these below), to update the state of unit i: 1.
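A common way to carry out such an update, sketched here under assumptions since the steps are cut off in the text: compute the unit's activation from its bias and the weighted states of the units it connects to, squash it with the logistic function, and switch the unit on with that probability. The names `update_hidden_unit`, `W`, `hidden_bias` and the example numbers are illustrative and do not come from the source.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def update_hidden_unit(i, v, W, hidden_bias, rng):
    """Resample hidden unit i given the visible vector v (assumed binary units)."""
    # Activation: bias plus the weighted sum of the connected visible states.
    a_i = hidden_bias[i] + v @ W[:, i]
    # Probability of the unit being on.
    p_on = sigmoid(a_i)
    # Turn the unit on with probability p_on, off otherwise.
    state = 1 if rng.random() < p_on else 0
    return state, p_on

rng = np.random.default_rng(0)
# Hypothetical weights: 3 visible units (rows) by 2 hidden units (columns).
W = np.array([[2.0, -1.0],
              [0.5, 0.0],
              [-3.0, 1.0]])
hidden_bias = np.zeros(2)
v = np.array([1, 1, 0])

state, p_on = update_hidden_unit(0, v, W, hidden_bias, rng)
print(state, round(p_on, 3))
```

Because an RBM has no hidden-to-hidden connections, the same computation can be applied to all hidden units at once as `sigmoid(hidden_bias + v @ W)`.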

Deep architectures are composed of multiple levels of non-linear operations, such as neural nets with many hidden layers.

In this post, I will try to shed some light on the intuition behind Restricted Boltzmann Machines and the way they work.

The historical review shows that significant progress has been made in this field.

Since learning takes a gradient descent approach, learning can be extremely slow, due to repeated

Mean field theory turning BM to deterministic by

Another expensive method for global optimization

Most properties of offspring are inherited from

Each parent contributes different part of the

Biological evolution survival of the fittest

Genes that contribute to greater fitness are more

Genetic algorithm (relying more on cross-over)

Evolutionary programming (mutation is the primary

Evolutionary strategies (using real-value vectors

represented as a string of symbols (genes and

Population of individuals (at current generation)

Fitness function f estimates the goodness of

randomly select a pair of parents from the

individuals with higher fitness function values

crossover allows offspring to inherit and combine

mutation (randomly altering genes) may produce

Bad individuals are thrown away when the limit of

Plus sub-optimal states generated from fast

All individuals in the population are almost

Population size must be large (but how large?)
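The ingredients listed above (a population of individuals encoded as strings of genes, a fitness function, fitness-biased parent selection, crossover, mutation, and discarding bad individuals once the population limit is reached) can be put together in a minimal sketch. Everything concrete here is an assumption for illustration: the bit-string encoding, the toy fitness that counts 1-genes, single-point crossover, and all parameter values.

```python
import random

def fitness(ind):
    # Hypothetical fitness: the number of 1-genes in the individual.
    return sum(ind)

def select_parent(pop, rng):
    # Individuals with higher fitness are more likely to be chosen.
    # The +1 keeps every weight positive even for an all-zero individual.
    weights = [fitness(ind) + 1 for ind in pop]
    return rng.choices(pop, weights=weights, k=1)[0]

def crossover(a, b, rng):
    # Single-point crossover: the child combines a prefix of a with a suffix of b.
    point = rng.randrange(1, len(a))
    return a[:point] + b[point:]

def mutate(ind, rate, rng):
    # Flip each gene independently with probability `rate`.
    return [1 - g if rng.random() < rate else g for g in ind]

def evolve(pop_size=30, n_genes=20, generations=40, rate=0.02, seed=0):
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_genes)] for _ in range(pop_size)]
    for _ in range(generations):
        children = []
        for _ in range(pop_size):
            child = crossover(select_parent(pop, rng), select_parent(pop, rng), rng)
            children.append(mutate(child, rate, rng))
        # Bad individuals are thrown away once the population limit is exceeded.
        pop = sorted(pop + children, key=fitness, reverse=True)[:pop_size]
    return pop

final_pop = evolve()
best = max(final_pop, key=fitness)
print(fitness(best), best)
```

Keeping parents and children in the same pool before truncation means the best fitness never decreases between generations, but it also speeds up the loss of diversity mentioned above.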

some components may be missing or corrupted, some components may be permanently clamped to the

The input is represented by the visible units. The output values can be represented as a discrete value, a real value, or a vector of values; tolerant to noise in input data; time factor.

the set of vectors appearing on the hidden

clamping phase: each exemplar is clamped to

free-run phase: none of the visible nodes is

probability that the system is stabilized

learning is to construct the weight matrix such

A measure of the closeness of two probability

BM learning takes the gradient descent approach

1.1. clamp one training vector to the visible

schedule until equilibrium is reached at a

1.3. continue to run the network for many cycles

After each cycle, determine which pairs of

1.4. average the co-occurrence results from 1.3

1.5. repeat steps 1.1 to 1.4 for all training

average the co-occurrence results to estimate

the same steps as 1.1 to 1.5 except no visible
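The two-phase procedure above can be sketched as follows. This is a simplified illustration and not the source's exact algorithm: it samples at a fixed temperature instead of following an annealing schedule, it assumes binary 0/1 units with a symmetric weight matrix and zero diagonal, it omits biases, and it uses the standard update in which each weight moves in proportion to the difference between the clamped and free-running co-occurrence estimates. The function names and all parameter values are hypothetical.

```python
import numpy as np

def gibbs_sweep(s, W, free, rng, T=1.0):
    # Resample each free unit in turn, given the current states of all others.
    for i in free:
        net = W[i] @ s  # the diagonal of W is zero, so s[i] does not contribute
        p_on = 1.0 / (1.0 + np.exp(-net / T))
        s[i] = 1.0 if rng.random() < p_on else 0.0
    return s

def cooccurrence(W, n_vis, rng, data=None, burn_in=20, cycles=50):
    """Estimate how often each pair of units is on together."""
    n = W.shape[0]
    stats = np.zeros((n, n))
    runs = data if data is not None else [None]
    for v in runs:
        s = rng.integers(0, 2, size=n).astype(float)
        if v is None:
            free = range(n)          # free-run phase: no unit is clamped
        else:
            s[:n_vis] = v            # clamping phase: visible units are fixed
            free = range(n_vis, n)
        for _ in range(burn_in):     # let the network settle first
            gibbs_sweep(s, W, free, rng)
        for _ in range(cycles):      # then record which pairs are on together
            gibbs_sweep(s, W, free, rng)
            stats += np.outer(s, s)
    return stats / (len(runs) * cycles)

def learning_step(W, data, n_vis, rng, lr=0.1):
    p_clamped = cooccurrence(W, n_vis, rng, data=data)
    p_free = cooccurrence(W, n_vis, rng)
    dW = lr * (p_clamped - p_free)
    np.fill_diagonal(dW, 0.0)        # no self-connections
    return W + dW

rng = np.random.default_rng(0)
n_vis, n_hid = 3, 2
W = np.zeros((n_vis + n_hid, n_vis + n_hid))
data = np.array([[1, 0, 1], [1, 1, 0]], dtype=float)
W_new = learning_step(W, data, n_vis, rng)
print(np.round(W_new, 3))
```

Every learning step requires many sampling sweeps in both phases, which is one reason this kind of learning is slow in practice.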

BM is a stochastic machine, not a deterministic

It has higher representational/computational power.

Restricted Boltzmann Machines and Deep Networks for Unsupervised Learning

Asymptotic Behavior of Stochastic Complexity of Complete Bipartite Graph-Type Boltzmann Machines

Methods: Restricted Boltzmann Machines (RBM). An RBM is a bipartite Markov Random Field with visible and hidden units.

$p(v,h) \propto e^{-E(v,h)}$. Energy of a joint configuration: $-E(v,h) = \sum_{i \in \mathrm{vis}} v_i b_i + \sum_{k \in \mathrm{hid}} h_k b_k + \sum_{i \ldots} v_i v_j w_{ij} \ldots$
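To make the relation between energy and probability concrete, here is a small sketch for the restricted (bipartite) case. It is an assumption that only visible-hidden weights are present, so the visible-visible term $v_i v_j w_{ij}$ shown above is dropped and the energy becomes $-E(v,h) = \sum_i v_i b_i + \sum_k h_k b_k + \sum_{i,k} v_i h_k w_{ik}$. The brute-force normalization is only feasible for a handful of units, and all names and numbers are illustrative.

```python
import itertools
import numpy as np

def energy(v, h, b_vis, b_hid, W):
    # E(v,h) = -(sum_i v_i b_i + sum_k h_k b_k + sum_{i,k} v_i h_k w_ik)
    return -(v @ b_vis + h @ b_hid + v @ W @ h)

def joint_distribution(b_vis, b_hid, W):
    """Enumerate every binary (v, h) pair and normalize exp(-E)."""
    states, weights = [], []
    for v in itertools.product([0, 1], repeat=len(b_vis)):
        for h in itertools.product([0, 1], repeat=len(b_hid)):
            v_arr = np.array(v, dtype=float)
            h_arr = np.array(h, dtype=float)
            states.append((v, h))
            weights.append(np.exp(-energy(v_arr, h_arr, b_vis, b_hid, W)))
    weights = np.array(weights)
    # Dividing by the sum (the partition function) turns weights into probabilities.
    return states, weights / weights.sum()

# Hypothetical parameters: 2 visible units, 1 hidden unit.
b_vis = np.array([0.1, -0.2])
b_hid = np.array([0.3])
W = np.array([[1.0], [-0.5]])

states, probs = joint_distribution(b_vis, b_hid, W)
for (v, h), p in zip(states, probs):
    print(v, h, round(float(p), 4))
```

Configurations with lower energy receive higher probability, which is what the proportionality $p(v,h) \propto e^{-E(v,h)}$ expresses.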