Boltzmann Machine

  • Stochastic recurrent neural network
  • Introduced by Hinton and Sejnowski
  • Learns internal representations
  • Problem: unconstrained connectivity

Representation

The model can be represented as a graph.


States

Types:

  • Visible states
    • Represent observed data
    • Can be input/output data
  • Hidden states
    • Latent variables we want to learn
  • Bias states
    • Always set to 1, to encode the bias

Binary states

  • Unit values: $s_i \in \{0, 1\}$

Stochastic

  • Whether a state is active or not is decided stochastically

  • The decision depends on the total input (see the sketch after this list):

    $$z_{i}=b_{i}+\sum_{j} s_{j} w_{i j}$$
    • $b_i$: bias
    • $s_j$: state $j$
    • $w_{ij}$: weight between state $j$ and state $i$

    $$p\left(s_{i}=1\right)=\frac{1}{1+e^{-z_{i}}}$$
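A minimal NumPy sketch of this stochastic activation rule (function and variable names here are illustrative, not from the source):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def sample_unit(i, s, W, b, rng):
    """Stochastically set s_i: z_i = b_i + sum_j s_j * w_ij,
    then s_i = 1 with probability sigmoid(z_i)."""
    z_i = b[i] + s @ W[i]
    return int(rng.random() < sigmoid(z_i))

# Tiny usage example with a hand-picked weight matrix
rng = np.random.default_rng(0)
W = np.array([[0.0, 1.0], [1.0, 0.0]])   # symmetric, zero diagonal
b = np.array([0.0, -0.5])
s = np.array([1, 0])
s[1] = sample_unit(1, s, W, b, rng)
```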

Connections

  • Graph can be fully connected (no restrictions)

  • Undirected (see the sketch after this list):

    $$w_{ij} = w_{ji}$$

  • No self-connections:

    $$w_{ii} = 0$$
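A quick sketch of enforcing these connectivity constraints on a randomly initialized weight matrix (sizes and names are my choosing):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5
W = rng.normal(scale=0.1, size=(n, n))
W = (W + W.T) / 2.0          # undirected: enforce w_ij == w_ji
np.fill_diagonal(W, 0.0)     # no self-connections: w_ii == 0

assert np.allclose(W, W.T) and not np.any(np.diag(W))
```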

Energy

Energy of the network

$$\begin{aligned} E &= -S^{\mathrm{T}} W S - b^{\mathrm{T}} S \\ &= -\sum_{i<j} w_{ij} s_{i} s_{j} - \sum_{i} b_{i} s_{i} \end{aligned}$$

Probability of input vector $v$:

$$p(v)= \frac{e^{-E(v)}}{\displaystyle \sum_{u} e^{-E(u)}}$$
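A brute-force sketch of the energy and of $p(v)$; enumerating all $2^n$ state vectors for the normalizing sum is only feasible for toy sizes (names are illustrative):

```python
import numpy as np
from itertools import product

def energy(s, W, b):
    """E(s) = -s^T W s - b^T s, as in the formula above."""
    return -(s @ W @ s) - (b @ s)

def probability(v, W, b):
    """p(v) = exp(-E(v)) / sum_u exp(-E(u)); the partition sum
    enumerates every binary state vector u."""
    states = [np.array(u) for u in product([0, 1], repeat=len(v))]
    Z = sum(np.exp(-energy(u, W, b)) for u in states)
    return np.exp(-energy(np.array(v), W, b)) / Z

W = np.array([[0.0, 0.5], [0.5, 0.0]])
b = np.zeros(2)
print(probability([1, 1], W, b))  # lower-energy states get higher probability
```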

Updating the nodes

  • Updating decreases the energy of the network on average

  • The network reaches a local minimum (equilibrium)

  • The stochastic process helps avoid getting stuck in local minima (see the sketch after this list)

    $$\begin{array}{c} p\left(s_{i}=1\right)=\frac{1}{1+e^{-z_{i}}} \\ z_{i}=\Delta E_{i}=E_{s_{i}=0}-E_{s_{i}=1} \end{array}$$
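A sketch of the settling process, repeatedly updating units in random order (the sweep count and all names are assumptions):

```python
import numpy as np

def settle(s, W, b, rng, sweeps=100):
    """Run stochastic updates; each uses p(s_i = 1) = sigmoid(z_i)
    with z_i = Delta E_i, so the energy decreases on average."""
    for _ in range(sweeps):
        for i in rng.permutation(len(s)):
            z_i = b[i] + s @ W[i]            # energy gap Delta E_i
            s[i] = int(rng.random() < 1.0 / (1.0 + np.exp(-z_i)))
    return s
```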

Simulated Annealing


Use a temperature $T$ to allow for more changes in the beginning (see the sketch after this list):

  • Start with a high temperature

  • β€œAnneal” by slowly lowering $T$

  • Can escape from local minima πŸ‘
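A sketch of simulated annealing, assuming the temperature enters by scaling the input as $z_i / T$ (a common convention) and using a geometric cooling schedule of my choosing:

```python
import numpy as np

def anneal(s, W, b, rng, T_start=10.0, T_end=0.1, steps=200):
    """High T makes updates nearly random (can escape minima);
    low T makes them nearly deterministic (settles into a minimum)."""
    for T in np.geomspace(T_start, T_end, steps):
        for i in rng.permutation(len(s)):
            z = (b[i] + s @ W[i]) / T
            s[i] = int(rng.random() < 1.0 / (1.0 + np.exp(-z)))
    return s
```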

Search Problem

  • Input is set and fixed (clamped)

  • Annealing is performed (see the sketch after this list)

  • The answer is presented at the output

  • Hidden units add extra representational power
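A sketch of the search procedure, assuming a boolean mask marks which units are clamped (all names are illustrative):

```python
import numpy as np

def search(s, clamped, W, b, rng, T_start=10.0, T_end=0.1, steps=200):
    """Anneal while holding the clamped (input) units fixed; afterwards
    the answer is read off the free output units of s.

    `clamped` is a boolean array of the same length as s."""
    free = np.flatnonzero(~clamped)
    for T in np.geomspace(T_start, T_end, steps):
        for i in rng.permutation(free):
            z = (b[i] + s @ W[i]) / T
            s[i] = int(rng.random() < 1.0 / (1.0 + np.exp(-z)))
    return s
```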

Learning Problem

  • Situation

    • Data vectors are presented to the network
  • Problem

    • Learn weights that generate these data with high probability
  • Approach

    • Perform small updates on the weights (see the sketch after this list)
    • Each time, solve the search problem
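The note does not spell out the weight updates; the classical rule for Boltzmann machines (due to Hinton and Sejnowski) compares pairwise correlations in a clamped phase against a free-running phase:

$$\Delta w_{ij} = \eta \left( \langle s_i s_j \rangle_{\text{data}} - \langle s_i s_j \rangle_{\text{model}} \right)$$

Here $\eta$ is a learning rate, $\langle \cdot \rangle_{\text{data}}$ is measured with the visible units clamped to training vectors, and $\langle \cdot \rangle_{\text{model}}$ with the network running freely at equilibrium.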

Pros & Cons

βœ… Pros

  • A Boltzmann machine with enough hidden units can compute any function

⛔️ Cons

  • Training is very slow and computationally expensive 😒

Restricted Boltzmann Machine

  • A Boltzmann machine with an additional restriction

  • Graph must be bipartite

    • Set of visible units

    • Set of hidden units

  • βœ… Advantage

    • No connections between hidden units
    • Efficient training

Energy


Energy:

$$\begin{aligned} E(v, h) &= -a^{\mathrm{T}} v-b^{\mathrm{T}} h-v^{\mathrm{T}} W h \\ &= -\sum_{i} a_{i} v_{i}-\sum_{j} b_{j} h_{j}-\sum_{i} \sum_{j} v_{i} w_{i j} h_{j} \end{aligned}$$

Probability of hidden unit:

$$p\left(h_{j}=1 \mid v\right)=\sigma\left(b_{j}+\sum_{i=1}^{m} W_{i j} v_{i}\right)$$

Probability of a visible unit:

$$p\left(v_{i}=1 \mid h\right)=\sigma\left(a_{i}+\sum_{j=1}^{F} W_{i j} h_{j}\right)$$

> $\sigma(x)=\frac{1}{1+e^{-x}}$
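Because the graph is bipartite, each layer is conditionally independent given the other, so a whole layer can be sampled in one vectorized step. A NumPy sketch of the two conditionals above (shapes and names are my assumptions: $W$ is $m \times F$, $a$ the visible bias, $b$ the hidden bias):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sample_h_given_v(v, W, b, rng):
    """p(h_j = 1 | v) = sigmoid(b_j + sum_i W_ij v_i), for all j at once."""
    p = sigmoid(b + v @ W)                     # shape (F,)
    return (rng.random(p.shape) < p).astype(float)

def sample_v_given_h(h, W, a, rng):
    """p(v_i = 1 | h) = sigmoid(a_i + sum_j W_ij h_j), for all i at once."""
    p = sigmoid(a + W @ h)                     # shape (m,)
    return (rng.random(p.shape) < p).astype(float)

# One Gibbs step v -> h -> v', the building block of efficient RBM training
rng = np.random.default_rng(0)
m, F = 6, 3
W = rng.normal(scale=0.1, size=(m, F))
a, b = np.zeros(m), np.zeros(F)
v = rng.integers(0, 2, size=m).astype(float)
h = sample_h_given_v(v, W, b, rng)
v_recon = sample_v_given_h(h, W, a, rng)
```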

Free Energy:

$$\begin{array}{l} e^{-F(v)}=\displaystyle\sum_{h} e^{-E(v, h)} \\ F(v)=-\displaystyle\sum_{i=1}^{m} v_{i} a_{i}-\sum_{j=1}^{F} \log \left(1+e^{z_{j}}\right) \\ z_{j}=b_{j}+\displaystyle\sum_{i=1}^{m} W_{i j} v_{i} \end{array}$$
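A small sketch of the free-energy computation, reusing the names from the previous sketch and using `logaddexp` for numerical stability:

```python
import numpy as np

def free_energy(v, W, a, b):
    """F(v) = -sum_i v_i a_i - sum_j log(1 + exp(z_j)),
    with z_j = b_j + sum_i W_ij v_i."""
    z = b + v @ W                                    # shape (F,)
    return -(v @ a) - np.sum(np.logaddexp(0.0, z))   # log(1 + e^z), stable
```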