Exponential Family
1. Introduction
A probability density in the exponential family takes the following form:
where is actually called sufficient statistics and is a normalization term (also noted as log-partition function) with the form
2. Sufficient Statistics
Definition. We say a function of random variable , , to be sufficient for if the conditional distribution of given is not a function of , i.e.
An alternative definition from a Bayesian perspective argues that is sufficient for if is conditionally independent from given .
Theorem. If a function of random variable , , satisfies
Then it is sufficient for .
Corollary. in the p.d.f. of exponential family is sufficient for .
3. Properties
- The log-partition function satisfies that
which is equal to say
- The second derivatives,
4. Legendre Conjugate
Definition. Let be a convex function. The Legendre Transform of is defined as
is called the convex conjugate of .
Notice
As we can see, is obviously convex.
.
5. Legendre Conjugate of Log-partition
Now consider the exponential family, the conjugate duality representation of convex functions is
The supremum is obtain at
and therefore
Therefore, can be interpreted as the negative entropy of where is the exponential family such that .
6. Exponential Family and EM algorithm
We derive the exact EM algorithm for exponential families with latent variables. Given observed variables and latent variables , we consider
and
The MLE for our parameters is obtained by maximizing the incomplete log-likelihood of the data:
where we define
The variational representation gives,
Therefore, we obtain a lower bound for the incomplete log-likelihood:
EM is thus a coordinate ascent on the lower bound
E step is called expectation step because the maximizer of for fixed is, by duality, the expectation
I guess here it is confusing, cause , but instead something only integral on .
7. Reference
Joan Bruna's DS-GA.1005 Inference and Representation Lecture Notes, Lecture 8.
Michael I. Jordan, An Introdution to Probabilistic Graphical Models, Chapter 8.