EM for Gaussian Mixtures
Posted by JS in computer science, ml, tags: algorithm, statisticsThe next algorithm in my continuing series of short, hackable implementations of common machine learning algorithms is fitting a Gaussian mixture model through expectation maximization.
This example follows section 9.2 in Bishop’s PRML. You can think of this kind of EM as “soft” clustering. We assume that the data has clusters, and that the cluster that any particular data point belongs to is missing information. It is precisely this kind of hidden information that EM attempts to recover.
You can think of the algorithm as guessing the hidden cluster for each point, then assuming that guess is correct, figuring out what the remaining distribution parameters should be. Then guess again, and recompute. Repeating this process often yields a useful estimate of the hidden parameters, as well explaining the visible data.


Entries (RSS)