This repository contains a gentle introduction to nonnegative matrix factorization (NMF). It gives an overview of the method's technical details, the most common numerical algorithms for solving the factorization problem, and its recent extensions.
We consider two applications of NMF to the fields of image processing and text mining.
In the first project, the painting Madonna of the Goldfinch (Madonna del Cardellino) by the Italian artist Raphael is optimally compressed through a low-rank approximation of the pixel intensities matrix.
In the second project, NMF is applied to recover the latent topics present in the complete works by the English poet and playwright William Shakespeare and classify the document corpus accordingly.
The analysis code for the image processing application is contained in the R Markdown report ImageCompression.Rmd
.
If you just what to have a look at the rendered notebook, please refer to ImageCompression.html
.
The Workspace
folder collects the workspace images with the estimated NMF models.
The text analysis of Shakespeare's works is illustrated in the Report on NMF
file (Section 2).
Suggestions and feedback are welcome (mail)!
Link to notebook on Kaggle: https://www.kaggle.com/elenageminiani/nmf-and-image-compression.