Apache Mahout’s goal is to build scalable machine learning libraries under the Apache license. Leveraging Apache Hadoop where practical, Mahout implements a number of machine learning algorithms for classification, clustering and collaborative filtering. In the companion article, first published on IBM’s developerWorks website, Lucid Imagination’s Chief Scientist, Grant Ingersoll, walks readers through using Mahout locally and on Amazon EC2. The code and data sets used in the article are linked below.

The ASF email sample data used in the developerWorks article on Mahout is available here.

The code used in the article is hosted on Lucid Imagination’s GitHub account.