Clustering data with a similarity matrix in Python – Tutorial

What exactly is cluster analysis?

Cluster analysis, or clustering, is the task of grouping a set of objects so that objects in the same group (a cluster) are more similar to each other than to objects in other groups. It is a core task in exploratory data mining and a common technique for statistical data analysis, used in many fields, including machine learning, pattern recognition, image analysis, information retrieval, bioinformatics, and data compression.
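
To make the definition concrete, here is a minimal sketch in plain Python. It is not necessarily the method this tutorial goes on to use: it simply treats two objects as belonging together when their similarity reaches a cutoff, either directly or through a chain of such pairs. The matrix values, the function name threshold_clusters, and the cutoff of 0.5 are all made up for illustration.

```python
# Hypothetical pairwise similarities for five objects (1.0 = identical).
# The values are invented for illustration only.
similarity = [
    [1.0, 0.9, 0.8, 0.1, 0.2],
    [0.9, 1.0, 0.7, 0.2, 0.1],
    [0.8, 0.7, 1.0, 0.1, 0.3],
    [0.1, 0.2, 0.1, 1.0, 0.9],
    [0.2, 0.1, 0.3, 0.9, 1.0],
]


def threshold_clusters(sim, threshold):
    """Label objects so that any pair with similarity >= threshold,
    linked directly or through a chain of such pairs, shares a label."""
    n = len(sim)
    labels = [-1] * n  # -1 means "not assigned to a cluster yet"
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        # Start a new cluster and collect everything reachable from it.
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and sim[i][j] >= threshold:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels


labels = threshold_clusters(similarity, threshold=0.5)
print(labels)  # [0, 0, 0, 1, 1]
```

Objects 0, 1 and 2 end up in one cluster and objects 3 and 4 in another, which matches the definition: similarities within each group (0.7 to 0.9) are higher than similarities across groups (0.1 to 0.3). Raising the cutoff splits the groups apart, and lowering it merges them, so the choice of cutoff is a modelling decision rather than something the data dictates.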