home changes contents help options

Possible Mentors

Description

Clustering is the assignment of objects into groups (called clusters) so that objects from the same cluster are more similar to each other than objects from different clusters. Often similarity is assessed according to a distance measure. Clustering is a common technique for statistical data analysis, which is used in many fields, including machine learning, data mining, pattern recognition, image analysis and bioinformatics. The purpose of this project is to develop a set of tools in the form of Pd patches and abstractions that make various clustering techniques available in Pd. This could include (but not limited to) k-Means clustering, Principal Components Analysis and Multi-dimensional Scaling analysis.

Related projects

Resources to start:

Required Skills

Possible Breakdown of Steps