No paisa no feature: Cluster validation

What is cluster validation?

Cluster validation is the process of estimating the goodness of the clusters formed, that is, of checking how accurate the clustering results are.

To understand cluster validation, consider the example confusion matrix in Figure 1, obtained by classification (supervised learning) or clustering (unsupervised learning).

Figure 1: Example of the confusion matrix.

Where,

The expected result of cluster 1: C^E₁. The expected result of cluster 2: C^E₂.
The expected result of cluster 3: C^E₃. The expected result of cluster 4: C^E₄.
The obtained result of cluster 1: C^O₁. The obtained result of cluster 2: C^O₂.
The obtained result of cluster 3: C^O₃. The obtained result of cluster 4: C^O₄.

Row: Indicates how many samples belong to a particular cluster.
Column: Indicates how many samples were placed in a cluster, and which cluster each of them came from.
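
The row and column reading above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: rows are taken to be the expected clusters and columns the obtained clusters, the obtained cluster numbers are assumed to be already matched to the expected ones, and the label lists and helper names (confusion_matrix, row_totals, column_totals) are hypothetical, not taken from Figure 1.

```python
def confusion_matrix(expected, obtained, n_clusters):
    """Count samples per (expected cluster, obtained cluster) pair.

    Assumption: rows index the expected cluster, columns the obtained one.
    """
    matrix = [[0] * n_clusters for _ in range(n_clusters)]
    for e, o in zip(expected, obtained):
        matrix[e][o] += 1
    return matrix


def row_totals(matrix):
    # Row sum: how many samples truly belong to each cluster.
    return [sum(row) for row in matrix]


def column_totals(matrix):
    # Column sum: how many samples were placed in each cluster.
    return [sum(col) for col in zip(*matrix)]


# Hypothetical labels for 10 samples and 4 clusters (numbered 0 to 3).
expected = [0, 0, 0, 1, 1, 1, 2, 2, 3, 3]
obtained = [0, 0, 1, 1, 1, 2, 2, 2, 3, 0]

matrix = confusion_matrix(expected, obtained, 4)
for row in matrix:
    print(row)
print("row totals:", row_totals(matrix))
print("column totals:", column_totals(matrix))
```

Reading down one column of the printed matrix shows which expected clusters contributed samples to that obtained cluster; the diagonal entries are the samples that ended up where they were expected.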

Example: Arrangement of library books.

Calculating recall (the ability to recall) with respect to:

Cluster C₁: