I ran a cluster analysis on sources of 370 text files, which were clustered by word similarity. The results generated were 65,536 pairs (i.e., 65,536 Pearson correlation coefficient). However, there should have 68,265 comparison pairs (369*370/2). About 2,729, or 4%, pairs were missing, and it would take a long time trying to figure out which pairs were missing. Your suggestions to fix this problem will be greatly appreciated.