Research Article

An Improved Pearson’s Correlation Proximity-Based Hierarchical Clustering for Mining Biological Association between Genes

Algorithm 1

Hierarchical clustering algorithm to measure the similar gene expression pattern.
Begin
Input gene expression dataset, threshold
Let the gene expression data be represented as vectors
∣ where lies between 1 and f}
Let the Expression Matrix be EM = ( , ) and be the log ratio factor
Let and be two gene values and vector, and
Let and be average values for two vectors and ,
Measure the proximity level using
( ) ( )/
For
 Form the expression patterns for rows
For
Form the expression profiles of samples
Obtain log ratio factor
End for
End for
If ( ) Similarity gene expression patterns are obtained
Else If ( ) Similarity gene expression patterns are not obtained
End