Research Article

An Efficient Parallelized Ontology Network-Based Semantic Similarity Measure for Big Biomedical Document Clustering

Algorithm 1

Algorithm of MapReduce-based data transformation.
Data transformation
Input: <d, list(m)>
Output: <m, list(d)>
Notation: Write (k, v) outputs <k, v>
Class mapper
 Method map (d, list(m))
  For each m ∈ list(m)
   Write (m, d)
  End for
Class reducer
 Method reduce (m, list(d))
   s ← string(list(d))
   Write (m, s)