Research Article
An Efficient Parallelized Ontology Network-Based Semantic Similarity Measure for Big Biomedical Document Clustering
Algorithm 1
Algorithm of MapReduce-based data transformation.
Data transformation | Input: <d, list(m)> | Output: <m, list(d)> | Notation: Write (k, v) outputs <k, v> | Class mapper | Method map (d, list(m)) | For each m ∈ list(m) | Write (m, d) | End for | Class reducer | Method reduce (m, list(d)) | s ← string(list(d)) | Write (m, s) |
|