Research Article
An Efficient Parallelized Ontology Network-Based Semantic Similarity Measure for Big Biomedical Document Clustering
Algorithm 1
Algorithm of MapReduce-based data transformation.
| Data transformation | | Input: <d, list(m)> | | Output: <m, list(d)> | | Notation: Write (k, v) outputs <k, v> | | Class mapper | | Method map (d, list(m)) | | For each m ∈ list(m) | | Write (m, d) | | End for | | Class reducer | | Method reduce (m, list(d)) | | s ← string(list(d)) | | Write (m, s) |
|