Research Article

Parallel Attribute Reduction Algorithm for Complex Heterogeneous Data Using MapReduce

Algorithm 4

Single condition attribute subset neighborhood-Reduce.
Input: <KEYM, VALUEM>
Output: <KEYR, VALUER>
   // let KEYR be the set of different key, and VALUER be the set of sample IDs subset value' with the same key'.
begin
  <KEYR, VALUER>=
  for <key, value>in <KEYM, VALUEM>do
   if key is not appeared in <KEYR, VALUER>
    <key, value>=<key, value>
   else
   if key=keyk
    <KEYR, VALUER>=<KEYR, VALUER>-<key, value>
    valuek=valuek value // combine samples with the same key
   end if
   end if
   <KEYR, VALUER>=<KEYR, VALUER> <key, value>
  end for
end