Research Article
n-Gram-Based Text Compression
Algorithm 4
Pseudocode of the decompression phase.
input: The encoded stream | output: The decoded stream | () inputstring ← encodedstream | () while length of inputstring > 0 do | () firstbyte = read first byte from the inputstring | () delete first byte of the the inputstring | () dict = get value of three bits of firstbyte | () if dict ≤ 5 then | () number = getnumberbytereadmore(dict) | () bytereadmore = read number byte more from the inputstring | () delete number byte of the inputstring | () indexstring = get last five bits of the firstbyte + the bytereadmore | () indexvalue = get value of the the indexstring | () output += decompress(indexvalue, dict) | () end | () else if dict = 6 then | () output += newline | () end | () else | () number = value of five last bits of the firstbyte | () bytereadmore = read number byte more from the inputstring | () output += decode for the bytereadmore | () end | () end |
|