Research Article
Automatic Extraction of Web Page Text Information Based on Network Topology Coincidence Degree
Table 3
Initial parameter settings of web site extraction.
| Parameter | Numerical value | Parameter | Numerical value |
| Number of download threads | 15 | Detection depth limit | 4 | Number of extracted information | 20 | Number of web addresses not detected | 5000 | Number of web addresses detected | 211 | Data set | 200 M |
|
|