Research Article

DLLog: An Online Log Parsing Approach for Large-Scale System

Table 5: Summary of datasets.

Dataset      Dataset size  Logs         Log templates  Explanations
HDFS         1.47 GB       11,175,629   30             Distributed system logs
Hadoop       48.61 MB      394,308      298            Distributed system logs
Spark        2.75 GB       33,236,604   456            Distributed system logs
OpenStack    60.01 MB      207,820      51             Distributed system logs
ZooKeeper    9.95 MB       74,380       95             Distributed system logs
BGL          708.76 MB     4,747,963    619            Supercomputer logs
HPC          32.00 MB      433,489      104            Supercomputer logs
Thunderbird  29.60 GB      211,212,192  4,040          Supercomputer logs
Linux        2.25 MB       25,567       488            Operating system logs
Mac          16.09 MB      117,283      2,214          Operating system logs
Windows      26.09 GB      114,608,388  4,833          Operating system logs
OpenSSH      70.02 MB      655,146      62             Service application logs
Apache       4.90 MB       56,481       44             Service application logs
Android      3.38 GB       30,348,042   76,923         Mobile system logs
HealthApp    22.44 MB      253,395      220            Mobile system logs
Proxifier    2.42 MB       21,329       9              Standalone software logs