Complexity

Research Article

Tensor Decomposition for Multiple-Instance Classification of High-Order Medical Data

TensMIL (training)

Input: training and test instances’ features U_train and U_test, subjects’ training labels Y_train, percentage of variance retained by PCA , the number of bins used for the histograms
Output: prediction model
1. Concatenate U_train and U_test along the first dimension into a matrix U.
2. Perform PCA for decorrelation and dimensionality reduction on the concatenated matrix U and get the scores T, using the m-leading singular values that preserve of data variance.
3. Split the truncated scores matrix T into the corresponding T_train and T_test (will be used in the testing phase) scores matrix.
4. Train a robust full quadratic regression model (Equation (10)) using T_train and y_train (the instance labels inherited by the corresponding bag labels) and get the instance labels predictions Pred_train for each instance
5. Split the vector Pred_train into subsets of equal sizes and store the cutting points to be used as histogram bin edges in the testing phase
6. For each of the training bags calculate the normalized cumulative histogram and construct the feature matrix A_train
7. Fit a QDA model to map A_train to Y_train (Equation (12)).