Random forest analysis to predict disease-free survival using FDG-PET and CT in non-small cell lung cancer

Congress:

ECR 2018

Poster Number:

C-2980

Type:

Scientific Exhibit

Keywords:

Hybrid Imaging, Lung, Computer applications, PET-CT, Image manipulation / Reconstruction, Neural networks, Computer Applications-Detection, diagnosis, Outcomes analysis, Experimental investigations, Tissue characterisation, Cancer, Outcomes

Authors:

M. Kirienko¹, L. Lozza², N. Gennaro¹, A. Rossi¹, E. Voulaz¹, A. Chiti¹, M. Sollini¹; ¹Milan/IT, ²Bergamo/IT

DOI:

10.1594/ecr2018/C-2980

DOI-Link:

https://dx.doi.org/10.1594/ecr2018/C-2980

Fig. 4: Fig. 4 Low-risk patient:CT, axial PET, PET/CT and a coronal reconstruction of...

Fig. 5: Fig.5 High-risk patient: CT, axial PET, PET/CT and a coronal reconstruction of...

Fig. 6: Fig. 6 Three-dimensional reconstruction of PET images in a low-risk patient

Fig. 7: Three-dimensional reconstruction of PET images in a high-risk patient.

Methods and materials

Random forests for classification were developed keeping the same training and validation sets as for the parametric analysis, to predict DFS. Seven different combinations of variables were considered: Clinical (263 patients, 5 features), CT (295, 41), PET (258, 43), PET+CT (258, 84), CT+Clinical (263, 46), PET+Clinical (231, 48), PET+CT+Clinical (231, 89)

Random forests for classification were developed keeping the same training and validation sets as for the parametric analysis. The outcome to be predicted was the DFS considered until the date of last access or the date of relapse (0=DSF / 1=relapse).

Seven different combinations of variables were considered among clinical predictors and imaging features derived from CT and PET :

• Clinical (263 patients, 5 features)

• CT (295 patients, 41 features)

• PET (258 patients, 43 features)

• PET + CT (258 patients, 84 features)

• CT + Clinical (263 patients, 46 features)

• PET + Clinical (231 patients, 48 features)

• PET + CT + Clinical (231 patients, 89 features)

For each dataset, a random forest model was built considering different number of trees and different split dimensions. Moreover, the relative weight assigned to the output classes was explored. Once hyper-parameters with a better performance in terms of AUC were identified, feature importance was extracted for the optimal models. Additional models were created considering the features with importance greater than the 25th, 50th, 75th and 80th quantiles, respectively. The search of the best split dimension was performed again on these new trees.

Clinical cases of PET/CT images of a low-risk and a high-risk patient are shown in Fig.4-6 and Fig.5-7.