Data_Sheet_1_Random Forests Based Group Importance Scores and Their Statistical Interpretation: Application for Alzheimer's Disease.pdf (1.73 MB)

dataset
posted on 29.06.2018, 07:31 by Marie Wehenkel, Antonio Sutera, Christine Bastin, Pierre Geurts, Christophe Phillips

Machine learning approaches are increasingly used in neuroimaging to design computer-aided diagnosis systems. In this paper, we focus on the ability of these methods to provide interpretable information about the brain regions that are most informative about the disease or condition of interest. In particular, we investigate the benefit of group-based, rather than voxel-based, analyses in the context of Random Forests. Assuming a prior division of the voxels into non-overlapping groups (defined by an atlas), we propose several procedures to derive group importances from the individual voxel importances produced by Random Forests models. We then adapt several permutation schemes to turn group importance scores into more interpretable statistical scores that make it possible to identify the truly relevant groups in the importance rankings. The good behaviour of these methods is first assessed on artificial datasets. They are then applied to our own dataset of FDG-PET scans to identify the brain regions involved in the prognosis of Alzheimer's disease.
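
The two steps described in the abstract, aggregating voxel importances into group importances and then turning group scores into statistical scores by permutation, can be illustrated with a minimal sketch. This is not the authors' implementation. All function names are hypothetical, the sum/mean/max aggregations are only plausible examples of such procedures, and shuffling the class labels is just one common permutation scheme. To keep the example dependency-free, the voxel importance is passed in as a callable, and the demo uses a simple mean-difference score as a stand-in for Random Forests importances.

```python
import random
from collections import defaultdict


def group_importances(voxel_imp, groups, how="sum"):
    """Aggregate per-voxel importances into one score per group.

    `groups[j]` is the (atlas) group label of voxel j; groups do not overlap.
    """
    buckets = defaultdict(list)
    for imp, g in zip(voxel_imp, groups):
        buckets[g].append(imp)
    agg = {"sum": sum, "max": max, "mean": lambda v: sum(v) / len(v)}[how]
    return {g: agg(v) for g, v in buckets.items()}


def permutation_pvalues(X, y, groups, importance_fn, n_perm=200, how="sum", seed=0):
    """Estimate one p-value per group by shuffling the labels (assumed scheme).

    `importance_fn(X, y)` must return one importance per voxel; in the
    paper's setting this would come from a Random Forests model.
    """
    rng = random.Random(seed)
    observed = group_importances(importance_fn(X, y), groups, how)
    exceed = {g: 0 for g in observed}
    y_perm = list(y)
    for _ in range(n_perm):
        rng.shuffle(y_perm)  # break the link between voxels and labels
        null = group_importances(importance_fn(X, y_perm), groups, how)
        for g in observed:
            if null[g] >= observed[g]:
                exceed[g] += 1
    # Add-one correction keeps every p-value strictly positive.
    pvals = {g: (exceed[g] + 1) / (n_perm + 1) for g in observed}
    return observed, pvals


def mean_difference_importance(X, y):
    """Stand-in importance (NOT Random Forests): |mean(class 1) - mean(class 0)|."""
    scores = []
    for j in range(len(X[0])):
        pos = [row[j] for row, label in zip(X, y) if label == 1]
        neg = [row[j] for row, label in zip(X, y) if label == 0]
        scores.append(abs(sum(pos) / len(pos) - sum(neg) / len(neg)))
    return scores


# Toy data: 6 "voxels" in 3 groups; only group A carries signal.
rng = random.Random(1)
y = [i % 2 for i in range(40)]
groups = ["A", "A", "B", "B", "C", "C"]
X = [[rng.gauss(2.0 * label if j < 2 else 0.0, 1.0) for j in range(6)] for label in y]

scores, pvals = permutation_pvalues(X, y, groups, mean_difference_importance)
for g in sorted(scores):
    print(g, round(scores[g], 3), round(pvals[g], 3))
```

To use Random Forests here, `importance_fn` could wrap a fitted forest and return its per-voxel importances. Refitting the model on every permutation is what makes such schemes computationally expensive on real images.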
