Letter to the Editor: On the term 'interaction' and related phrases in the literature on Random Forests

Brief Bioinform. 2015 Mar;16(2):338-45. doi: 10.1093/bib/bbu012. Epub 2014 Apr 9.

Abstract

In an interesting and quite exhaustive review of Random Forests (RF) methodology in bioinformatics, Touw et al. address, among other topics, the problem of detecting interactions between variables based on RF methodology. We feel that some important statistical concepts, such as 'interaction', 'conditional dependence' or 'correlation', are sometimes employed inconsistently in the bioinformatics literature in general and in the literature on RF in particular. In this Letter to the Editor, we aim to clarify some of the central statistical concepts and point out some confusing interpretations concerning RF given by Touw et al. and other authors.

Keywords: conditional inference trees; conditional variable importance; correlation; interaction; random forest; statistics.

Publication types

  • Letter
  • Research Support, Non-U.S. Gov't
  • Comment

MeSH terms

  • Algorithms*
  • Biological Science Disciplines*
  • Data Mining*
  • Humans