Download
Hornung-et-al_2019_Block Forests_Random forests for blocks.pdf 1,21MB
WeightNameValue
1000 Titel
  • Block Forests: random forests for blocks of clinical and omics covariate data
1000 Autor/in
  1. Hornung, Roman |
  2. Wright, Marvin N. |
1000 Erscheinungsjahr 2019
1000 LeibnizOpen
1000 Publikationstyp
  1. Artikel |
1000 Online veröffentlicht
  • 2019-06-27
1000 Erschienen in
1000 Quellenangabe
  • 20:358
1000 FRL-Sammlung
1000 Copyrightjahr
  • 2019
1000 Lizenz
1000 Verlagsversion
  • https://doi.org/10.1186/s12859-019-2942-y |
  • https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6598279/ |
1000 Ergänzendes Material
  • https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-019-2942-y#additional-information |
1000 Publikationsstatus
1000 Begutachtungsstatus
1000 Sprache der Publikation
1000 Abstract/Summary
  • BACKGROUND: In the last years more and more multi-omics data are becoming available, that is, data featuring measurements of several types of omics data for each patient. Using multi-omics data as covariate data in outcome prediction is both promising and challenging due to the complex structure of such data. Random forest is a prediction method known for its ability to render complex dependency patterns between the outcome and the covariates. Against this background we developed five candidate random forest variants tailored to multi-omics covariate data. These variants modify the split point selection of random forest to incorporate the block structure of multi-omics data and can be applied to any outcome type for which a random forest variant exists, such as categorical, continuous and survival outcomes. Using 20 publicly available multi-omics data sets with survival outcome we compared the prediction performances of the block forest variants with alternatives. We also considered the common special case of having clinical covariates and measurements of a single omics data type available. RESULTS: We identify one variant termed “block forest” that outperformed all other approaches in the comparison study. In particular, it performed significantly better than standard random survival forest (adjusted p-value: 0.027). The two best performing variants have in common that the block choice is randomized in the split point selection procedure. In the case of having clinical covariates and a single omics data type available, the improvements of the variants over random survival forest were larger than in the case of the multi-omics data. The degrees of improvements over random survival forest varied strongly across data sets. Moreover, considering all clinical covariates mandatorily improved the performance. This result should however be interpreted with caution, because the level of predictive information contained in clinical covariates depends on the specific application. CONCLUSIONS: The new prediction method block forest for multi-omics data can significantly improve the prediction performance of random forest and outperformed alternatives in the comparison. Block forest is particularly effective for the special case of using clinical covariates in combination with measurements of a single omics data type.
1000 Sacherschließung
lokal Survival analysis
lokal Machine learning
lokal Cancer
lokal Statistics
lokal Random forest
lokal Prediction
lokal Multi-omics data
1000 Fächerklassifikation (DDC)
1000 Liste der Beteiligten
  1. https://orcid.org/0000-0002-6036-1495|https://frl.publisso.de/adhoc/uri/V3JpZ2h0LCBNYXJ2aW4gTi4=
1000 Label
1000 Förderer
  1. Deutsche Forschungsgemeinschaft |
1000 Fördernummer
  1. BO3139/4-3; BO3139/6-1; HO6422/1-2
1000 Förderprogramm
  1. -
1000 Dateien
  1. Block Forests_Random forests for blocks
1000 Förderung
  1. 1000 joinedFunding-child
    1000 Förderer Deutsche Forschungsgemeinschaft |
    1000 Förderprogramm -
    1000 Fördernummer BO3139/4-3; BO3139/6-1; HO6422/1-2
1000 Objektart article
1000 Beschrieben durch
1000 @id frl:6417081.rdf
1000 Erstellt am 2019-10-24T16:22:06.734+0200
1000 Erstellt von 266
1000 beschreibt frl:6417081
1000 Bearbeitet von 25
1000 Zuletzt bearbeitet Thu Jan 30 17:37:31 CET 2020
1000 Objekt bearb. Mon Oct 28 09:25:10 CET 2019
1000 Vgl. frl:6417081
1000 Oai Id
  1. oai:frl.publisso.de:frl:6417081 |
1000 Sichtbarkeit Metadaten public
1000 Sichtbarkeit Daten public
1000 Gegenstand von

View source