Medizin - Open Access LMU - Teil 16/22

Optimal classifier selection and negative bias in error rate estimation: an empirical study on high-dimensional prediction



Background: In biometric practice, researchers often apply a large number of different methods in a "trial-and-error" strategy to get as much as possible out of their data and, due to publication pressure or pressure from the consulting customer, present only the most favorable results. This strategy may induce a substantial optimistic bias in prediction error estimation, which is quantitatively assessed in the present manuscript. The focus of our work is on class prediction based on high-dimensional data (e.g. microarray data), since such analyses are particularly exposed to this kind of bias.

Methods: In our study we consider a total of 124 variants of classifiers (possibly including variable selection or tuning steps) within a cross-validation evaluation scheme. The classifiers are applied to original and modified real microarray data sets, some of which are obtained by randomly permuting the class labels to mimic non-informative predictors while preserving their correlation structure.

Results: We assess the minimal misclassification rate over the different variants of classifiers in order to quantify the bias arising when the optimal classifier is selected a posteriori in a data-driven manner. The bias resulting from the parameter tuning (including gene selection parameters as a special case) and the bias resulting from the choice of the classification method are examined both separately and jointly.

Conclusions: The median minimal error rate over the investigated classifiers was as low as 31% and 41% based on permuted uninformative predictors from studies on colon cancer and prostate cancer, respectively. Because the permuted labels carry no information, an unbiased error estimate for these data would be expected to lie near chance level, so these values reflect selection bias alone. We conclude that the strategy of presenting only the optimal result is not acceptable because it yields a substantial bias in error rate estimation, and suggest alternative approaches for properly reporting classification accuracy.
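The selection effect described in the abstract can be reproduced on a small scale. The following is a minimal sketch, not the study's setup: it assumes synthetic independent Gaussian noise instead of real microarray data (so the correlation structure is not preserved), balanced shuffled labels, and only six classifier variants (a nearest-centroid rule with a varying number of selected features, `ks`) instead of 124. All function names and parameter values are illustrative choices.

```python
import random
import statistics


def make_data(n, p, rng):
    """Gaussian noise features with balanced labels that carry no information."""
    X = [[rng.gauss(0.0, 1.0) for _ in range(p)] for _ in range(n)]
    y = [i % 2 for i in range(n)]
    rng.shuffle(y)  # permuted labels: the predictors are uninformative
    return X, y


def centroids(X, y, idx, feats):
    """Per-class mean of the given features over the training indices."""
    out = {}
    for c in (0, 1):
        rows = [X[i] for i in idx if y[i] == c]
        out[c] = [sum(r[j] for r in rows) / len(rows) for j in feats]
    return out


def cv_errors(X, y, ks, n_folds, rng):
    """Cross-validated error rate of each variant (one variant per k)."""
    n, p = len(X), len(X[0])
    order = list(range(n))
    rng.shuffle(order)
    folds = [order[f::n_folds] for f in range(n_folds)]
    wrong = {k: 0 for k in ks}
    for test in folds:
        held_out = set(test)
        train = [i for i in order if i not in held_out]
        cen = centroids(X, y, train, range(p))
        # Feature ranking uses the training fold only, so each single
        # variant is evaluated without information leakage.
        ranked = sorted(range(p), key=lambda j: -abs(cen[0][j] - cen[1][j]))
        for k in ks:
            feats = ranked[:k]
            for i in test:
                d = [sum((X[i][j] - cen[c][j]) ** 2 for j in feats)
                     for c in (0, 1)]
                pred = 0 if d[0] <= d[1] else 1
                wrong[k] += int(pred != y[i])
    return {k: wrong[k] / n for k in ks}


def simulate(n_datasets=20, n=40, p=200, ks=(1, 2, 5, 10, 20, 50), seed=0):
    """Median over data sets of the minimal and of the average variant error."""
    rng = random.Random(seed)
    mins, means = [], []
    for _ in range(n_datasets):
        X, y = make_data(n, p, rng)
        errs = cv_errors(X, y, ks, 5, rng)
        mins.append(min(errs.values()))  # error reported after a posteriori selection
        means.append(statistics.mean(errs.values()))
    return statistics.median(mins), statistics.median(means)


median_min, median_mean = simulate()
print(f"median minimal error over variants: {median_min:.3f}")
print(f"median average error over variants: {median_mean:.3f}")
```

Each variant on its own is cross-validated correctly; the optimism enters only at the last step, when the smallest of several noisy error estimates is reported. By construction the minimum can never exceed the average, and the gap between the two printed values gives a rough impression of the selection bias in this toy setting.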

Medizin - Open Access LMU - Teil 16/22, by Ludwig-Maximilians-Universität München

