Logo image
Study design in high-dimensional classification analysis
Journal article   Open access   Peer reviewed

Study design in high-dimensional classification analysis

Brisa N. Sanchez, Meihua Wu, Peter X. K. Song and Wen Wang
Biostatistics (Oxford, England), v 17(4), pp 722-736
01 Oct 2016
PMID: 27154835
Featured in Collection :   UN Sustainable Development Goals @ Drexel
url
https://pmc.ncbi.nlm.nih.gov/articles/PMC5031947/View
Published, Version of Record (VoR)Maybe Open Access (Publisher Bronze) Open
url
https://doi.org/10.1093/biostatistics/kxw018View
Published, Version of Record (VoR) Open

Abstract

Life Sciences & Biomedicine Mathematical & Computational Biology Science & Technology Statistics & Probability Mathematics Physical Sciences
Advances in high throughput technology have accelerated the use of hundreds to millions of biomarkers to construct classifiers that partition patients into different clinical conditions. Prior to classifier development in actual studies, a critical need is to determine the sample size required to reach a specified classification precision. We develop a systematic approach for sample size determination in high-dimensional (large p small n) classification analysis. Our method utilizes the probability of correct classification (PCC) as the optimization objective function and incorporates the higher criticism thresholding procedure for classifier development. Further, we derive the theoretical bound of maximal PCC gain from feature augmentation (e.g. when molecular and clinical predictors are combined in classifier development). Our methods are motivated and illustrated by a study using proteomics markers to classify post-kidney transplantation patients into stable and rejecting classes.

Metrics

15 Record Views
12 citations in Scopus

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Industry collaboration
Domestic collaboration
Web of Science research areas
Mathematical & Computational Biology
Statistics & Probability
Logo image