A genetic algorithm for variable selection in logistic regression analysis of radiotherapy treatment outcomes
Journal article   Open access   Peer reviewed


Olivier Gayou, Shiva K Das, Su-Min Zhou, Lawrence B Marks, David S Parda and Moyed Miften
Medical physics (Lancaster), v 35(12), pp 5426-5433
Dec 2008
PMID: 19175102
url
https://doi.org/10.1118/1.3005974
Published, Version of Record (VoR); Open Access (License Unspecified)

Abstract

Keywords: Algorithms; Area Under Curve; Carcinoma, Non-Small-Cell Lung - radiotherapy; Humans; Lung Neoplasms - radiotherapy; Models, Genetic; Models, Statistical; Models, Theoretical; Neoplasms - radiotherapy; Radiometry; Radiotherapy - methods; Radiotherapy Planning, Computer-Assisted; Regression Analysis; ROC Curve; Treatment Outcome
A given outcome of radiotherapy treatment can be modeled by analyzing its correlation with a combination of dosimetric, physiological, biological, and clinical factors, through a logistic regression fit of a large patient population. The quality of the fit is measured by the combination of the predictive power of this particular set of factors and the statistical significance of the individual factors in the model. We developed a genetic algorithm (GA), in which a small sample of all the possible combinations of variables is fitted to the patient data. New models are derived from the best models, through crossover and mutation operations, and are in turn fitted. The process is repeated until the sample converges to the combination of factors that best predicts the outcome. The GA was tested on a data set from a study of the incidence of lung injury in non-small-cell lung cancer (NSCLC) patients treated with three-dimensional conformal radiotherapy (3DCRT). The GA identified a model with two variables as the best predictor of radiation pneumonitis: the V30 (p=0.048) and the ongoing use of tobacco at the time of referral (p=0.074). This two-variable model was confirmed as the best model by analyzing all possible combinations of factors. In conclusion, genetic algorithms provide a reliable and fast way to select significant factors in logistic regression analysis of large clinical studies.
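The procedure described in the abstract (fit a small sample of variable subsets, keep the best, derive new subsets by crossover and mutation, repeat) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the function names (`fit_logistic`, `auc`, `fitness`, `select_variables`), the population size, the mutation rate, and the fixed number of generations are all assumptions. In particular, the fitness used here is the area under the ROC curve minus a flat per-variable penalty, which is a simplified stand-in for the paper's combination of predictive power and statistical significance of the individual factors.

```python
import math
import random

def fit_logistic(X, y, iters=100, lr=0.5):
    """Fit a logistic model by batch gradient ascent; returns [bias, w1, ...]."""
    n, p = len(X), len(X[0])
    w = [0.0] * (p + 1)
    for _ in range(iters):
        grad = [0.0] * (p + 1)
        for row, yi in zip(X, y):
            z = w[0] + sum(wj * xj for wj, xj in zip(w[1:], row))
            z = max(-30.0, min(30.0, z))  # keep exp() in a safe range
            err = yi - 1.0 / (1.0 + math.exp(-z))
            grad[0] += err
            for j, xj in enumerate(row):
                grad[j + 1] += err * xj
        w = [wj + lr * g / n for wj, g in zip(w, grad)]
    return w

def auc(scores, y):
    """Area under the ROC curve via pairwise comparison (ties count one half)."""
    pos = [s for s, yi in zip(scores, y) if yi == 1]
    neg = [s for s, yi in zip(scores, y) if yi == 0]
    wins = sum((a > b) + 0.5 * (a == b) for a in pos for b in neg)
    return wins / (len(pos) * len(neg))

def fitness(mask, X, y, penalty=0.02):
    """Assumed fitness: AUC of the fitted sub-model minus a per-variable penalty."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty model predicts nothing
    Xs = [[row[j] for j in cols] for row in X]
    w = fit_logistic(Xs, y)
    scores = [w[0] + sum(wj * xj for wj, xj in zip(w[1:], r)) for r in Xs]
    return auc(scores, y) - penalty * len(cols)

def select_variables(X, y, pop_size=12, generations=15, p_mut=0.1, seed=0):
    """Evolve 0/1 masks over the columns of X; returns the best mask found."""
    rng = random.Random(seed)
    n_vars = len(X[0])  # assumes at least two candidate variables
    pop = [tuple(rng.randint(0, 1) for _ in range(n_vars)) for _ in range(pop_size)]
    cache = {}

    def score(mask):
        if mask not in cache:  # each distinct model is fitted only once
            cache[mask] = fitness(mask, X, y)
        return cache[mask]

    for _ in range(generations):
        pop.sort(key=score, reverse=True)
        parents = pop[: pop_size // 2]  # the best half survives unchanged
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randint(1, n_vars - 1)  # single-point crossover
            child = a[:cut] + b[cut:]
            # Mutation: flip each bit independently with probability p_mut.
            child = tuple(bit ^ (rng.random() < p_mut) for bit in child)
            children.append(child)
        pop = parents + children
    return max(pop, key=score)
```

Calling `select_variables(X, y)` with `X` as a list of rows of standardized candidate variables and `y` as a list of 0/1 outcomes returns a tuple of 0/1 flags, one per column, marking the selected variables. Caching the fitted models is what keeps the search cheap relative to fitting every possible combination, since the population tends to revisit the same subsets as it converges.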

Metrics

13 Record Views
20 citations in Scopus

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Domestic collaboration
Web of Science research areas
Radiology, Nuclear Medicine & Medical Imaging