Logo image
Natural Language Processing for Mimicking Clinical Trial Recruitment in Critical Care: A Semi-Automated Simulation Based on the LeoPARDS Trial
Journal article   Open access   Peer reviewed

Natural Language Processing for Mimicking Clinical Trial Recruitment in Critical Care: A Semi-Automated Simulation Based on the LeoPARDS Trial

Hegler C. Tissot, Anoop D. Shah, David Brealey, Steve Harris, Ruth Agbakoba, Amos Folarin, Luis Romao, Lukasz Roguski, Richard Dobson and Folkert W. Asselbergs
IEEE journal of biomedical and health informatics, v 24(10), pp 2950-2959
01 Oct 2020
PMID: 32149659
url
https://www.medrxiv.org/content/medrxiv/early/2019/09/26/19005603.full.pdfView

Abstract

Clinical trials Electric shock electronic medical records Health information management Informatics Medical diagnostic imaging Natural language processing patient monitoring Recruitment text processing Unified modeling language
Clinical trials often fail to recruit an adequate number of appropriate patients. Identifying eligible trial participants is resource-intensive when relying on manual review of clinical notes, particularly in critical care settings where the time window is short. Automated review of electronic health records (EHR) may help, but much of the information is in free text rather than a computable form. We applied natural language processing (NLP) to free text EHR data using the CogStack platform to simulate recruitment into the LeoPARDS study, a clinical trial aiming to reduce organ dysfunction in septic shock. We applied an algorithm to identify eligible patients using a moving 1-hour time window, and compared patients identified by our approach with those actually screened and recruited for the trial, for the time period that data were available. We manually reviewed records of a random sample of patients identified by the algorithm but not screened in the original trial. Our method identified 376 patients, including 34 patients with EHR data available who were actually recruited to LeoPARDS in our centre. The sensitivity of CogStack for identifying patients screened was 90% (95% CI 85%, 93%). Of the 203 patients identified by both manual screening and CogStack, the index date matched in 95 (47%) and CogStack was earlier in 94 (47%). In conclusion, analysis of EHR data using NLP could effectively replicate recruitment in a critical care trial, and identify some eligible patients at an earlier stage, potentially improving trial recruitment if implemented in real time.

Metrics

10 Record Views
44 citations in Scopus

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Domestic collaboration
International collaboration
Web of Science research areas
Computer Science, Information Systems
Computer Science, Interdisciplinary Applications
Mathematical & Computational Biology
Medical Informatics
Logo image