Logo image
Classifying Cancer Stage with Open-Source Clinical Large Language Models
Conference proceeding   Open access

Classifying Cancer Stage with Open-Source Clinical Large Language Models

Chia-Hsuan Chang, Mary M. Lucas, Grace Lu-Yao and Christopher C. Yang
2024 IEEE 12th International Conference on Healthcare Informatics (ICHI), pp 76-82
03 Jun 2024
url
https://arxiv.org/abs/2404.01589View

Abstract

cancer stage classification clinical large language model Large language models Medical services pathology report prompting Task analysis Training Training data Data Mining Pathology
Cancer stage classification is important for making treatment and care management plans for oncology patients. Information on staging is often included in unstructured form in clinical, pathology, radiology and other free-text reports in the electronic health record system, requiring extensive work to parse and obtain. To facilitate the extraction of this in-formation, previous NLP approaches rely on labeled training datasets, which are labor-intensive to prepare. In this study, we demonstrate that without any labeled training data, open-source clinical large language models (LLMs) can extract pathologic tumor-node-metastasis (pTNM) staging information from real-world pathology reports. Our experiments compare LLMs and a BERT-based model fine-tuned using the labeled data. Our findings suggest that while LLMs still exhibit subpar performance in Tumor (T) classification, with the appropriate adoption of prompting strategies, they can achieve comparable performance on Metastasis (M) classification and improved performance on Node (N) classification.

Metrics

14 Record Views
5 citations in Scopus

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Domestic collaboration
Web of Science research areas
Computer Science, Artificial Intelligence
Computer Science, Information Systems
Medical Informatics
Logo image