Journal article
Breast Cancer Symptom Clusters Derived From Social Media and Research Study Data Using Improved K-Medoid Clustering
IEEE transactions on computational social systems, v 3(2)
Jun 2016
PMID: 29152536
Featured in Collection: UN Sustainable Development Goals @ Drexel
Abstract
Most cancer patients, including patients with breast cancer, experience multiple symptoms simultaneously while receiving active treatment. Some symptoms tend to occur together and may be related, such as hot flashes and night sweats. Co-occurring symptoms may have a multiplicative effect on patients' functioning, mental health, and quality of life. Symptom clusters in the context of oncology were originally described as groups of three or more related symptoms. Some authors have suggested symptom clusters may have practical applications, such as the formulation of more effective therapeutic interventions that address the combined effects of symptoms rather than treating each symptom separately. Most studies that have sought to identify clusters in breast cancer survivors have relied on traditional research studies. Social media, such as online health-related forums, contain a bevy of user-generated content in the form of threads and posts, and could be used as a data source to identify and characterize symptom clusters among cancer patients. This paper seeks to determine patterns of symptom clusters in breast cancer survivors derived from both social media and research study data using improved K-medoid clustering. A total of 50,426 publicly available messages were collected from Medhelp.com and 653 questionnaires were collected as part of a research study. The network of symptoms built from social media was sparse compared with that of the research study data, making the social media data easier to partition. The proposed revised K-medoid clustering helps to improve the clustering performance by reassigning some of the symptoms with a negative average silhouette width (ASW) to other clusters after initial K-medoid clustering. This retains an overall nondecreasing ASW and avoids becoming trapped in local optima. The overall ASW, individual ASW, and improved interpretation of the final clustering solution suggest improvement.
The clustering results suggest that some symptom clusters are consistent across social media data and clinical data, such as gastrointestinal-related symptoms, menopausal symptoms, mood-change symptoms, cognitive impairment, and pain-related symptoms. We recommend an integrative approach taking advantage of both data sources. Social media data could provide context for the interpretation of clustering results derived from research study data, while research study data could compensate for the risk of lower precision and recall found using social media data.
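The reassignment step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a precomputed symmetric distance matrix `dist`, an initial labeling from any K-medoid run, at least two clusters, and that a symptom with negative silhouette width is moved to the other cluster with the smallest mean distance (the abstract does not say which cluster receives it). The function names are hypothetical.

```python
from statistics import mean


def silhouette_widths(dist, labels):
    """Silhouette width of every item; assumes at least two clusters."""
    n = len(labels)
    clusters = sorted(set(labels))
    widths = []
    for i in range(n):
        own = [j for j in range(n) if labels[j] == labels[i] and j != i]
        if not own:  # singleton cluster: width is defined as 0
            widths.append(0.0)
            continue
        a = mean(dist[i][j] for j in own)  # cohesion within own cluster
        b = min(  # separation from the closest other cluster
            mean(dist[i][j] for j in range(n) if labels[j] == c)
            for c in clusters if c != labels[i]
        )
        denom = max(a, b)
        widths.append((b - a) / denom if denom > 0 else 0.0)
    return widths


def nearest_other_cluster(dist, labels, i):
    """Assumed rule: the other cluster with the smallest mean distance to i."""
    n = len(labels)
    others = sorted(set(labels) - {labels[i]})
    return min(
        others,
        key=lambda c: mean(dist[i][j] for j in range(n) if labels[j] == c),
    )


def reassign_negative(dist, labels):
    """Move negative-width items only when the overall ASW increases."""
    labels = list(labels)
    best = mean(silhouette_widths(dist, labels))
    improved = True
    while improved:
        improved = False
        for i, s in enumerate(silhouette_widths(dist, labels)):
            if s >= 0:
                continue
            trial = list(labels)
            trial[i] = nearest_other_cluster(dist, labels, i)
            score = mean(silhouette_widths(dist, trial))
            if score > best:  # strict gain, so the loop must terminate
                labels, best, improved = trial, score, True
                break  # widths are stale after a move; recompute
    return labels
```

Accepting only moves that strictly raise the overall ASW keeps the ASW nondecreasing, as the abstract requires, and guarantees termination because there are finitely many labelings.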
Details
- Title
- Breast Cancer Symptom Clusters Derived From Social Media and Research Study Data Using Improved K-Medoid Clustering
- Creators
- Qing Ping - College of Computing & Informatics, Drexel, Philadelphia, PA, USA
- Christopher C Yang - College of Computing & Informatics, Drexel, Philadelphia, PA, USA
- Sarah A Marshall - Department of Biostatistical Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA
- Nancy E Avis - Department of Social Sciences and Health Policy, Wake Forest School of Medicine, Winston-Salem, NC, USA
- Edward H Ip - Department of Biostatistical Sciences and the Department of Social Sciences & Health Policy, Wake Forest School of Medicine, Winston-Salem, NC, USA
- Publication Details
- IEEE transactions on computational social systems, v 3(2)
- Publisher
- IEEE
- Grant note
- DIBBs-1443019; IIS-1650531; SES-1424875 / National Science Foundation (10.13039/100000001)
- P50-CA-180905-01; R21AG042761 / National Institutes of Health (10.13039/100000002)
- 17-01-1-0446 / U.S. Department of Defense (10.13039/100000005)
- Resource Type
- Journal article
- Language
- English
- Academic Unit
- Information Science
- Web of Science ID
- WOS:000433876300003
- Scopus ID
- 2-s2.0-84995520632
- Other Identifier
- 991014878161504721
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Web of Science research areas
- Computer Science, Cybernetics
- Computer Science, Information Systems