Journal article
Tracking word semantic change in biomedical literature
International journal of medical informatics (Shannon, Ireland), v 109, pp 76-86
Jan 2018
PMID: 29195709
Abstract
•The paper examines word semantic change using a 30-year biomedical literature data.•A set of representative words is obtained based on frequency and topic distribution.•A skip-gram language model is used to measure global and local semantic change.•Word meanings are more stable in the 2000s than they were in the 1980s and 1990s.•No evidence is found to support the law of parallel change or the law of conformity.
Up to this point, research on written scholarly communication has focused primarily on syntactic, rather than semantic, analyses. Consequently, we have yet to understand semantic change as it applies to disciplinary discourse. The objective of this study is to illustrate word semantic change in biomedical literature. To that end, we identify a set of representative words in biomedical literature based on word frequency and word-topic probability distributions. A word2vec language model is then applied to the identified words in order to measure word- and topic-level semantic changes. We find that for the selected words in PubMed, overall, meanings are becoming more stable in the 2000s than they were in the 1980s and 1990s. At the topic level, the global distance of most topics (19 out of 20 tested) is declining, suggesting that the words used to discuss these topics are stabilizing semantically. Similarly, the local distance of most topics (19 out of 20) is also declining, showing that the meanings of words from these topics are becoming more consistent with those of their semantic neighbors. At the word level, this paper identifies two different trends in word semantics, as measured by the aforementioned distance metrics: on the one hand, words can form clusters with their semantic neighbors, and these words, as a cluster, coevolve semantically; on the other hand, words can drift apart from their semantic neighbors while nonetheless stabilizing in the global context. In relating our work to language laws on semantic change, we find no overwhelming evidence to support either the law of parallel change or the law of conformity.
Metrics
Details
- Title
- Tracking word semantic change in biomedical literature
- Creators
- Erjia Yan - College of Computing and Informatics, 3141 Chestnut Street, Drexel University, Philadelphia, PA 19104, USAYongjun Zhu - Weill Cornell Medicine, Cornell University, New York, NY 10065, USA
- Publication Details
- International journal of medical informatics (Shannon, Ireland), v 109, pp 76-86
- Publisher
- Elsevier
- Resource Type
- Journal article
- Language
- English
- Academic Unit
- Information Science
- Web of Science ID
- WOS:000416954800010
- Scopus ID
- 2-s2.0-85034051420
- Other Identifier
- 991014976821804721
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Collaboration types
- Domestic collaboration
- Web of Science research areas
- Computer Science, Information Systems
- Health Care Sciences & Services
- Medical Informatics