Logo image
Assessing the impact of software on science: A bootstrapped learning of software entities in full-text papers
Journal article   Peer reviewed

Assessing the impact of software on science: A bootstrapped learning of software entities in full-text papers

Xuelian Pan, Erjia Yan, Qianqian Wang and Weina Hua
Journal of informetrics, v 9(4), pp 860-871
Oct 2015

Abstract

Bootstrapping Information extraction Software Citation analysis Entity extraction Software citation
•We propose an improved bootstrapping method to extract software entities from full-text papers.•A positive correlation is found between the number of mentions and the number citations.•Software is widely used in the science community along with a substantial uncitedness.•The 80/20 rule has been found in software mentions and citations. Although software has helped researchers conduct research, little is known of the impact of software on science. To fill this gap, this article proposes an improved bootstrapping method to extract software entities from full-text papers and assess their impact on science. Evaluation results show that the proposed entity extraction system outperforms three baseline methods on extracting software entities from full-text papers. The proposed method is then used to learn software entities from all papers published in PLoS ONE in 2014. More than 2000 unique software entities are obtained which accounted for more than 20,000 mentions and more than 7000 citations. The paper finds that software is commonly used in the scientific community along with a substantial uncitedness.

Metrics

11 Record Views
58 citations in Scopus
44 readers on Mendeley
1 readers on CiteULike

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#16 Peace, Justice and Strong Institutions
#3 Good Health and Well-Being

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Domestic collaboration
International collaboration
Web of Science research areas
Computer Science, Interdisciplinary Applications
Information Science & Library Science
Logo image