Conference proceeding
Analysis of the term 'big data': Usage in biomedical publications
Proceedings - 2017 IEEE International Conference on Big Data, Big Data 2017, Vol.2018-January, pp.1253-1258
2018
Abstract
The meaning of the term 'Big Data' is still subject to debate, in spite of being widely used in biomedical publications. This confusion in definition leads to missed opportunities for peers to exchange knowledge and practices. A better understanding of Big Data may help researchers to identify themselves with the Big Data community. In this study, we investigate the most distinguishing features in 'Big Data'-labelled publications by comparing them against publication without this label (non-Big Data), using text mining and machine learning methods. Furthermore, the usage of the term Big Data was analysed over time. Our models could successfully make a distinction between publications labelled with 'Big Data' and those without. The most distinguishing features consisted of terms such as 'omics', 'computing', 'storage', and 'mining'. We observed that publications that do not use the term Big Data may also address topics that fall under well-accepted definitions of Big Data. Trends suggest that, while the use of the term Big Data increased, it is used less reliably now as compared to earlier years.
Metrics
7 Record Views
Details
- Title
- Analysis of the term 'big data': Usage in biomedical publications
- Creators
- A. J van AltenaP. D MoerlandA. H ZwindermanS. D OlabarriagaZoran ObradovicRicardo Baeza-YatesJeremy KepnerRaghunath NambiarChonggang WangMasashi ToyodaToyotaro SuzumuraXiaohua HuAlfredo CuzzocreaJian TangHui ZangJian-Yun NieRumi Ghosh
- Publication Details
- Proceedings - 2017 IEEE International Conference on Big Data, Big Data 2017, Vol.2018-January, pp.1253-1258
- Conference
- 2017 IEEE International Conference on Big Data, Big Data 2017
- Publisher
- Institute of Electrical and Electronics Engineers Inc
- Number of pages
- 1
- Resource Type
- Conference proceeding
- Language
- English
- Academic Unit
- Information Science (Informatics)
- Identifiers
- 991019189139704721