Conference proceeding
Bacterial named entity recognition based on dictionary and conditional random field
2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), v 2017-, pp 439-444
Nov 2017
Abstract
Intensive computational efforts aim to discover large-scale microbial interactions from metagenomic abundance data; however, such inferred interactions are often difficult to validate without a manually curated dataset. Many small-scale microbial interactions are also reported in the literature with experimental confidence. Text mining can be employed to extract these interactions from biomedical literature, which could be a significant complement to abundance-based methods. The key tasks of text mining include named entity recognition and relation extraction. Named entity recognition identifies names of a specified type in text. We manually annotated a corpus of 1344 abstracts from the microbial literature for the task of bacterial named entity recognition. Six new features were added to the general features used in the biomedical field. Based on a bacterial dictionary and a conditional random field (CRF), a bacterial named entity recognition model was trained, achieving a precision of 89.118%, a recall of 81.598%, and an F-measure of 85.192%. The system and template are available at https://github.com/bluelilywxy/BacNER-V1.0.git.
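The three scores reported in the abstract can be cross-checked against each other. The sketch below assumes the F-measure is the balanced F1 score (the harmonic mean of precision and recall); the abstract does not state the weighting, so this is an assumption, and the function name `f_measure` is illustrative rather than taken from the BacNER code.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure: harmonic mean of precision and recall.

    Assumption: equal weighting (F1); the abstract only says "F-measure".
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values reported in the abstract, in percent.
precision = 89.118
recall = 81.598

f1 = f_measure(precision, recall)
print(f"{f1:.3f}")
```

Under that assumption, the computed value agrees with the reported F-measure of 85.192% to three decimal places.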
Metrics
12 citations in Web of Science
17 citations in Scopus
Details
- Title
- Bacterial named entity recognition based on dictionary and conditional random field
- Creators
- Xiaoyan Wang - Sch. of Comput., Central China Normal Univ., Wuhan, China
- Xingpeng Jiang - Central China Normal University
- Mengwen Liu - Coll. of Comput. & Inf., Drexel Univ., Philadelphia, PA, USA
- Tingting He - Sch. of Comput., Central China Normal Univ., Wuhan, China
- Xiaohua Hu - Sch. of Comput., Central China Normal Univ., Wuhan, China
- Publication Details
- 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), v 2017-, pp 439-444
- Publisher
- IEEE
- Resource Type
- Conference proceeding
- Language
- English
- Academic Unit
- Information Science
- Scopus ID
- 2-s2.0-85046284309
- Other Identifier
- 991019170575504721