Journal article
Evolutionary Characters, Phenotypes and Ontologies: Curating Data from the Systematic Biology Literature
PloS one, v 5(5), pp e10708-e10708
20 May 2010
PMID: 20505755
Featured in Collection : UN Sustainable Development Goals @ Drexel
Abstract
Background: The wealth of phenotypic descriptions documented in the published articles, monographs, and dissertations of phylogenetic systematics is traditionally reported in a free-text format, and it is therefore largely inaccessible for linkage to biological databases for genetics, development, and phenotypes, and difficult to manage for large-scale integrative work. The Phenoscape project aims to represent these complex and detailed descriptions with rich and formal semantics that are amenable to computation and integration with phenotype data from other fields of biology. This entails reconceptualizing the traditional free-text characters into the computable Entity-Quality (EQ) formalism using ontologies.
Methodology/Principal Findings: We used ontologies and the EQ formalism to curate a collection of 47 phylogenetic studies on ostariophysan fishes (including catfishes, characins, minnows, knifefishes) and their relatives with the goal of integrating these complex phenotype descriptions with information from an existing model organism database (zebrafish, http://zfin.org). We developed a curation workflow for the collection of character, taxonomic and specimen data from these publications. A total of 4,617 phenotypic characters (10,512 states) for 3,449 taxa, primarily species, were curated into EQ formalism (for a total of 12,861 EQ statements) using anatomical and taxonomic terms from teleost-specific ontologies (Teleost Anatomy Ontology and Teleost Taxonomy Ontology) in combination with terms from a quality ontology (Phenotype and Trait Ontology). Standards and guidelines for consistently and accurately representing phenotypes were developed in response to the challenges that were evident from two annotation experiments and from feedback from curators.
Conclusions/Significance: The challenges we encountered and many of the curation standards and methods for improving consistency that we developed are generally applicable to any effort to represent phenotypes using ontologies. This is because an ontological representation of the detailed variations in phenotype, whether between mutant or wildtype, among individual humans, or across the diversity of species, requires a process by which a precise combination of terms from domain ontologies are selected and organized according to logical relations. The efficiencies that we have developed in this process will be useful for any attempt to annotate complex phenotypic descriptions using ontologies. We also discuss some ramifications of EQ representation for the domain of systematics.
Metrics
Details
- Title
- Evolutionary Characters, Phenotypes and Ontologies: Curating Data from the Systematic Biology Literature
- Creators
- Wasila M. Dahdul - University of South DakotaJames P. Balhoff - University of North Carolina at Chapel HillJeffrey Engeman - University of South DakotaTerry Grande - Loyola University ChicagoEric J. Hilton - William & MaryCartik Kothari - University of North Carolina at Chapel HillHilmar Lapp - National Evolutionary Synthesis CenterJohn G. Lundberg - Drexel UniversityPeter E. Midford - National Evolutionary Synthesis CenterTodd J. Vision - Natl Evolutionary Synth Ctr, Durham, NC USAMonte Westerfield - University of OregonPaula M. Mabee - University of South Dakota
- Publication Details
- PloS one, v 5(5), pp e10708-e10708
- Publisher
- Public Library Science
- Number of pages
- 12
- Grant note
- EF-0423641 / National Evolutionary Synthesis Center, NSF P41HG002659 / NATIONAL HUMAN GENOME RESEARCH INSTITUTE; United States Department of Health & Human Services; National Institutes of Health (NIH) - USA; NIH National Human Genome Research Institute (NHGRI) DBI 0641025 / National Science Foundation; National Science Foundation (NSF) HG002659 / National Institutes of Health; United States Department of Health & Human Services; National Institutes of Health (NIH) - USA 1062542 / Div Of Biological Infrastructure; National Science Foundation (NSF); NSF - Directorate for Biological Sciences (BIO)
- Resource Type
- Journal article
- Language
- English
- Academic Unit
- Biodiversity, Earth, and Environmental Science (BEES); Academy of Natural Sciences of Drexel University
- Web of Science ID
- WOS:000278017300003
- Scopus ID
- 2-s2.0-77956229224
- Other Identifier
- 991019335320704721
UN Sustainable Development Goals (SDGs)
This publication has contributed to the advancement of the following goals:
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Collaboration types
- Domestic collaboration
- Web of Science research areas
- Zoology