Computational Curation and the Application of Large-Scale Vocabularies
Conference proceeding

Sam Grabus and Jane Greenberg
2021 IEEE International Conference on Big Data (Big Data), pp 2220-2223
15 Dec 2021

Abstract

Keywords: automatic curation; Big Data; Conferences; controlled vocabularies; Encyclopedias; lemmatization; natural language processing (NLP); Process control; stemming; Vocabulary
This paper presents an exploratory case study comparing stemming and lemmatization for the automatic application of large-scale controlled vocabularies to archival encyclopedia entries. The results report relative recall and precision for both approaches. The study shows that while stemming achieves higher relative recall, lemmatization yields a higher relevance score and eliminates over-stemming problems. The results provide insight into improving automatic curation workflows for archival resources.
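
The trade-off described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' pipeline: `toy_stem` is a crude suffix stripper (not the Porter stemmer), `toy_lemma` is a three-entry dictionary lookup, the sample sentence, vocabulary, and relevance judgements are invented, and relative recall is computed under one common definition (relevant terms found by a method divided by the relevant terms found by any method).

```python
# Toy illustration only: hypothetical stemmer, lemmatizer, and data.
SUFFIXES = ("ities", "ity", "ies", "al", "es", "e", "s", "y")
LEMMAS = {"universities": "university", "libraries": "library", "records": "record"}


def toy_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def toy_lemma(word):
    """Look the word up in a tiny lemma dictionary; unknown words pass through."""
    word = word.lower()
    return LEMMAS.get(word, word)


def match_terms(text, vocabulary, normalize):
    """Return the single-word vocabulary terms whose normalized form occurs in the text."""
    tokens = {normalize(token) for token in text.split()}
    return {term for term in vocabulary if normalize(term) in tokens}


def precision(found, relevant):
    """Share of the returned terms that were judged relevant."""
    return len(found & relevant) / len(found) if found else 0.0


def relative_recall(found, relevant, pooled):
    """Relevant terms found by one method over relevant terms found by any method."""
    pool = pooled & relevant
    return len(found & relevant) / len(pool) if pool else 0.0


text = "The universities kept archival records in libraries"
vocabulary = ["universe", "archive", "library"]  # invented controlled vocabulary
relevant = {"archive", "library"}                # invented human judgement

stem_hits = match_terms(text, vocabulary, toy_stem)
lemma_hits = match_terms(text, vocabulary, toy_lemma)
pooled = stem_hits | lemma_hits

for name, hits in (("stemming", stem_hits), ("lemmatization", lemma_hits)):
    print(name, sorted(hits), precision(hits, relevant),
          relative_recall(hits, relevant, pooled))
```

In this invented example the stemmer conflates "universities" with "universe" (over-stemming, which lowers precision) but also links "archival" to "archive" (which raises relative recall), while the dictionary lookup does neither. The numbers come from the toy data and say nothing about the magnitudes reported in the paper.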

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#4 Quality Education

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Web of Science research areas
Computer Science, Artificial Intelligence
Computer Science, Information Systems
Computer Science, Theory & Methods