Book chapter
Text Preprocessing
Practical Text Analytics, pp 45-59
20 Oct 2018
Abstract
This chapter starts the process of preparing text data for analysis. This chapter introduces the choices that can be made to cleanse text data, including tokenizing, standardizing and cleaning, removing stop words, and stemming. The chapter also covers advanced topics in text preprocessing, such as n-grams, part-of-speech tagging, and custom dictionaries. The text preprocessing decisions influence the text document representation created for analysis.
Metrics
35 Record Views
Details
- Title
- Text Preprocessing
- Creators
- Murugan AnandarajanChelsey HillThomas Nolan
- Publication Details
- Practical Text Analytics, pp 45-59
- Series
- Advances in Analytics and Data Science
- Publisher
- Springer International Publishing; Cham
- Resource Type
- Book chapter
- Language
- English
- Academic Unit
- Decision Sciences (and Management Information Systems); Bennett S. LeBow College of Business; Television (and Media) Management; Drexel University
- Other Identifier
- 991019551544304721