Logo image
New search Researchers Research units
Sign in
Term-Document Representation
Book chapter

Term-Document Representation

Murugan Anandarajan, Chelsey Hill and Thomas Nolan
Practical Text Analytics, pp 61-73
20 Oct 2018

Abstract

Document frequency Document weighting Document-term matrix Inverse document frequency Inverted index Log frequency Term frequency Term frequency-inverse document frequency Term weighting Term-document matrix Weighting
This chapter details the process of converting documents into an analysis-ready term-document representation. Preprocessed text documents are first transformed into an inverted index for demonstrative purposes. Then, the inverted index is manipulated into a term-document or document-term matrix. The chapter concludes with descriptions of different weighting schemas for analysis-ready term-document representation.

Metrics

8 Record Views

Details

Logo image