Conference proceeding
Identifying Strategic Information from Scientific Articles through Sentence Classification
6th International Conference on Language Resources and Evaluation Conference (LREC-08), pp.1518-1522
30 May 2008
Abstract
We address here the need to assist users in rapidly accessing the most important or strategic information in the text corpus by identifying sentences carrying specific information. More precisely, we want to identify contribution of authors of scientific papers through a categorization of sentences using rhetorical and lexical cues. We built local grammars to annotate sentences in the corpus according to their rhetorical status: objective, new things, results, findings, hypotheses, conclusion, related_word, future work. The annotation is automatically projected automatically onto two other corpora to test their portability across several domains. The local grammars are implemented in the Unitex system. After sentence categorization, the annotated sentences are clustered and users can navigate the result by accessing specific information types. The results can be used for advanced information retrieval purposes.
Metrics
6 Record Views
Details
- Title
- Identifying Strategic Information from Scientific Articles through Sentence Classification
- Creators
- Fidelia Ibekwe-Sanjuan - Equipe de recherche de Lyon en sciences de l'information et de la communicationChaomei Chen - iSchool, University of Drexel, PhiladelphiaPinho Roberto - Department of Computer Science
- Publication Details
- 6th International Conference on Language Resources and Evaluation Conference (LREC-08), pp.1518-1522
- Conference
- 6th International Conference on Language Resources and Evaluation Conference (LREC-08), 6th
- Publisher
- ELDA
- Number of pages
- 1
- Resource Type
- Conference proceeding
- Language
- English
- Academic Unit
- Information Science (Informatics)
- Identifiers
- 991019173434404721