Book chapter
PIES: A Web Information Extraction System Using Ontology and Tag Patterns
Advances in Web-Age Information Management, pp 688-693
2005
Abstract
We propose a new web information extraction system, PIES, to convert web information into XML documents. PIES uses a user-specified ontology and HTML tag pattern descriptions. The ontology validates the web information the pattern descriptions extract. We designed a new language to describe HTML tag patterns and extraction rules. We implemented PIES and applied it to the US patent web site for evaluation.
Metrics
Details
- Title
- PIES: A Web Information Extraction System Using Ontology and Tag Patterns
- Creators
- Byung-Kwon Park - Dong-A UniversityHyoil Han - Drexel UniversityIl-Yeol Song - Drexel University
- Publication Details
- Advances in Web-Age Information Management, pp 688-693
- Series
- Lecture Notes in Computer Science
- Publisher
- Springer Berlin Heidelberg; Berlin, Heidelberg
- Resource Type
- Book chapter
- Language
- English
- Academic Unit
- Information Science
- Web of Science ID
- WOS:000233385300065
- Scopus ID
- 2-s2.0-33646520808
- Other Identifier
- 991019184072704721
InCites Highlights
Data related to this publication, from InCites Benchmarking & Analytics tool:
- Collaboration types
- Domestic collaboration
- International collaboration
- Web of Science research areas
- Computer Science, Information Systems
- Computer Science, Theory & Methods