Logo image
PIES: A Web Information Extraction System Using Ontology and Tag Patterns
Book chapter   Peer reviewed

PIES: A Web Information Extraction System Using Ontology and Tag Patterns

Byung-Kwon Park, Hyoil Han and Il-Yeol Song
Advances in Web-Age Information Management, pp 688-693
2005

Abstract

We propose a new web information extraction system, PIES, to convert web information into XML documents. PIES uses a user-specified ontology and HTML tag pattern descriptions. The ontology validates the web information the pattern descriptions extract. We designed a new language to describe HTML tag patterns and extraction rules. We implemented PIES and applied it to the US patent web site for evaluation.

Metrics

11 Record Views
3 citations in Scopus

Details

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Domestic collaboration
International collaboration
Web of Science research areas
Computer Science, Information Systems
Computer Science, Theory & Methods
Logo image