Conference proceeding
A link classification based approach to website topic hierarchy generation
Proceedings of the 16th international conference on World Wide Web, pp 1127-1128
08 May 2007
Abstract
Hierarchical models are commonly used to organize a Website's content. A Website's content structure can be represented by a topic hierarchy, a directed tree rooted at a Website's homepage in which the vertices and edges correspond to Web pages and hyperlinks. In this work, we propose a new method for constructing the topic hierarchy of a Website. We model the Website's link structure using weighted directed graph, in which the edge weights are computed using a classifier that predicts if an edge connects a pair of nodes representing a topic and a sub-topic. We then pose the problem of building the topic hierarchy as finding the shortest-path tree and directed minimum spanning tree in the weighted graph. We've done extensive experiments using real Websites and obtained very promising results.
Metrics
13 Record Views
9 citations in Scopus
Details
- Title
- A link classification based approach to website topic hierarchy generation
- Creators
- Nan Liu - Chinese University of Hong KongChristopher C. Yang - Chinese University of Hong Kong
- Publication Details
- Proceedings of the 16th international conference on World Wide Web, pp 1127-1128
- Conference
- WWW'07: 16th International World Wide Web Conference (2007)
- Series
- ACM Conferences
- Publisher
- ACM
- Resource Type
- Conference proceeding
- Language
- English
- Academic Unit
- Information Science
- Scopus ID
- 2-s2.0-35348916020
- Other Identifier
- 991021855188904721