大连理工大学主页平台管理系统张宪超 Keywords extraction from web pages using semantic link analysis Home

Current position: Home >> Scientific Research >> Paper Publications

Keywords extraction from web pages using semantic link analysis

Release Time:2019-03-11 Hits:

Indexed by: Journal Article

Date of Publication: 2009-06-01

Journal: Journal of Information and Computational Science

Included Journals: Scopus、EI

Volume: 6

Issue: 3

Page Number: 1495-1503

ISSN: 15487741

Abstract: We firstly introduce the model of community identification to keywords extraction in web pages. Max- Flow algorithm can be used to identify a community in a local web graph which is concentrated on one topic. A web page contains a single relatively extensive topic too. Based on this observation, we treat words in a web page as nodes and relations between words as edges to construct a graph, rank words in the graph to find out some very important words as Seed-Keywords, and then input the graph and the Seed-Keywords to a modified version of Max-Flow algorithm to output a community, whose members are viewed as Target-Keywords. In this process, we do word sense disambiguation in a kind of context 'Topic-Block', whose precision is compared with a coarser-grained context, the whole web page and a finer-grained context, the basic element in HTML. The experiment results show that Topic-Block based word sense disambiguation is effective and Max-Flow algorithm can extract any number keywords adaptive to the size of web pages. 1548-7741/ Copyright ? 2009 Binary Information Press.

Prev One:基于k最相似聚类的子空间聚类算法

Next One:一种适用于大型站点的层次链接分析算法

Home

Scientific Research

Teaching Research

Awards and Honours

Enrollment Information

Student Information

My Album

Blog

Keywords extraction from web pages using semantic link analysis