MainComputersInternetSearching › Web-Page Summarization Using Clickthrough Data

Web-Page Summarization Using Clickthrough Data

Edit Page
Report
Scan day: 02 March 2014 UTC
29
Virus safety - good
Description: By Jian-Tao Sun, Dou Shen, HuaJun Zeng, Qiang Yang, Yuchang Lu and Zheng Chen. In: Proceedings of the 28th Annual International ACM SIGIR Conference, August 2005. The authors propose two adapted summarization methods that take advantage of the relationships discovered from clickthrough data. For those pages not covered by clickthrough data, they put forward a thematic lexicon approach to generate implicit knowledge. The methods are evaluated on a relatively small dataset consisting of manually annotated pages as well as a large dataset crawled from ODP.
Web-Page Summarization Using Clickthrough Data - Microsoft Research Web-Page Summarization Using Clickthrough Data Jian-Tao Sun, Dou Shen, Huajun Zeng, Qiang Yang, Yuchang Lu, and Zheng Chen Most previous Web-page summarization methods treat a Web page as plain text. However, such methods fail to un- cover the full knowledge associated with aWeb page to build a high-quality summary, because the Web contains many hidden relationships that are not used in these methods. Uncovering the inherent knowledge is important to building good Web-page summarizers. In this paper, we extract the extra knowledge from the clickthrough data of a Web search engine to improve Web-page summarization. We first ana- lyze the feasibility to utilize clickthrough data in text sum- marization, and then propose two adapted summarization methods that take advantage of the relationships discovered from the clickthrough data. For those pages not covered by the clickthrough data, we put forward a thematic lexi- con approach to generate implicit knowledge for them. Our methods are evaluated on a relatively small dataset consist- ing of manually annotated pages as well as a large dataset that is crawled from the Open Directory Project website. The experimental results indicate that significant improve- ments can be achieved through our proposed summarizer as compared with summarizers without using the clickthrough data.
Size: 1410 chars

Contact Information

Email:
Phone&Fax:
Address:
Extended:

WEBSITE Info

Page title:Web-Page Summarization Using Clickthrough Data - Microsoft Research
Keywords:
Description:
IP-address:131.107.65.14

WHOIS Info

NS
Name Server: NS1.MSFT.NET
Name Server: NS2.MSFT.NET
Name Server: NS3.MSFT.NET
Name Server: NS4.MSFT.NET
Name Server: NS5.MSFT.NET
WHOIS
Status: clientDeleteProhibited
Status: clientTransferProhibited
Status: clientUpdateProhibited
Status: serverDeleteProhibited
Status: serverTransferProhibited
Status: serverUpdateProhibited
Date
Creation Date: 02-may-1991
Expiration Date: 03-may-2021