ThinkMind // IMMM 2011, The First International Conference on Advances in Information Mining and Management // View article immm_2011_3_20_20111


Download full article

Mining Cross-document Relationships from Text

Authors:
Petr Knoth
Zdenek Zdrahal

Keywords: text mining; automatic link generation and typing; semantic similarity; digital libraries

Abstract:
The paper argues that automatic link generation and typing methods are needed to find and maintain cross-document links in large and growing textual collections. Such links are important to organise information and to support search and navigation. We present an experimental study on mining cross-document links from a collection of 5000 documents. We identify a set of link types and show that the value of semantic similarity can be used as a distinguishing indicator.

Pages: 55 to 60

Copyright: Copyright (c) IARIA, 2011

Publication date: October 23, 2011

Published in: conference

ISBN: 978-1-61208-162-5

Location: Barcelona, Spain

Dates: from October 23, 2011 to October 29, 2011

SERVICES CONTACT
2003 � ThinkMind. All rights reserved.
Read Terms of Service and Privacy Policy.